Ghosh et al., 2012 - Google Patents
A comparative study of performance of fpga based mel filter bank & bark filter bank
Ghosh et al., 2012
- Document ID
- 11522437609007698739
- Author
- Ghosh D
- Debnath D
- Bose S
- Publication year
- 2012
- Publication venue
- arXiv preprint arXiv:1206.1450
Snippet
The sensitivity of the human ear depends on frequency, which is resolved nonlinearly across the audio spectrum. Improving recognition performance with a similarly nonlinear approach requires a front-end design, as suggested by empirical evidence. A popular …
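As a rough illustration of the nonlinear frequency resolution the snippet refers to, the sketch below maps frequencies in Hz onto the mel and Bark scales. The formulas are commonly cited approximations (the O'Shaughnessy mel formula and the Zwicker and Terhardt Bark formula) and are an assumption here; the snippet does not say which definitions the paper uses. The function names `hz_to_mel` and `hz_to_bark` are hypothetical.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Approximate mel value for a frequency in Hz (O'Shaughnessy form).

    Assumed formula, not taken from the paper.
    """
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def hz_to_bark(f_hz: float) -> float:
    """Approximate Bark value for a frequency in Hz (Zwicker and Terhardt form).

    Assumed formula, not taken from the paper.
    """
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


if __name__ == "__main__":
    # Equal steps in Hz cover progressively smaller steps on both scales,
    # which is the nonlinear resolution a mel or Bark filter bank mimics.
    for f in (100, 500, 1000, 2000, 4000, 8000):
        print(f"{f:>5} Hz  mel={hz_to_mel(f):8.1f}  bark={hz_to_bark(f):5.2f}")
```

Both mappings are close to linear at low frequencies and compress toward the high end, so a filter bank spaced uniformly on either scale places narrow filters at low frequencies and wide ones at high frequencies.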
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/471—General musical sound synthesis principles, i.e. sound category-independent synthesis methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS
- G10H1/00—Details of electrophonic musical instruments
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Virtanen | | Sound source separation using sparse coding with temporal continuity objective |
Grimm et al. | | The master hearing aid: A PC-based platform for algorithm development and evaluation |
CN103730131B (en) | | The method and apparatus of speech quality evaluation |
BRPI0616903A2 (en) | | method for separating audio sources from a single audio signal, and, audio source classifier |
Enzinger et al. | | Empirical test of the performance of an acoustic-phonetic approach to forensic voice comparison under conditions similar to those of a real case |
Li et al. | | Speech transmission index from running speech: A neural network approach |
Ghosh et al. | | A comparative study of performance of fpga based mel filter bank & bark filter bank |
CN111696580A (en) | | Voice detection method and device, electronic equipment and storage medium |
Vaca et al. | | An open audio processing platform with zync fpga |
Vaca et al. | | Real-time automatic music transcription (AMT) with Zync FPGA |
Leong et al. | | An FPGA-based electronic cochlea |
Bank et al. | | Robust loss filter design for digital waveguide synthesis of string tones |
JP3918315B2 (en) | | Impulse response measurement method |
Primavera et al. | | Objective and subjective investigation on a novel method for digital reverberator parameters estimation |
CN110739006B (en) | | Audio processing method and device, storage medium and electronic equipment |
Al-Shamma et al. | | Employing FPGA accelerator in real-time speaker identification systems |
Srinivas et al. | | An efficient hardware architecture for detection of vowel-like regions in speech signal |
Schafer | | A survey of digital speech processing techniques |
Perez-Carrillo | | Statistical models for the indirect acquisition of violin bowing controls from audio analysis |
Kumar et al. | | Performance evaluation of a wavelet-based pitch detection scheme |
Singh | | pyAudioProcessing: Audio Processing, Feature Extraction, and Machine Learning Modeling. |
Ehkan et al. | | Hardware implementation of MFCC-based feature extraction for speaker recognition |
Mobini et al. | | An FPGA based implementation of G. 729 |
Ward et al. | | Real-time excitation based binaural loudness meters |
Paiva et al. | | The helmholtz resonator tree |