Mokhov, 2008 - Google Patents
Choosing best algorithm combinations for speech processing tasks in machine learning using MARFMokhov, 2008
- Document ID
- 7854362427931658161
- Author
- Mokhov S
- Publication year
- Publication venue
- Advances in Artificial Intelligence: 21st Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2008 Windsor, Canada, May 28-30, 2008 Proceedings 21
External Links
Snippet
This work reports experimental results in various speech processing tasks using an application based on the Modular Audio Recognition Framework (MARF) in terms of the best of the available algorithm configurations for each particular task. This study focuses on the …
- 238000010801 machine learning 0 title abstract description 5
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Neal et al. | Time-frequency segmentation of bird song in noisy acoustic environments | |
CN110457432B (en) | Interview scoring method, interview scoring device, interview scoring equipment and interview scoring storage medium | |
Huang et al. | Depression Detection from Short Utterances via Diverse Smartphones in Natural Environmental Conditions. | |
Rajisha et al. | Performance analysis of Malayalam language speech emotion recognition system using ANN/SVM | |
CN105702251B (en) | Reinforce the speech-emotion recognition method of audio bag of words based on Top-k | |
Ivanov et al. | Modulation Spectrum Analysis for Speaker Personality Trait Recognition. | |
Abercrombie et al. | ParlVote: A corpus for sentiment analysis of political debates | |
CN113256751B (en) | Voice-based image generation method, device, equipment and storage medium | |
CN110853648A (en) | Bad voice detection method and device, electronic equipment and storage medium | |
Mokhov | Choosing best algorithm combinations for speech processing tasks in machine learning using MARF | |
CN109448756A (en) | A kind of voice age recognition methods and system | |
Whitehill et al. | Whosecough: In-the-wild cougher verification using multitask learning | |
Chuchra et al. | A deep learning approach for splicing detection in digital audios | |
Luo et al. | Singing voice separation using spectro-temporal modulation features | |
Sephus et al. | Modulation spectral features: In pursuit of invariant representations of music with application to unsupervised source identification | |
Chen et al. | A robust feature extraction algorithm for audio fingerprinting | |
Tu et al. | Discriminative feature analysis based on the crossing level for leakage classification in water pipelines | |
Singhal et al. | Estimation of Accuracy in Human Gender Identification and Recall Values Based on Voice Signals Using Different Classifiers | |
Lei et al. | Robust scream sound detection via sound event partitioning | |
Zhang et al. | Computer-assisted sampling of acoustic data for more efficient determination of bird species richness | |
Mokhov | Experimental results and statistics in the implementation of the modular audio recognition framework’s API for text-independent speaker identification | |
Rochlani et al. | Machine Learning Approach for Detection of Speech Emotions for RAVDESS Audio Dataset | |
Cao et al. | Identification of electronic disguised voices in the noisy environment | |
Cheng et al. | A novel chicken voice recognition method using the orthogonal matching pursuit algorithm | |
Gosztolya et al. | Ensemble Bag-of-Audio-Words representation improves paralinguistic classification accuracy |