Mokhov, 2008 - Google Patents

Choosing best algorithm combinations for speech processing tasks in machine learning using MARF

Mokhov, 2008

Document ID: 7854362427931658161
Author: Mokhov S
Publication year: 2008
Publication venue: Advances in Artificial Intelligence: 21st Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2008 Windsor, Canada, May 28-30, 2008 Proceedings 21

External Links

Cited by

Snippet

This work reports experimental results in various speech processing tasks using an application based on the Modular Audio Recognition Framework (MARF) in terms of the best of the available algorithm configurations for each particular task. This study focuses on the …

Continue reading at link.springer.com (other versions)

238000010801 machine learning 0 title abstract description 5

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image

Similar Documents

Publication	Publication Date	Title
Neal et al.	2011	Time-frequency segmentation of bird song in noisy acoustic environments
CN110457432B (en)	2023-05-30	Interview scoring method, interview scoring device, interview scoring equipment and interview scoring storage medium
Huang et al.	2018	Depression Detection from Short Utterances via Diverse Smartphones in Natural Environmental Conditions.
Rajisha et al.	2016	Performance analysis of Malayalam language speech emotion recognition system using ANN/SVM
CN105702251B (en)	2019-10-22	Reinforce the speech-emotion recognition method of audio bag of words based on Top-k
Ivanov et al.	2012	Modulation Spectrum Analysis for Speaker Personality Trait Recognition.
Abercrombie et al.	2020	ParlVote: A corpus for sentiment analysis of political debates
CN113256751B (en)	2023-09-29	Voice-based image generation method, device, equipment and storage medium
CN110853648A (en)	2020-02-28	Bad voice detection method and device, electronic equipment and storage medium
Mokhov	2008	Choosing best algorithm combinations for speech processing tasks in machine learning using MARF
CN109448756A (en)	2019-03-08	A kind of voice age recognition methods and system
Whitehill et al.	2020	Whosecough: In-the-wild cougher verification using multitask learning
Chuchra et al.	2022	A deep learning approach for splicing detection in digital audios
Luo et al.	2014	Singing voice separation using spectro-temporal modulation features
Sephus et al.	2015	Modulation spectral features: In pursuit of invariant representations of music with application to unsupervised source identification
Chen et al.	2008	A robust feature extraction algorithm for audio fingerprinting
Tu et al.	2019	Discriminative feature analysis based on the crossing level for leakage classification in water pipelines
Singhal et al.	2022	Estimation of Accuracy in Human Gender Identification and Recall Values Based on Voice Signals Using Different Classifiers
Lei et al.	2016	Robust scream sound detection via sound event partitioning
Zhang et al.	2015	Computer-assisted sampling of acoustic data for more efficient determination of bird species richness
Mokhov	2008	Experimental results and statistics in the implementation of the modular audio recognition framework’s API for text-independent speaker identification
Rochlani et al.	2024	Machine Learning Approach for Detection of Speech Emotions for RAVDESS Audio Dataset
Cao et al.	2017	Identification of electronic disguised voices in the noisy environment
Cheng et al.	2015	A novel chicken voice recognition method using the orthogonal matching pursuit algorithm
Gosztolya et al.	2020	Ensemble Bag-of-Audio-Words representation improves paralinguistic classification accuracy