Mokhov, 2008 - Google Patents

Choosing best algorithm combinations for speech processing tasks in machine learning using MARF

Document ID
7854362427931658161
Author
Mokhov S
Publication year
2008
Publication venue
Advances in Artificial Intelligence: 21st Conference of the Canadian Society for Computational Studies of Intelligence, Canadian AI 2008, Windsor, Canada, May 28-30, 2008, Proceedings 21

Snippet

This work reports experimental results in various speech processing tasks using an application based on the Modular Audio Recognition Framework (MARF) in terms of the best of the available algorithm configurations for each particular task. This study focuses on the …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6267 Classification techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/26 Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image

Similar Documents

Publication Title
Neal et al. Time-frequency segmentation of bird song in noisy acoustic environments
CN110457432B (en) Interview scoring method, interview scoring device, interview scoring equipment and interview scoring storage medium
Huang et al. Depression Detection from Short Utterances via Diverse Smartphones in Natural Environmental Conditions.
Rajisha et al. Performance analysis of Malayalam language speech emotion recognition system using ANN/SVM
CN105702251B (en) Reinforce the speech-emotion recognition method of audio bag of words based on Top-k
Ivanov et al. Modulation Spectrum Analysis for Speaker Personality Trait Recognition.
Abercrombie et al. ParlVote: A corpus for sentiment analysis of political debates
CN113256751B (en) Voice-based image generation method, device, equipment and storage medium
CN110853648A (en) Bad voice detection method and device, electronic equipment and storage medium
Mokhov Choosing best algorithm combinations for speech processing tasks in machine learning using MARF
CN109448756A (en) A kind of voice age recognition methods and system
Whitehill et al. Whosecough: In-the-wild cougher verification using multitask learning
Chuchra et al. A deep learning approach for splicing detection in digital audios
Luo et al. Singing voice separation using spectro-temporal modulation features
Sephus et al. Modulation spectral features: In pursuit of invariant representations of music with application to unsupervised source identification
Chen et al. A robust feature extraction algorithm for audio fingerprinting
Tu et al. Discriminative feature analysis based on the crossing level for leakage classification in water pipelines
Singhal et al. Estimation of Accuracy in Human Gender Identification and Recall Values Based on Voice Signals Using Different Classifiers
Lei et al. Robust scream sound detection via sound event partitioning
Zhang et al. Computer-assisted sampling of acoustic data for more efficient determination of bird species richness
Mokhov Experimental results and statistics in the implementation of the modular audio recognition framework’s API for text-independent speaker identification
Rochlani et al. Machine Learning Approach for Detection of Speech Emotions for RAVDESS Audio Dataset
Cao et al. Identification of electronic disguised voices in the noisy environment
Cheng et al. A novel chicken voice recognition method using the orthogonal matching pursuit algorithm
Gosztolya et al. Ensemble Bag-of-Audio-Words representation improves paralinguistic classification accuracy