Meudt et al., 2013 - Google Patents

Multi classifier systems and forward backward feature selection algorithms to classify emotional coloured speech

Meudt et al., 2013

Document ID: 8346384541126204444
Author: Meudt S; Zharkov D; Kächele M; Schwenker F
Publication year: 2013
Publication venue: Proceedings of the 15th ACM on International conference on multimodal interaction

External Links

Cited by

Snippet

Systems for the recognition of psychological characteristics such as the emotional state in real world scenarios have to deal with several difficulties. Amongst those are unconstrained environments and uncertainties in one or several input channels. However a more crucial …

Continue reading at projet.liris.cnrs.fr (PDF) (other versions)

230000002996 emotional 0 title abstract description 10

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Zadeh et al.	2018	Memory fusion network for multi-view sequential learning
Meudt et al.	2013	Multi classifier systems and forward backward feature selection algorithms to classify emotional coloured speech
Yang et al.	2017	Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification
Mannepalli et al.	2016	FDBN: Design and development of Fractional Deep Belief Networks for speaker emotion recognition
Somandepalli et al.	2016	Online affect tracking with multimodal kalman filters
Yasmin et al.	2022	A rough set theory and deep learning-based predictive system for gender recognition using audio speech
Sidorov et al.	2014	Automatic recognition of personality traits: A multimodal approach
Sreevidya et al.	2022	Elder emotion classification through multimodal fusion of intermediate layers and cross-modal transfer learning
Jing et al.	2023	A deep interpretable representation learning method for speech emotion recognition
Karthikeyan	2022	Adaptive boosted random forest-support vector machine based classification scheme for speaker identification
Deb et al.	2016	Classification of speech under stress using harmonic peak to energy ratio
Yang et al.	2024	Speech emotion recognition based on multi-feature speed rate and LSTM
Fonnegra et al.	2018	Speech emotion recognition integrating paralinguistic features and auto-encoders in a deep learning model
Zhao et al.	2024	Knowledge enhancement for speech emotion recognition via multi-level acoustic feature
Paleari et al.	2010	Features for multimodal emotion recognition: An extensive study
Meudt et al.	2014	Enhanced autocorrelation in real world emotion recognition
Abel et al.	2016	A data driven approach to audiovisual speech mapping
Singhal et al.	2022	Estimation of Accuracy in Human Gender Identification and Recall Values Based on Voice Signals Using Different Classifiers
Yalamanchili et al.	2021	Survey on multimodal emotion recognition (mer) systems
Fonnegra et al.	2017	Speech emotion recognition based on a recurrent neural network classification model
Dropuljić et al.	2013	Estimation of emotional states enhanced by a priori knowledge
Lopez‐Otero et al.	2017	Influence of speaker de‐identification in depression detection
Tyagi et al.	2023	Semantic speech analysis using machine learning and deep learning techniques: a comprehensive review
Noor et al.	2017	Audio visual emotion recognition using cross correlation and wavelet packet domain features
Kim et al.	2018	A study on the Recommendation of Contents using Speech Emotion Information and Emotion Collaborative Filtering