Drygajlo, 2012 - Google Patents

Automatic speaker recognition for forensic case assessment and interpretation

Document ID: 8320378715142295493
Author: Drygajlo A
Publication year: 2012
Publication venue: Forensic Speaker Recognition: Law Enforcement and Counter-Terrorism

Snippet

Forensic speaker recognition (FSR) is the process of determining if a specific individual (suspected speaker) is the source of a questioned voice recording (trace). The forensic expert's role is to testify to the worth of the voice evidence by using, if possible, a quantitative …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G10L2021/0135—Voice conversion or morphing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Eyben et al.	2015	The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing
WO2021164147A1 (en)	2021-08-26	Artificial intelligence-based service evaluation method and apparatus, device and storage medium
Wang et al.	2020	Recognition of audio depression based on convolutional neural network and generative antagonism network model
Sefara	2019	The effects of normalisation methods on speech emotion recognition
Sethu et al.	2014	Speech based emotion recognition
Krothapalli et al.	2012	Neural network based feature transformation for emotion independent speaker identification
Parra-Gallego et al.	2022	Classification of emotions and evaluation of customer satisfaction from speech in real world acoustic environments
Alexander	2005	Forensic automatic speaker recognition using Bayesian interpretation and statistical compensation for mismatched conditions
Lopez-Otero et al.	2017	Depression Detection Using Automatic Transcriptions of De-Identified Speech
Scherer et al.	2009	Multimodal laughter detection in natural discourses
Deepa et al.	2022	Speech technology in healthcare
Koolagudi et al.	2017	Dravidian language classification from speech signal using spectral and prosodic features
Yücesoy	2022	Speaker age and gender classification using GMM supervector and NAP channel compensation method
Canovas et al.	2023	AI-driven Teacher Analytics: Informative Insights on Classroom Activities
Zourmand et al.	2013	Gender classification in children based on speech characteristics: using fundamental and formant frequencies of Malay vowels
Potapova et al.	2022	Forensic identification of foreign-language speakers by the method of structural-melodic analysis of phonograms
Mansour et al.	2015	Speaker recognition in emotional context
Safavi	2015	Speaker characterization using adult and children’s speech
Leuzzi et al.	2017	A Statistical Approach to Speaker Identification in Forensic Phonetics
CN111341346A (en)	2020-06-26	Language expression capability evaluation method and system for fusion depth language generation model
Jaiswal et al.	2019	A generative adversarial network based ensemble technique for automatic evaluation of machine synthesized speech
Sukvichai et al.	2021	Automatic speech recognition for Thai sentence based on MFCC and CNNs
Gomes et al.	2021	Person identification based on voice recognition
Singh et al.	2015	Automatic articulation error detection tool for Punjabi language with aid for hearing impaired people