Walther, 2024 - Google Patents

An AI-Based Framework for Speech and Voice Analytics to Automatically Assess the Quality of Service Conversations

Document ID: 9801440164933055507
Author: Walther M
Publication year: 2024
Publication venue: Artificial intelligence in application: Legal aspects, application potentials and use scenarios

Snippet

In this chapter, an innovative two-stage classification framework is presented that can predict quality-inducing criteria in call center conversations with explainable rules based on multiple models for speech expression. Through this basic classification, a symbolic representation …

Continue reading at link.springer.com (other versions)

Classifications

- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/08—Speech classification or search
          - G10L15/18—Speech classification or search using natural language modelling
            - G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L17/00—Speaker identification or verification
        - G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
        - G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
          - G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
            - G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/26—Speech to text systems
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
          - G10L15/065—Adaptation
            - G10L15/07—Adaptation to the speaker
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
        - G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
          - G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
        - G10L21/003—Changing voice quality, e.g. pitch or formants
          - G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
            - G10L21/013—Adapting to target pitch
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L13/00—Speech synthesis; Text to speech systems
        - G10L13/02—Methods for producing synthetic speech; Speech synthesisers
          - G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/20—Handling natural language data
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
      - G06Q10/00—Administration; Management

Similar Documents

Publication	Publication Date	Title
US11380327B2 (en)	2022-07-05	Speech communication system and method with human-machine coordination
US10038784B2 (en)	2018-07-31	System and method for providing agent guidance
Eyben et al.	2015	The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing
US20190253558A1 (en)	2019-08-15	System and method to automatically monitor service level agreement compliance in call centers
Rachman et al.	2018	DAVID: An open-source platform for real-time transformation of infra-segmental emotional cues in running speech
Klaylat et al.	2018	Emotion recognition in Arabic speech
Bromuri et al.	2021	Using AI to predict service agent stress from emotion patterns in service interactions
US11735208B2 (en)	2023-08-22	Systems and methods for classification and rating of calls based on voice and text analysis
Kopparapu	2015	Non-linguistic analysis of call center conversations
US20230154457A1 (en)	2023-05-18	Communication System And Related Methods
Huang et al.	2016	Emotional speech feature normalization and recognition based on speaker-sensitive feature clustering
Potapova et al.	2022	Forensic identification of foreign-language speakers by the method of structural-melodic analysis of phonograms
Walther	2024	An AI-Based Framework for Speech and Voice Analytics to Automatically Assess the Quality of Service Conversations
Szekrényes et al.	2017	Classification of formal and informal dialogues based on turn-taking and intonation using deep neural networks
Walther et al.	2015	Towards a conversational expert system for rhetorical and vocal quality assessment in call center talks
CN118588112B (en)	2024-10-01	Alternating current state analysis method, equipment and medium for nonverbal signals
Favaro et al.	2022	ITAcotron 2: The Power of Transfer Learning in Expressive TTS Synthesis
Raptis et al.	2015	A framework towards expressive speech analysis and synthesis with preliminary results
Devi et al.	2023	Speech Recognition Via Machine Learning in Recording Studio
van Kesteren	2019	Predicting switching behavior on health insurer, by acoustic features from call center speech
Addagarla et al.	2023	Intelligent Call Prioritization Using Speech Emotion Recognition
KR20220075015A (en)	2022-06-07	Method for evaluating call counselor for inside sales and data analysis apparatus therefor
TR2024004192A2 (en)	2024-08-21	A SYSTEM THAT PROVIDES PERSONALIZED SERVICE DURING A CALL
EP2546790A1 (en)	2013-01-16	Computer-implemented system and method for assessing and utilizing user traits in an automated call center environment
Polzehl et al.	2015	Speech-Based Personality Assessment