Escher et al., 1999

User interactive MPEG-4 compatible facial animation system

Document ID: 6993110202823866792
Author: Escher M; Goto T; Kshirsagar S; Zanardi C; Thalmann N
Publication year: 1999
Publication venue: International Workshop on Synthetic-Natural Hybrid Coding and Three Dimensional Imaging (IWSNHC3DI'99), Santorini, Greece

Snippet

This paper describes the processes, and their interactions, needed to generate a virtual environment inhabited by a clone representing real people and by virtual autonomous actors. It requires communication between a cloned face (or avatar) and a virtual face. This needs the …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/205—3D [Three Dimensional] animation driven by audio data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition

Similar Documents

Publication	Publication Date	Title
US7844467B1 (en)	2010-11-30	System and method of providing conversational visual prosody for talking heads
US7136818B1 (en)	2006-11-14	System and method of providing conversational visual prosody for talking heads
US8224652B2 (en)	2012-07-17	Speech and text driven HMM-based body animation synthesis
CN114357135B (en)	2024-11-01	Interaction method, interaction device, electronic equipment and storage medium
US20120130717A1 (en)	2012-05-24	Real-time Animation for an Expressive Avatar
ES2230290T3 (en)	2005-05-01	Character animation
CN110688911A (en)	2020-01-14	Video processing method, device, system, terminal equipment and storage medium
Cosatto et al.	2003	Lifelike talking faces for interactive services
EP3915108B1 (en)	2023-11-29	Real-time generation of speech animation
CN116309984A (en)	2023-06-23	Mouth shape animation generation method and system based on text driving
Oralbayeva et al.	2024	Data-Driven Communicative Behaviour Generation: A Survey
Escher et al.	1999	User interactive MPEG-4 compatible facial animation system
CN115311731B (en)	2023-01-31	Expression generation method and device for sign language digital person
Kshirsagar et al.	1999	Multimodal animation system based on the MPEG-4 standard
Smid et al.	2004	Autonomous speaker agent
Verma et al.	2004	Animating expressive faces across languages
Cerezo et al.	2007	Interactive agents for multimodal emotional user interaction
Godenschweger et al.	1998	Modeling and generating sign language as animated line drawings
Mukashev et al.	2021	Facial expression generation of 3D avatar based on semantic analysis
Chollet et al.	2009	Multimodal human machine interactions in virtual and augmented reality
Chen et al.	2002	Text to avatar in multimodal human computer interface
Chae et al.	2020	Text-driven speech animation with emotion control
Magnenat Thalmann et al.	1998	Communicating with virtual characters
Al Moubayed et al.	2008	Multimodal feedback from robots and agents in a storytelling experiment
Karunaratne et al.	2006	Modelling and combining emotions, visual speech and gestures in virtual head models