Escher et al., 1999 - Google Patents
User interactive MPEG-4 compatible facial animation systemEscher et al., 1999
View PDF- Document ID
- 6993110202823866792
- Author
- Escher M
- Goto T
- Kshirsagar S
- Zanardi C
- Thalmann N
- Publication year
- Publication venue
- International Workshop on Synthetic-Natural Hybrid Coding and Three Dimensional Imaging (IWSNHC3DI'99), Santorini, Greece
External Links
Snippet
This paper describes different processes and their interactions needed to generate a virtual environment inhabited by a clone representing real people and virtual autonomous actors. It requires communication between a cloned face (or avatar) and virtual face. This needs the …
- 230000001815 facial 0 title abstract description 23
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/205—3D [Three Dimensional] animation driven by audio data
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7844467B1 (en) | System and method of providing conversational visual prosody for talking heads | |
US7136818B1 (en) | System and method of providing conversational visual prosody for talking heads | |
US8224652B2 (en) | Speech and text driven HMM-based body animation synthesis | |
CN114357135B (en) | Interaction method, interaction device, electronic equipment and storage medium | |
US20120130717A1 (en) | Real-time Animation for an Expressive Avatar | |
ES2230290T3 (en) | CHARACTER ANIMATION. | |
CN110688911A (en) | Video processing method, device, system, terminal equipment and storage medium | |
Cosatto et al. | Lifelike talking faces for interactive services | |
EP3915108B1 (en) | Real-time generation of speech animation | |
CN116309984A (en) | Mouth shape animation generation method and system based on text driving | |
Oralbayeva et al. | Data-Driven Communicative Behaviour Generation: A Survey | |
Escher et al. | User interactive MPEG-4 compatible facial animation system | |
CN115311731B (en) | Expression generation method and device for sign language digital person | |
Kshirsagar et al. | Multimodal animation system based on the MPEG-4 standard | |
Smid et al. | Autonomous speaker agent | |
Verma et al. | Animating expressive faces across languages | |
Cerezo et al. | Interactive agents for multimodal emotional user interaction | |
Godenschweger et al. | Modeling and generating sign language as animated line drawings | |
Mukashev et al. | Facial expression generation of 3D avatar based on semantic analysis | |
Chollet et al. | Multimodal human machine interactions in virtual and augmented reality | |
Chen et al. | Text to avatar in multimodal human computer interface | |
Chae et al. | Text-driven speech animation with emotion control | |
Magnenat Thalmann et al. | Communicating with virtual characters | |
Al Moubayed et al. | Multimodal feedback from robots and agents in a storytelling experiment | |
Karunaratne et al. | Modelling and combining emotions, visual speech and gestures in virtual head models |