[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

Escher et al., 1999 - Google Patents

User interactive MPEG-4 compatible facial animation system

Escher et al., 1999

View PDF
Document ID
6993110202823866792
Author
Escher M
Goto T
Kshirsagar S
Zanardi C
Thalmann N
Publication year
Publication venue
International Workshop on Synthetic-Natural Hybrid Coding and Three Dimensional Imaging (IWSNHC3DI'99), Santorini, Greece

External Links

Snippet

This paper describes different processes and their interactions needed to generate a virtual environment inhabited by a clone representing real people and virtual autonomous actors. It requires communication between a cloned face (or avatar) and virtual face. This needs the …
Continue reading at www.academia.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/403D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/2053D [Three Dimensional] animation driven by audio data
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/027Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/66Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition

Similar Documents

Publication Publication Date Title
US7844467B1 (en) System and method of providing conversational visual prosody for talking heads
US7136818B1 (en) System and method of providing conversational visual prosody for talking heads
US8224652B2 (en) Speech and text driven HMM-based body animation synthesis
CN114357135B (en) Interaction method, interaction device, electronic equipment and storage medium
US20120130717A1 (en) Real-time Animation for an Expressive Avatar
ES2230290T3 (en) CHARACTER ANIMATION.
CN110688911A (en) Video processing method, device, system, terminal equipment and storage medium
Cosatto et al. Lifelike talking faces for interactive services
EP3915108B1 (en) Real-time generation of speech animation
CN116309984A (en) Mouth shape animation generation method and system based on text driving
Oralbayeva et al. Data-Driven Communicative Behaviour Generation: A Survey
Escher et al. User interactive MPEG-4 compatible facial animation system
CN115311731B (en) Expression generation method and device for sign language digital person
Kshirsagar et al. Multimodal animation system based on the MPEG-4 standard
Smid et al. Autonomous speaker agent
Verma et al. Animating expressive faces across languages
Cerezo et al. Interactive agents for multimodal emotional user interaction
Godenschweger et al. Modeling and generating sign language as animated line drawings
Mukashev et al. Facial expression generation of 3D avatar based on semantic analysis
Chollet et al. Multimodal human machine interactions in virtual and augmented reality
Chen et al. Text to avatar in multimodal human computer interface
Chae et al. Text-driven speech animation with emotion control
Magnenat Thalmann et al. Communicating with virtual characters
Al Moubayed et al. Multimodal feedback from robots and agents in a storytelling experiment
Karunaratne et al. Modelling and combining emotions, visual speech and gestures in virtual head models