Zhang et al., 2024 - Google Patents

Hierarchical Feature Warping and Blending for Talking Head Animation

Zhang et al., 2024

Document ID: 9872442513670392327
Author: Zhang J; Liu C; Xian K; Cao Z
Publication year: 2024
Publication venue: IEEE Transactions on Circuits and Systems for Video Technology

External Links

Cited by

Snippet

Talking head animation transforms a source anime image to a target pose, where the transformation includes the change of facial expression and head movement. In contrast to existing approaches that operate on the low-resolution image (256× 256), we study this task …

Continue reading at ieeexplore.ieee.org (other versions)

238000002156 mixing 0 title abstract description 27

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/24—Indexing scheme for image data processing or generation, in general involving graphical user interfaces [GUIs]
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics

Similar Documents

Publication	Publication Date	Title
Liu et al.	2022	Semantic-aware implicit neural audio-driven video portrait generation
Yin et al.	2022	Styleheat: One-shot high-resolution editable talking face generation via pre-trained stylegan
Wang et al.	2020	A state-of-the-art review on image synthesis with generative adversarial networks
Liu et al.	2021	Generative adversarial networks for image and video synthesis: Algorithms and applications
He et al.	2019	Attgan: Facial attribute editing by only changing what you want
Li et al.	2021	Deep sketch-guided cartoon video inbetweening
Ye et al.	2024	Real3d-portrait: One-shot realistic 3d talking portrait synthesis
Zhang et al.	2020	Dual in-painting model for unsupervised gaze correction and animation in the wild
Xia et al.	2020	Controllable continuous gaze redirection
Bahmani et al.	2025	Tc4d: Trajectory-conditioned text-to-4d generation
Kim et al.	2023	Collaborative score distillation for consistent visual synthesis
Wang et al.	2024	UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Ouyang et al.	2022	Real-time neural character rendering with pose-guided multiplane images
Tang et al.	2023	3DFaceShop: Explicitly controllable 3D-aware portrait generation
Zhang et al.	2024	Hierarchical Feature Warping and Blending for Talking Head Animation
Huang et al.	2024	Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
Hwang et al.	2023	Faceclipnerf: Text-driven 3d face manipulation using deformable neural radiance fields
Zhong et al.	2025	Deco: Decoupled human-centered diffusion video editing with motion consistency
Sun et al.	2023	SSAT $++ $: A Semantic-Aware and Versatile Makeup Transfer Network With Local Color Consistency Constraint
Qiu et al.	2024	Relitalk: Relightable talking portrait generation from a single video
Xu et al.	2025	3d gaussian parametric head model
Huang et al.	2024	Efficient neural implicit representation for 3D human reconstruction
Zhao et al.	2023	Regional Traditional Painting Generation Based on Controllable Disentanglement Model
Zhang et al.	2023	Large motion anime head animation using a cascade pose transform network
Chen et al.	2023	Three stages of 3D virtual try-on network with appearance flow and shape field