Hierarchical Feature Warping and Blending for Talking Head Animation
Zhang et al., 2024
- Document ID
- 9872442513670392327
- Author
- Zhang J
- Liu C
- Xian K
- Cao Z
- Publication year
- 2024
- Publication venue
- IEEE Transactions on Circuits and Systems for Video Technology
Snippet
Talking head animation transforms a source anime image to a target pose, where the transformation includes the change of facial expression and head movement. In contrast to existing approaches that operate on the low-resolution image (256×256), we study this task …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G06T15/205—Image-based rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/24—Indexing scheme for image data processing or generation, in general involving graphical user interfaces [GUIs]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/40—Scaling the whole image or part thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Liu et al. | Semantic-aware implicit neural audio-driven video portrait generation | |
Yin et al. | Styleheat: One-shot high-resolution editable talking face generation via pre-trained stylegan | |
Wang et al. | A state-of-the-art review on image synthesis with generative adversarial networks | |
Liu et al. | Generative adversarial networks for image and video synthesis: Algorithms and applications | |
He et al. | Attgan: Facial attribute editing by only changing what you want | |
Li et al. | Deep sketch-guided cartoon video inbetweening | |
Ye et al. | Real3d-portrait: One-shot realistic 3d talking portrait synthesis | |
Zhang et al. | Dual in-painting model for unsupervised gaze correction and animation in the wild | |
Xia et al. | Controllable continuous gaze redirection | |
Bahmani et al. | Tc4d: Trajectory-conditioned text-to-4d generation | |
Kim et al. | Collaborative score distillation for consistent visual synthesis | |
Wang et al. | UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation | |
Ouyang et al. | Real-time neural character rendering with pose-guided multiplane images | |
Tang et al. | 3DFaceShop: Explicitly controllable 3D-aware portrait generation | |
Zhang et al. | Hierarchical Feature Warping and Blending for Talking Head Animation | |
Huang et al. | Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | |
Hwang et al. | Faceclipnerf: Text-driven 3d face manipulation using deformable neural radiance fields | |
Zhong et al. | Deco: Decoupled human-centered diffusion video editing with motion consistency | |
Sun et al. | SSAT++: A Semantic-Aware and Versatile Makeup Transfer Network With Local Color Consistency Constraint | |
Qiu et al. | Relitalk: Relightable talking portrait generation from a single video | |
Xu et al. | 3d gaussian parametric head model | |
Huang et al. | Efficient neural implicit representation for 3D human reconstruction | |
Zhao et al. | Regional Traditional Painting Generation Based on Controllable Disentanglement Model | |
Zhang et al. | Large motion anime head animation using a cascade pose transform network | |
Chen et al. | Three stages of 3D virtual try-on network with appearance flow and shape field |