Stars
📖 A curated list of resources dedicated to talking face.
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
[CSUR] A Survey on Video Diffusion Models
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
PyTorch code for "Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training"
Code for our AAAI'19 paper "Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos"
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalized Head Movement From Short Video and Speech Signal" (TMM 2022)
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
Official PyTorch implementation of PDAE (NeurIPS 2022)
The Official PyTorch Implementation for Face2Face^ρ (ECCV2022)
ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"
Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)
[NeurIPS'23] Parts of Speech–Grounded Subspaces in Vision-Language Models
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
[CVPR 2023, top-10%] Authors official PyTorch implementation of the "Attribute-preserving Face Dataset Anonymization via Latent Code Optimization".
An official implementation of "Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding" (CVPR 2023) in PyTorch.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
A GAN inversion toolbox based on PyTorch library. We design a unified pipeline for inversion methods and conduct a comprehensive benchmark.
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
[BMVC 2022] This is the official code of our Paper "Revisiting Self-Supervised Contrastive Learning for Facial Expression Recognition"
A collection of resources and papers on Diffusion Models