-
Shanghai Jiao Tong University
- Shanghai, China
-
03:48
(UTC +08:00)
Stars
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Wan: Open and Advanced Large-Scale Video Generative Models
Human Motion Video Generation: A Survey (https://www.techrxiv.org/users/836049/articles/1228135-human-motion-video-generation-a-survey)
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
Unofficial Implementation of Animate Anyone
Unofficial Implementation of Animate Anyone by Novita AI
Character Animation (AnimateAnyone, Face Reenactment)
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
[CVPR 2025] A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
The official Implementation of PeriodWave and PeriodWave-Turbo
Code for Motion Representations for Articulated Animation paper
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"