8000 wyhsirius (Yaohui) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View wyhsirius's full-sized avatar

Block or report wyhsirius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Video-GPT via Next Clip Diffusion.

Python 18 Updated May 21, 2025

[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models

Python 271 22 Updated May 17, 2025

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion (CVPR2025)

Python 121 8 Updated Mar 4, 2025

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,552 343 Updated May 20, 2025

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,827 188 Updated Apr 8, 2025

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 296 11 Updated Jun 9, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,003 58 Updated May 22, 2025

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 926 62 Updated Nov 13, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 937 63 Updated Nov 13, 2024

[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation

Python 20 1 Updated Oct 25, 2023

Training-Free Condition-Guided Text-to-Video Generation

Python 61 Updated Apr 14, 2025

Long-Term Rhythmic Video Soundtracker, ICML2023

Python 58 1 Updated Jul 5, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 12,048 1,147 Updated Jul 13, 2024

Official PyTorch implementation of LongVideoGAN

Python 320 30 Updated Nov 5, 2022

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,507 391 Updated May 7, 2025
Python 3,304 360 Updated Jun 10, 2023

3D-Aware Video Generation

Python 76 2 Updated Nov 15, 2022

[ICLR 22, TPAMI 24] LIA: Latent Image Animator

Python 626 66 Updated Jan 7, 2025

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,735 279 Updated Feb 15, 2023

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,265 783 Updated Oct 7, 2024

[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation

Python 530 52 Updated Jul 30, 2024

A curated list of awesome 3d generation papers

1,148 57 Updated Mar 9, 2023

Official PyTorch implementation of "Playable Environments: Video Manipulation in Space and Time", CVPR 2022

Python 72 10 Updated Oct 16, 2022

A curated list of resources on implicit neural representations.

2,526 143 Updated Feb 11, 2024

[WACV 2021]"Guided Attentive Feature Fusion for Multispectral Pedestrian Detection"

27 2 Updated Jan 13, 2021

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Python 112 11 Updated Jan 28, 2023

[BMVC 2021 Oral] Official implementation of our paper "A Unified Framework for Real-world Skeleton-based Action Recognition" on Toyota Smarthome/Penn Action/NTU-RGB+D/Posetics datasets

Python 50 10 Updated Sep 2, 2022

AutoML with MCTS

Python 17 6 Updated May 4, 2022

🔥 2D and 3D Face alignment library build using pytorch

Python 7,302 1,365 Updated Aug 30, 2024

Two time-scale update rule for training GANs

Jupyter Notebook 884 172 Updated Aug 22, 2021
Next
0