Stars
Python implementation of conversion between equirectangular, cubemap and perspective. (equirect2cube, cube2equirect, equirect2perspec)
A curated list of recent diffusion models for video generation, editing, and various other applications.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Enjoy the magic of Diffusion models!
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Official implementations for paper: VACE: All-in-One Video Creation and Editing
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
[CVPR 2025] Code for Segment Any Motion in Videos
[ICCV 2025] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
[CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation
[CVPR25 Highlight] Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A framework to easily use 32 (and growing) different image matching methods
[ICLR2025] The official implementation of Less is More: Masking Elements in Image Condition Features Avoids Content Leakages in Style Transfer Diffusion Models
Open-source Multi-agent Poster Generation from Papers
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Official PyTorch implementation for "FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis".