Highlights
- Pro
Stars
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer
real time face swap and one-click video deepfake with only a single image
Generative Models by Stability AI
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
A library for efficient similarity search and clustering of dense vectors.
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
A curated list of recent diffusion models for video generation, editing, and various other applications.
Code release for https://kovenyu.com/WonderWorld/
Annotations for the ScanNet dataset generated using scannotate and HOC-Search.
Repository for WACV23 paper "Automatically Annotating Indoor Images with CAD Models via RGB-D Scans"
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
SpatialLM: Large Language Model for Spatial Understanding
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
DiffusionFastForward: a free course and experimental framework for diffusion-based generative models
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
[CSUR] A Survey on Video Diffusion Models
[SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control