Starred repositories
[CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Dead simple FLUX LoRA training UI with LOW VRAM support
[CVPR 2025] Official implementation of "Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation"
[CVPR 2025 Oral & Best Paper Award Candidate] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
Next Generation Experimental Tracking for Machine Learning Operations
Official implementation of "UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes"
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
LAVIS - A One-stop Library for Language-Vision Intelligence
Official code repository of paper "D(R, O) Grasp: A Unified Representation of Robot and Object Interaction for Cross-Embodiment Dexterous Grasping"
Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
Efficient Triton Kernels for LLM Training
Repository for TetWeave: Isosurface Extraction using On-The-Fly Delaunay Tetrahedral Grids for Gradient-Based Mesh Optimization (SIGGRAPH 2025)
[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024
A Modular Toolkit for Robot Kinematic Optimization