Stars
OmniGen2: Exploration to Advanced Multimodal Generation.
This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.
Official inference repo for FLUX.1 models
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
One for All Modalities Evaluation Toolkit - including text, image, video, audio tasks.
Build-your-own DiffuserCam tutorial
[ICLR 2024] Controlling Vision-Language Models for Universal Image Restoration. 5th place in the NTIRE 2024 Restore Any Image Model in the Wild Challenge.
Official PyTorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
Code release for BiGS: Bidirectional Primitives for Relightable 3D Gaussian Splatting
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
This is the official repository for WF-Diff reproductions.
Corruption and Perturbation Robustness (ICLR 2019)
Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"
Official implementation of "Uni-Renderer: Unifying Rendering and Inverse Rendering Via Dual Stream Diffusion" [CVPR 2025]
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
[TVCG2024] PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction
(ICCV 2023) NEMTO: Neural Environment Matting for Novel View and Relighting Synthesis of Transparent Objects
Implementation of "GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors" (ICLR 2024)
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
A collection of reflection removal methods
Official implementation for "Single Image Reflection Separation via Component Synergy"
Code for the paper "Location-aware Single Image Reflection Removal"
[ICML'25] Official Implementation of "PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting"