Starred repositories
[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
[CVPR 2025 Oral & Award Candidate] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
[CVPR 2023] 😈BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields
[ECCV 2024] "BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting". ⚡Train a scene from real-world blurry images in minutes!
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Generative Agents: Interactive Simulacra of Human Behavior
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
A powerful tool for creating fine-tuning datasets for LLM
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
This is the official implementation of "Clustering based Point Cloud Representation Learning for 3D Analysis" (Accepted at ICCV 2023).
[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
ChatGLM3 series: Open Bilingual Chat LLMs
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
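《代码随想录》 LeetCode problem-solving guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with solutions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀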
StarCraft II Client - protocol definitions used to communicate with StarCraft II.
Jupyter notebook tutorials for MMSegmentation
ChatGLM-6B: An Open Bilingual Dialogue Language Model
[ECCV 2024] 3D World Model for Autonomous Driving
[CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries
A Unified Framework for Surface Reconstruction
[NeurIPS'22] MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction
[ICRA 2024] RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. (Early version: UniOcc)
[AAAI 2024] Official implementation of "SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation", and more.