Starred repositories
[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal
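《代码随想录》 LeetCode practice guide: a recommended solving order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, 50+ mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look; you will wish you had found it sooner! 🚀
A collection of AIGC-interview/CV-interview/LLMs-interview questions and answers, plus new ideas, new problems, new resources, and new projects from work and research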
A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models
A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)
ECCV'24, a novel attention-alike structural re-parameterization (ASR)
A framework for 4D reconstruction from monocular videos.
Distilling Neural Fields for Real-Time Articulated Shape Reconstruction (CVPR'23)
Janus-Series: Unified Multimodal Understanding and Generation Models
CoTracker is a model for tracking any point (pixel) on a video.
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
Code from the ECCV 2024 paper "Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos".
A collection of CS PhD application fee waivers for schools in North America
Example code for the FLAME 3D head model. The code demonstrates how to sample 3D heads from the model, fit the model to 3D keypoints and 3D scans.
Documents used for grad school application
Select a portrait, click to move the head around (please use your own space / GPU!)
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.