-
the Chinese University of Hong Kong, Shenzhen
Stars
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Efficient Part-level 3D Object Generation via Dual Volume Packing
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
AutoAudit—— the LLM for Cyber Security 网络安全大语言模型
Towards Robust Multimodal Sentiment Analysis with Incomplete Data
Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis (ALMT)
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
[CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. Semi-supervised VOS as well!
Make 2DGS Great Again!
GeoMaster: Advanced Geometry Enhancement for High-Resolution 3D Modeling
StableDelight: Revealing Hidden Textures by Removing Specular Reflections
[SIGGRAPH 2023, TPAMI 2024] Code for NeRF-Texture: Texture Synthesis with Neural Radiance Fields
[NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
Implementation of ECCV'24: GaussReg: Fast 3D Registration with Gaussian Splatting
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
CraftsMan: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
High-resolution models for human tasks.
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
EBStore dataset for "EMS: 3D Eyebrow Modeling from Single-view Images"(SIGGRAPH Aisa 2023)
[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal