-
City University of Hong Kong
- Hong Kong
-
23:53
(UTC +08:00) - https://quzefan.github.io
Highlights
- Pro
Stars
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion.
[CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion
[ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, …
[NeurIPS 2024] "DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation"
Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.
[CVPR 2024] Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2023] StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields
PyTorch code and models for the DINOv2 self-supervised learning method.
[AAAI'24] Official PyTorch implementation of the paper "FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields"
Blender plugin which generates a dataset for colmap by exporting blender camera poses and rendering scene.
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
This is the official code for the paper Tailor3D
Official repo for FaceShot: Bring Any Character into Life
[SIGGRAPH Asia 2024] StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting
Evolving ReID: Harnessing Large Pre-trained Models, Multi-Task Learning, Privacy-Preserving and Attack Techniques
[SIGGRAPH 2024] Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning
Official code for "Style Aligned Image Generation via Shared Attention"
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"