UCLA
- Los Angeles, California
- https://xuan-li.github.io/
- channel/UCcJTfc8FrR_lVUb1Y2tLvsw
- @xuanli917
- in/xuanli1030
Stars
A list of papers I have read, covering robotics, learning, and vision.
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
A latent text-to-image diffusion model
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Official repository for paper "MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement"
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
Simulation-Ready Garment Optimization with Differentiable Simulation
[CSUR] A Survey on Video Diffusion Models
An unofficial 2DGS implementation based on GauStudio
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Generate high-definition short videos with one click using large AI models.
PositionBasedDynamics is a library for the physically-based simulation of rigid bodies, deformable solids and fluids.
VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model
A curated list of papers and open-source resources focused on 3D AIGC.
A generative world for general-purpose robotics & embodied AI learning.
[CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator
[3DV-2025] Official implementation of "Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting"
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2024] PIE-NeRF🍕: Physics-based Interactive Elastodynamics with NeRF