-
ShanghaiTech University
- Shanghai, China
- https://zhaofuq.github.io/
- https://neudim.com/about
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
Generative Models by Stability AI
Open-Sora: Democratizing Efficient Video Production for All
HunyuanVideo: A Systematic Framework For Large Video Generation Model
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Self-reimplemented version of Long-LRM.
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
[NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model
[SIGGRAPH Asia & TOG 2024] This is the official implementation of our SIGGRAPH Asia journal artical: TEXGen: a Generative Diffusion Model for Mesh Textures
[SIGGRAPH Asia 2024] V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Render Gaussian Splats using Metal on Apple platforms (iOS/iPhone/iPad, macOS, and visionOS)
File format for 3D Gaussian splats. About 10x smaller than the PLY equivalent with virtually no perceptible loss in visual quality. Offered as open source by Niantic Labs. More details at https://s…
Inpaint anything using Segment Anything and inpainting models.
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Offical codes for "AutoVFX: Physically Realistic Video Editing from Natural Language Instructions."
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
⏰ AI conference deadline countdowns
[arXiv'24] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.