wangzheallen (Zhe Wang) / Starred · GitHub

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 804 42 Updated Mar 5, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,194 502 Updated May 18, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,632 936 Updated Aug 21, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,406 656 Updated May 31, 2024

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python 276 20 Updated May 17, 2024

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,895 124 Updated May 8, 2025

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,038 303 Updated Feb 27, 2025

Fast Diffusion Models with Transformers

Python 831 110 Updated Apr 1, 2025

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,731 497 Updated May 31, 2024

Modeling, training, eval, and inference code for OLMo

Python 5,667 614 Updated Jun 12, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,655 843 Updated Jul 18, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,113 1,029 Updated Apr 1, 2025

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,433 87 Updated Sep 7, 2023

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,401 272 Updated May 31, 2024

[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 821 40 Updated May 22, 2025

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 2,152 310 Updated Mar 3, 2025

An open-source framework for training large multimodal models.

Python 3,946 305 Updated Aug 31, 2024

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,147 707 Updated Jun 22, 2024

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 646 38 Updated Oct 22, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,323 600 Updated May 15, 2024

A simple, performant and scalable Jax LLM!

Python 1,760 360 Updated Jun 12, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,222 433 Updated Feb 19, 2025

[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Python 350 8 Updated Apr 14, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,488 853 Updated Jun 10, 2024

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 681 77 Updated Jun 26, 2024

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 310 41 Updated Jul 6, 2023

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with online demo and local deployment; code and models are fully open source; supports Windows, macOS, Linux)

Python 4,985 490 Updated Jul 17, 2023

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 486 81 Updated Nov 20, 2023

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Python 292 21 Updated Jan 22, 2025

Metric depth estimation from a single image

Jupyter Notebook 2,617 241 Updated May 5, 2025