wangzheallen

🎯

Focusing

Zhe Wang wangzheallen

🎯

Focusing

183 followers · 129 following

Computer Vision Researcher
SF, US
https://wangzheallen.github.io

Achievements

Highlights

Stars

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 804 42 Updated Mar 5, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 8,194 502 Updated May 18, 2025

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,632 936 Updated Aug 21, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 7,406 656 Updated May 31, 2024

DiT-3D / DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Python 276 20 Updated May 17, 2024

siliconflow / onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Jupyter Notebook 1,895 124 Updated May 8, 2025

facebookresearch / jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,038 303 Updated Feb 27, 2025

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 831 110 Updated Apr 1, 2025

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,731 497 Updated May 31, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 5,667 614 Updated Jun 12, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,655 843 Updated Jul 18, 2024

leptonai / search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,113 1,029 Updated Apr 1, 2025

rese1f / StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,433 87 Updated Sep 7, 2023

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,401 272 Updated May 31, 2024

OpenRobotLab / PointLLM

[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds

Python 821 40 Updated May 22, 2025

NVlabs / FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 2,152 310 Updated Mar 3, 2025

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,946 305 Updated Aug 31, 2024

MarkFzp / mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 4,147 707 Updated Jun 22, 2024

csuhan / OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 646 38 Updated Oct 22, 2024

MarkFzp / act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,323 600 Updated May 15, 2024

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 1,760 360 Updated Jun 12, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,222 433 Updated Feb 19, 2025

OpenGVLab / PonderV2

[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Python 350 8 Updated Apr 14, 2025

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,488 853 Updated Jun 10, 2024

exiawsh / StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 681 77 Updated Jun 26, 2024

Tsinghua-MARS-Lab / futr3d

Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection

Python 310 41 Updated Jul 6, 2023

OpenGVLab / DragGAN

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" （DragGAN 全功能实现，在线Demo，本地部署试用，代码、模型已全部开源，支持Windows, macOS, Linux）

Python 4,985 490 Updated Jul 17, 2023

DerryHub / BEVFormer_tensorrt

BEVFormer inference on TensorRT, including INT8 Quantization and Custom TensorRT Plugins (float/half/half2/int8).

Python 486 81 Updated Nov 20, 2023

jiawei-ren / diffmimic

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Python 292 21 Updated Jan 22, 2025

isl-org / ZoeDepth

Metric depth estimation from a single image

Jupyter Notebook 2,617 241 Updated May 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhe Wang wangzheallen

Achievements

Achievements

Highlights

Block or report wangzheallen

Stars

tianweiy / DMD2

FoundationVision / VAR

HumanAIGC / EMO

facebookresearch / DiT

DiT-3D / DiT-3D

siliconflow / onediff

facebookresearch / jepa

chuanyangjin / fast-DiT

gaomingqi / Track-Anything

allenai / OLMo

instantX-research / InstantID

leptonai / search_with_lepton

rese1f / StableVideo

MooreThreads / Moore-AnimateAnyone

OpenRobotLab / PointLLM

NVlabs / FoundationPose

mlfoundations / open_flamingo

MarkFzp / mobile-aloha

csuhan / OneLLM

MarkFzp / act-plus-plus

AI-Hypercomputer / maxtext

SJTU-IPADS / PowerInfer

OpenGVLab / PonderV2

artidoro / qlora

exiawsh / StreamPETR

Tsinghua-MARS-Lab / futr3d

OpenGVLab / DragGAN

DerryHub / BEVFormer_tensorrt

jiawei-ren / diffmimic

isl-org / ZoeDepth