8000 Yujun-Shi (Yujun Shi) / Starred · GitHub

More Web Proxy on the site http://driver.im/

Yujun-Shi

Follow

Yujun Shi Yujun-Shi

Follow

Research Scientist @ Meta GenAI, PhD @ NUS. Envision & Deliver.

159 followers · 77 following

National University of Singapore
https://yujun-shi.github.io/

Achievements

Achievements

Stars

PKU-YuanGroup / Next-Patch-Prediction

Python 134 3 Updated Jan 2, 2025

PKU-YuanGroup / ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 692 34 Updated Apr 22, 2025

showlab / ROICtrl

Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation

Python 108 Updated Apr 16, 2025

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

385 22 Updated Mar 8, 2025

magic-research / LightningDrag

Experiencing lightning fast (~1s) and accurate drag-based image editing

Python 74 4 Updated Oct 23, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,563 87 Updated Sep 27, 2024

PKU-YuanGroup / ChronoMagic-Bench

[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Python 202 15 Updated Apr 12, 2025

Yushi-Hu / VisualSketchpad

Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jupyter Notebook 213 12 Updated Oct 28, 2024

TencentARC / BrushNet

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,595 132 Updated Dec 17, 2024

lllyasviel / IC-Light

More relighting!

Python 8,011 492 Updated Feb 20, 2025

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 5,539 518 Updated Jan 22, 2025

CFGpp-diffusion / CFGpp

Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)

Python 206 6 Updated Mar 21, 2025

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,755 77 Updated Aug 15, 2024

PingchuanMa / SGA

[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Python 73 8 Updated May 31, 2024

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 29,504 4,135 Updated Nov 24, 2024

zqh0253 / 3DitScene

[ICLR 2025] 3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian Splatting

Python 227 11 Updated Nov 23, 2024

Tencent-Hunyuan / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,123 342 Updated Jan 13, 2025

yuweihao / MambaOut

MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)

Python 2,376 42 Updated Mar 9, 2025

LLaVA-VL / LLaVA-NeXT

Python 3,843 360 Updated May 6, 2025

yunshengtian / ASAP

[ICRA 2024] ASAP: Automated Sequence Planning for Complex Robotic Assembly with Physical Feasibility

C++ 67 14 Updated Feb 13, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,717 3,387 Updated Jan 26, 2025

video2game / video2game

Code release of Video2Game

JavaScript 319 22 Updated Apr 25, 2024

PKU-YuanGroup / MagicTime

[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,316 124 Updated Apr 12, 2025

magic-research / magic-boost

Python 126 7 Updated Aug 10, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,960 493 Updated May 18, 2025

facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,313 295 Updated Jan 21, 2025

bfshi / scaling_on_scales

When do we not need larger vision models?

Python 392 13 Updated Feb 8, 2025

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 3,086 189 Updated Oct 31, 2024

Yujun-Shi / DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

Python 1,215 93 Updated Jan 29, 2024

mhamilton723 / FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,510 88 Updated Jun 28, 2024

0