Stars
tulip-berkeley / open_clip
Forked from mlfoundations/open_clip. An open source implementation of CLIP (With TULIP Support).
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Official Repository of Absolute Zero Reasoner
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Official repository for "AM-RADIO: Reduce All Domains Into One"
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
A preview version of CharmBench, a novel multimodal reasoning benchmark.
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
MINT-1T: A one trillion token multimodal interleaved dataset.
Official training and inference code for the bitwise tokenizer.
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Code for the paper "SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation".
Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning
[CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text
A curated collection of awesome papers on the alignment of diffusion models.
Dimple, the first Discrete Diffusion Multimodal Large Language Model