Starred repositories
PyTorch implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.
An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).
[ICML 2025] Gaussian Mixture Flow Matching Models (GMFlow)
A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.
MAGI-1: Autoregressive Video Generation at Scale
SkyReels-V2: Infinite-length Film Generative model
Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers
Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful text-to-speech, image generation, and video generation APIs.
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
"FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching" — FlowAR employs the simplest scale design and is compatible with any VAE.
The official code of "Weak-to-Strong Diffusion with Reflection".
[ICLR 2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflection".
Wan: Open and Advanced Large-Scale Video Generative Models
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
This is the first paper to explore how to effectively use RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning ca…
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
CogView4, CogView3-Plus and CogView3(ECCV 2024)
A playbook for systematically maximizing the performance of deep learning models.
A framework for data augmentation for 2D and 3D image classification and segmentation
Muon optimizer: >30% sample efficiency gain with <3% wall-clock overhead
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation