XingtongGe

👋

Welcome

Do Mi So XingtongGe

Run, think, write

12 followers · 35 following

SenseTime Research, HKUST
Beijing
16:09 (UTC +08:00)
https://xingtongge.github.io/
in/xingtong-ge

Lists (1)

🔮 Future ideas

1 repository

Starred repositories

xypeng9903 / ncvsd

[ICML 2025] Noise Conditional Variational Score Distillation

Python 5 Updated Jun 20, 2025

guandeh17 / Self-Forcing

Python 2,064 127 Updated Jun 16, 2025

showlab / Show-o

[ICLR 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,483 64 Updated Jun 21, 2025

Vchitect / DCM

DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation

Python 157 10 Updated Jun 8, 2025

Peyton-Chen / Sparse-vDiT

The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv 2025)

38 Updated Jun 6, 2025

wangkai930418 / awesome-diffusion-categorized

Collection of diffusion model papers categorized by their subareas

1,780 81 Updated Jun 19, 2025

tulerfeng / Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 575 28 Updated May 28, 2025

YuzheZhang-1999 / DiffTSR

[CVPR2024] Diffusion-based Blind Text Image Super-Resolution (Official)

Python 145 8 Updated May 8, 2025

XingtongGe / SenseFlow

🚀 SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation

21 Updated Jun 3, 2025

yifan123 / flow_grpo

An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 770 27 Updated Jun 16, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,113 52 Updated Jun 13, 2025

bioinf-jku / TTUR

Two time-scale update rule for training GANs

Jupyter Notebook 885 172 Updated Aug 22, 2021

yuvalkirstain / PickScore

Python 515 29 Updated Dec 21, 2024

LAION-AI / aesthetic-predictor

A linear estimator on top of CLIP to predict the aesthetic quality of pictures

Jupyter Notebook 561 22 Updated Aug 15, 2022

THUDM / VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 264 6 Updated Mar 26, 2025

Karine-Huang / T2I-CompBench

[NeurIPS 2023 & TPAMI] T2I-CompBench (++) for Compositional Text-to-image Generation Evaluation

Python 263 11 Updated Apr 10, 2025

christophschuhmann / improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Python 1,116 102 Updated Jul 1, 2024

willisma / SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 873 52 Updated Mar 12, 2024

yang-song / score_sde_pytorch

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,941 333 Updated Jul 14, 2024

gnobitab / RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 1,292 72 Updated Jul 20, 2024

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 59,363 3,090 Updated Jun 4, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 11,095 803 Updated May 15, 2025

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 948 28 Updated Jun 12, 2025

liuff19 / Video-T1

Official Implementation of Video-T1: Test-Time Scaling for Video Generation

Python 267 15 Updated Apr 4, 2025

FoundationVision / Infinity

[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,336 72 Updated Apr 24, 2025

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,213 47 Updated Jun 6, 2025

XingtongGe / PreprocessingICM

[IEEE TCSVT 2024] Preprocessing Enhanced Image Compression for Machine Vision

Python 13 Updated Mar 23, 2025

lodestone-rock / flow

Python 115 15 Updated Jun 18, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,375 1,496 Updated Jun 13, 2025

CodeGoat24 / UnifiedReward

Official implementation of UnifiedReward & UnifiedReward-Think

Python 426 11 Updated Jun 18, 2025

Starred topics

joint-detection-and-tracking