tobran (MingTao(陶明)) / Starred · GitHub

MingTao(陶明) tobran

Ph.D. student. Research Interests: Generative Models, Vision-Language Model.

95 followers · 30 following

NanJing

Lists (4)

T2I-dataset

TGI

TGP

text-guided image inpainting

tools

Starred repositories

UCSC-VLAA / OpenVision

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 215 12 Updated May 15, 2025

360CVGroup / FG-CLIP

New generation of CLIP with fine grained discrimination capability, ICML2025

Python 90 3 Updated May 14, 2025

dongyangli-del / EEG_Image_decode

Using vision-language models to decode natural image perception from non-invasive brain recordings.

Jupyter Notebook 137 34 Updated Feb 7, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 13,154 1,117 Updated May 4, 2025

kylesargent / FlowMo

Official PyTorch implementation of FlowMo.

Jupyter Notebook 58 4 Updated Apr 7, 2025

zhaoshitian / LeX-Art

Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"

Python 59 3 Updated Apr 5, 2025

zhu-xlab / Copernicus-FM

Towards a Unified Copernicus Foundation Model for Earth Vision

Jupyter Notebook 56 2 Updated May 11, 2025

xiaomi-mlab / Orion

Official code of ORION

Python 240 15 Updated Apr 22, 2025

datawhalechina / easy-rl

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,289 2,029 Updated May 13, 2025

MightXiong / FedMIT

Python 11 1 Updated Mar 14, 2025

ALEEEHU / World-Simulator

Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repos…

250 14 Updated May 15, 2025

LeslieTrue / SFTvsRL

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 275 16 Updated Apr 28, 2025

VainF / TinyFusion

[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow

Python 115 1 Updated Apr 5, 2025

VARGPT-family / VARGPT

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Python 327 14 Updated Apr 17, 2025

jingyaogong / minimind

🚀🚀 [Large models] Train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 20,866 2,439 Updated Apr 30, 2025

LMD0311 / Awesome-World-Model

A collection of papers on World Models for Autonomous Driving (and Robotics).

973 34 Updated May 13, 2025

stepfun-ai / Step-Video-T2V

Python 2,932 277 Updated Mar 17, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,597 121 Updated May 4, 2025

hymie122 / RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,626 110 Updated Aug 20, 2024

zzz47zzz / awesome-lifelong-learning-methods-for-llm

[ACM Computing Surveys 2025] This repository collects awesome surveys, resources, and papers for Lifelong Learning with Large Language Models. (Updated Regularly)

127 6 Updated Feb 4, 2025

Alpha-VLLM / Lumina-Video

Python 237 11 Updated Mar 10, 2025

Alpha-VLLM / Lumina-Image-2.0

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Python 692 45 Updated May 11, 2025

Wang-ML-Lab / llm-continual-learning-survey

[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey

405 19 Updated May 16, 2025

facebookresearch / large_concept_model

Large Concept Models: Language modeling in a sentence representation space

Python 2,171 196 Updated Jan 29, 2025

huggingface / open-muse

Open reproduction of MUSE for fast text2image generation.

Python 351 28 Updated Jun 1, 2024

deepseek-ai / DeepSeek-R1

89,270 11,538 Updated Apr 9, 2025

kvfrans / shortcut-models

Python 501 15 Updated Dec 5, 2024

MaximeVandegar / Papers-in-100-Lines-of-Code

Implementation of papers in 100 lines of code.

Python 1,509 159 Updated May 10, 2025

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

439 9 Updated Jan 17, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,547 87 Updated Sep 27, 2024

Starred topics

text-to-image
