8000 tobran (MingTao(陶明)) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View tobran's full-sized avatar
  • NanJing

Block or report tobran

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 215 12 Updated May 15, 2025

New generation of CLIP with fine grained discrimination capability, ICML2025

Python 90 3 Updated May 14, 2025

Using vision-language models to decode natural image perception from non-invasive brain recordings.

Jupyter Notebook 137 34 Updated Feb 7, 2025

Lets make video diffusion practical!

Python 13,154 1,117 Updated May 4, 2025

Official PyTorch implementation of FlowMo.

Jupyter Notebook 58 4 Updated Apr 7, 2025

Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"

Python 59 3 Updated Apr 5, 2025

Towards a Unified Copernicus Foundation Model for Earth Vision

Jupyter Notebook 56 2 Updated May 11, 2025

Official code of ORION

Python 240 15 Updated Apr 22, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,289 2,029 Updated May 13, 2025
Python 11 1 Updated Mar 14, 2025

Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generative Models" and Awesome-Text2X-Resources. Watch this repos…

250 14 Updated May 15, 2025

Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Python 275 16 Updated Apr 28, 2025

[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow

Python 115 1 Updated Apr 5, 2025

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Python 327 14 Updated Apr 17, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 20,866 2,439 Updated Apr 30, 2025

Collect some World Models for Autonomous Driving (and Robotic) papers.

973 34 Updated May 13, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 1,597 121 Updated May 4, 2025

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,626 110 Updated Aug 20, 2024

[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)

127 6 Updated Feb 4, 2025
Python 237 11 Updated Mar 10, 2025

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Python 692 45 Updated May 11, 2025

[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey

405 19 Updated May 16, 2025

Large Concept Models: Language modeling in a sentence representation space

Python 2,171 196 Updated Jan 29, 2025

Open reproduction of MUSE for fast text2image generation.

Python 351 28 Updated Jun 1, 2024
Python 501 15 Updated Dec 5, 2024

Implementation of papers in 100 lines of code.

Python 1,509 159 Updated May 10, 2025

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

439 9 Updated Jan 17, 2025

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,547 87 Updated Sep 27, 2024
Next
0