AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

Python 284 10 Updated Nov 1, 2024

THUDM / VisionReward

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 237 6 Updated Mar 26, 2025

Fancy-MLLM / R1-Onevision

R1-onevision, a visual language model capable of deep CoT reasoning.

Python 515 14 Updated Apr 13, 2025

ICT-GoKnow / KnowCoder

Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose KnowCoder, the most powerful large language model so far for…

Python 80 10 Updated Aug 5, 2024

JimmyMa99 / SARChat

The first large-scale multimodal dialogue dataset focusing on Synthetic Aperture Radar (SAR) imagery.

Shell 51 2 Updated Feb 15, 2025

InternLM / Condor

27 3 Updated Feb 7, 2025

zhang-guangyi / t-udeepsc

Deep learning-based task-oriented and unified multi-task semantic communications

Python 76 18 Updated May 31, 2024

InternLM / OREAL

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Python 176 6 Updated Mar 20, 2025

ZiyuGuo99 / Image-Generation-CoT

[CVPR 2025] The First Investigation of CoT Reasoning in Image Generation

Python 673 20 Updated May 7, 2025

xie-lab-ml / Golden-Noise-for-Diffusion-Models

The code of our work "Golden Noise for Diffusion Models: A Learning Framework".

Python 154 8 Updated Feb 17, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 11,759 1,484 Updated Apr 24, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 1,773 148 Updated May 9, 2025

microsoft / rStar

Python 529 48 Updated Apr 15, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,505 1,324 Updated May 16, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 14,384 1,766 Updated May 16, 2025

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 7,825 666 Updated May 16, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,427 2,250 Updated May 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tackhwa

Achievements

Achievements

Block or report tackhwa

Stars

HiDream-ai / HiDream-I1

Junda24 / MonSter

browser-use / browser-use

google / A2A

getmaxun / maxun

juliangarnier / anime

IAAR-Shanghai / SEAP

ModalMinds / MM-EUREKA

Genesis-Embodied-AI / Genesis

FoundationAgents / OpenManus

jianzongwu / DiffSensei

vllm-project / aibrix

simplescaling / s1

mihirp1998 / AlignProp