grpotrainer

Here are 6 public repositories matching this topic...

Goekdeniz-Guelmez / mlx-lm-lora

Train Large Language Models on MLX.

training apple deep-learning ml mlx sft dpo grpo grpotrainer

Updated Jun 18, 2025
Python

GAD-cell / VLM_GRPO

An implementation of GRPO for training Unsloth's VLMs

reinforcement-learning vlm huggingface trl unsloth grpo grpotrainer

Updated Jun 12, 2025
Python

uw-nsl / TinyV

Your efficient and accurate answer verification system for RL training.

rl academic-project llm grpo grpotrainer

Updated Jun 11, 2025
Python

sdiehl / tiny-r1

Recreating the minimal training methods of DeepSeek-R1 for small language models.

reasoning r1 grpo grpotrainer

Updated Feb 10, 2025
Python

teilomillet / jiki

interface mcp model-context-protocol mcp-client grpo grpotrainer

Updated May 8, 2025
Python

yflyzhang / simpleR1

simpleR1: A Simple Framework for Training R1-like Models

reinforcement-learning deepseek-r1 grpo r1-zero grpotrainer

Updated Jun 20, 2025
Python
