10000 Ginray (Yinlei Sun) / Starred · GitHub

More Web Proxy on the site http://driver.im/

Ginray

Follow

🌶️

艰难，但相信

Yinlei Sun Ginray

🌶️

艰难，但相信

Follow

16 followers · 33 following

hangzhou

Achievements

Achievements

Stars

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 525 33 Updated Jul 2, 2025

Qihoo360 / Light-R1

Python 725 47 Updated May 30, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,371 160 Updated Mar 20, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 24,964 2,320 Updated Jul 3, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,268 707 Updated Jun 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,371 1,723 Updated Jul 5, 2025

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,630 350 Updated Jul 2, 2025

jsksxs360 / How-to-use-Transformers

Transformers 库快速入门教程

Python 1,555 187 Updated Sep 20, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,470 726 Updated Jul 4, 2025

modelscope / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,716 245 Updated Jul 4, 2025

luxonis / datadreamer

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 120 7 Updated May 16, 2025

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,029 54 Updated Feb 2, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

33,898 1,842 Updated Aug 1, 2024

Tencent / KsanaLLM

C++ 454 39 Updated Jun 27, 2025

TencentARC / mllm-npu

mllm-npu: training multimodal large language models on Ascend NPUs

Python 90 2 Updated Aug 29, 2024

liguodongiot / unify-easy-llm

unify-easy-llm（ULM）旨在打造一个简易的一键式大模型训练工具，支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。

Python 55 10 Updated Jul 26, 2024

pandada8 / llm-inference-benchmark

LLM 推理服务性能测试

Jupyter Notebook 42 5 Updated Dec 17, 2023

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,554 3,309 Updated Jul 5, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,216 4,453 Updated Jul 5, 2025

xai-org / grok-1

Grok open release

Python 50,300 8,353 Updated Aug 30, 2024

alibaba / EasyNLP

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,150 257 Updated Nov 27, 2024

reworkd / AgentGPT

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 34,471 9,450 Updated Apr 29, 2025

deepspeedai / Megatron-DeepSpeed

Forked from NVIDIA/Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,099 355 Updated Mar 24, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,454 2,014 Updated Jul 4, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

20,569 1,973 Updated May 19, 2025

Liang-ZX / VectorNet

Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

Jupyter Notebook 270 56 Updated May 26, 2022

zhiyuanyou / UniAD

[NeurIPS 2022 Spotlight] A Unified Model for Multi-class Anomaly Detection

Python 301 32 Updated Nov 22, 2022

HuangJunJie2017 / BEVDet

Code base of the BEVDet series .

Python 1,604 274 Updated Jul 4, 2024

TRI-ML / dd3d

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Python 481 75 Updated Nov 29, 2022

Sense-GVT / Fast-BEV

Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline

Python 722 94 Updated Sep 6, 2023

0