10000 Ginray (Yinlei Sun) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Ginray's full-sized avatar
🌶️
艰难,但相信
🌶️
艰难,但相信

Block or report Ginray

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 525 33 Updated Jul 2, 2025
Python 725 47 Updated May 30, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,371 160 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 24,964 2,320 Updated Jul 3, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,268 707 Updated Jun 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,371 1,723 Updated Jul 5, 2025

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,630 350 Updated Jul 2, 2025

Transformers 库快速入门教程

Python 1,555 187 Updated Sep 20, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 8,470 726 Updated Jul 4, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,716 245 Updated Jul 4, 2025

Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models

Python 120 7 Updated May 16, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,029 54 Updated Feb 2, 2025

LLM101n: Let's build a Storyteller

33,898 1,842 Updated Aug 1, 2024
C++ 454 39 Updated Jun 27, 2025

mllm-npu: training multimodal large language models on Ascend NPUs

Python 90 2 Updated Aug 29, 2024

unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。

Python 55 10 Updated Jul 26, 2024

LLM 推理服务性能测试

Jupyter Notebook 42 5 Updated Dec 17, 2023

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 41,554 3,309 Updated Jul 5, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 39,216 4,453 Updated Jul 5, 2025

Grok open release

Python 50,300 8,353 Updated Aug 30, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,150 257 Updated Nov 27, 2024

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 34,471 9,450 Updated Apr 29, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,099 355 Updated Mar 24, 2025

Train transformer language models with reinforcement learning.

Python 14,454 2,014 Updated Jul 4, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

20,569 1,973 Updated May 19, 2025

Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”

Jupyter Notebook 270 56 Updated May 26, 2022

[NeurIPS 2022 Spotlight] A Unified Model for Multi-class Anomaly Detection

Python 301 32 Updated Nov 22, 2022

Code base of the BEVDet series .

Python 1,604 274 Updated Jul 4, 2024

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Python 481 75 Updated Nov 29, 2022

Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline

Python 722 94 Updated Sep 6, 2023
Next
0