flyyufelix (Felix Yu) / Starred · GitHub
Python 392 38 Updated May 19, 2025

Have a natural, spoken conversation with AI!

Python 2,545 222 Updated May 17, 2025

Technical report of Kimina-Prover Preview.

291 11 Updated May 10, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 14,617 1,932 Updated Jun 15, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,517 557 Updated Jun 13, 2025

An MCP-based chatbot

C++ 15,129 2,896 Updated Jun 13, 2025

Python 710 46 Updated May 30, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 650 34 Updated Jun 11, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,981 145 Updated Jun 3, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,582 189 Updated Jun 6, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,078 686 Updated Jun 16, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 3,175 332 Updated Jun 16, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python 2,186 275 Updated May 11, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 40,578 3,218 Updated Jun 12, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,266 325 Updated May 18, 2025

s1: Simple test-time scaling

Python 6,442 750 Updated May 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,477 1,315 Updated Jun 16, 2025

Train transformer language models with reinforcement learning.

Python 14,184 1,970 Updated Jun 15, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 11,899 1,492 Updated Apr 24, 2025

Python 540 65 Updated Jan 2, 2025

Recipes to scale inference-time compute of open models

Python 1,093 117 Updated May 22, 2025

AllenAI's post-training codebase

Python 3,009 404 Updated Jun 15, 2025

Reverse Engineering the Abstraction and Reasoning Corpus

Jupyter Notebook 279 45 Updated Feb 24, 2025

Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge

C++ 157 27 Updated Jun 8, 2020

Distributed Training Over-The-Internet

938 40 Updated May 15, 2025

Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"

Python 140 17 Updated Dec 11, 2023

Jupyter Notebook 442 32 Updated Jul 22, 2024

Machine Learning Engineering Open Book

Python 14,036 848 Updated Jun 9, 2025

Go ahead and axolotl questions

Python 9,606 1,038 Updated Jun 16, 2025

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,485 195 Updated Nov 9, 2024