Stars
Have a natural, spoken conversation with AI!
Technical report of Kimina-Prover Preview.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper.
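As a rough illustration of what a sparse attention pattern looks like (not the repository's kernels or the paper's exact compressed/selected/sliding-window branches), the toy mask below combines a causal sliding window with coarse block-level coverage, so each query attends to far fewer keys than dense causal attention:

```python
import torch

def toy_sparse_mask(seq_len: int, window: int = 64, block: int = 32) -> torch.Tensor:
    """Boolean [seq_len, seq_len] mask: True means query i may attend to key j.

    Combines a causal sliding window (dense over recent tokens) with coarse
    block-level access (every query also sees one representative key per block).
    """
    q = torch.arange(seq_len).unsqueeze(1)   # query positions
    k = torch.arange(seq_len).unsqueeze(0)   # key positions
    causal = k <= q
    local = (q - k) < window                 # recent tokens, attended densely
    block_rep = (k % block == 0)             # one representative key per block
    return causal & (local | block_rep)

mask = toy_sparse_mask(256)
print(mask.float().mean())  # fraction of attended pairs, well below a dense causal mask
```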
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Search-R1: an efficient, scalable RL training framework for LLMs that interleave reasoning with search engine calls, built on veRL.
An easy-to-use, scalable, and high-performance RLHF framework built on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, async agent RL).
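For context on the GRPO variant listed above, the core idea is a group-relative advantage: sample several completions per prompt and normalize each reward against the others for the same prompt, so no learned value model is needed. A minimal sketch of that computation (not this framework's code) is:

```python
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """Group-relative advantages; rewards has shape [num_prompts, samples_per_prompt].

    Each sample's advantage is its reward standardized against the other
    samples drawn for the same prompt.
    """
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

rewards = torch.tensor([[1.0, 0.0, 0.0, 1.0],   # prompt 1: two of four samples correct
                        [0.0, 0.0, 0.0, 1.0]])  # prompt 2: one of four samples correct
print(grpo_advantages(rewards))
```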
FlashInfer: Kernel Library for LLM Serving
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
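To show roughly what 4-bit group-wise weight storage looks like, here is a generic quantize/dequantize round trip. This is not AutoAWQ's implementation and omits AWQ's activation-aware scaling of salient channels; it only illustrates the per-group scale and zero-point format:

```python
import torch

def quantize_4bit_groupwise(w: torch.Tensor, group_size: int = 128):
    """Asymmetric 4-bit quantization with one scale/zero-point per weight group.

    w: [out_features, in_features]; in_features must be divisible by group_size.
    Returns integer codes in [0, 15], per-group scales/zero points, and the
    dequantized reconstruction.
    """
    out_f, in_f = w.shape
    g = w.reshape(out_f, in_f // group_size, group_size)
    w_min = g.amin(dim=-1, keepdim=True)
    w_max = g.amax(dim=-1, keepdim=True)
    scale = (w_max - w_min).clamp(min=1e-8) / 15.0
    zero = (-w_min / scale).round()
    codes = ((g / scale) + zero).round().clamp(0, 15)
    dequant = (codes - zero) * scale
    return codes.to(torch.uint8), scale, zero, dequant.reshape(out_f, in_f)

w = torch.randn(16, 256)
codes, scale, zero, w_hat = quantize_4bit_groupwise(w)
print((w - w_hat).abs().max())  # reconstruction error introduced by 4-bit storage
```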
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
Sky-T1: Train your own O1 preview model within $450
verl: Volcano Engine Reinforcement Learning for LLMs
Train transformer language models with reinforcement learning.
Minimal reproduction of DeepSeek R1-Zero
Recipes to scale inference-time compute of open models
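The simplest inference-time scaling recipe is best-of-N sampling: draw several candidate answers and keep the one a verifier or reward model scores highest. The sketch below uses hypothetical `generate` and `score` callables as stand-ins for a sampler and a reward model; it is a conceptual illustration, not code from the repository:

```python
from typing import Callable

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 8) -> str:
    """Sample n candidates and return the highest-scoring one."""
    candidates = [generate(prompt) for _ in range(n)]
    scores = [score(prompt, c) for c in candidates]
    return candidates[max(range(n), key=scores.__getitem__)]

# Toy stand-ins for the sampler and reward model (hypothetical):
answers = iter(["42", "41", "43", "42 because 6*7=42"])
pick = best_of_n("What is 6*7?",
                 generate=lambda p: next(answers),
                 score=lambda p, a: float("42" in a) + 0.1 * ("because" in a),
                 n=4)
print(pick)
```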
Reverse Engineering the Abstraction and Reasoning Corpus
Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge
Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"
Machine Learning Engineering Open Book
Training LLMs with QLoRA + FSDP
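For the QLoRA entry above, a minimal pure-PyTorch sketch of the LoRA part is shown below: a frozen base linear layer plus a trainable low-rank update. Real QLoRA stores the frozen weights in 4-bit NF4 (e.g., via bitsandbytes) and shards training with FSDP, both of which this sketch omits:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base linear layer plus a trainable low-rank (LoRA) update.

    In QLoRA the frozen weight would be kept in 4-bit NF4 and dequantized
    on the fly; here it stays in full precision to keep the sketch short.
    """
    def __init__(self, base: nn.Linear, r: int = 16, alpha: int = 32):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False           # only the adapter matrices are trained
        self.lora_a = nn.Linear(base.in_features, r, bias=False)
        self.lora_b = nn.Linear(r, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)    # adapter starts as a no-op
        self.scaling = alpha / r

    def forward(self, x):
        return self.base(x) + self.scaling * self.lora_b(self.lora_a(x))

layer = LoRALinear(nn.Linear(1024, 1024), r=16)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # ~32k trainable parameters vs ~1M frozen ones
```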