leocnj (Lei Chen) / Starred · GitHub
  • RIT-Boston
  • Princeton, NJ


Starred repositories

Python 150 7 Updated Jun 2, 2025

Visual testing tool for MCP servers

TypeScript 4,109 535 Updated Jun 17, 2025

🚀 The fast, Pythonic way to build MCP servers and clients

Python 12,723 768 Updated Jun 16, 2025

Verifiers for LLM Reinforcement Learning

Python 60 9 Updated Apr 15, 2025

Rakuten API client for Python

Python 2 1 Updated Sep 2, 2014

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,379 308 Updated May 13, 2025

LIMO: Less is More for Reasoning

Python 960 49 Updated Apr 6, 2025

s1: Simple test-time scaling

Python 6,447 749 Updated May 19, 2025

Building DeepSeek R1 from Scratch

Jupyter Notebook 626 101 Updated Mar 21, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,268 325 Updated May 18, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 14,411 1,026 Updated Jun 15, 2025

Synthetic data curation for post-training and structured data extraction

Python 1,404 109 Updated Jun 16, 2025

Fantastic Data Engineering for Large Language Models

90 4 Updated Dec 29, 2024

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Python 223 17 Updated Apr 26, 2024

A series of math-specific large language models of our Qwen2 series.

Python 948 132 Updated Jan 11, 2025

[Preprint] AIPO: Improving Training Objective for Iterative Preference Optimization

Python 11 Updated Oct 3, 2024

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 158 13 Updated Sep 6, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,500 148 Updated Jun 17, 2025

A course on aligning smol models.

Jupyter Notebook 5,945 2,112 Updated Jan 24, 2025

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,486 158 Updated May 24, 2025

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,718 107 Updated Dec 8, 2024

[NeurIPS'24] SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning

Python 22 2 Updated Nov 19, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Python 84 7 Updated Jun 13, 2025

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 556 30 Updated Dec 9, 2024

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,756 206 Updated Jun 9, 2025

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,026 54 Updated Feb 2, 2025

Optimizing inference proxy for LLMs

Python 2,530 191 Updated Jun 17, 2025

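Welcome to LLM-Dojo, an open-source place for learning about large language models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, and others), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩‍🎓👨‍🎓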

Python 776 66 Updated Jun 4, 2025

Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"

Python 54 3 Updated Oct 1, 2024

[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct

Python 2,019 165 Updated Nov 1, 2024