8000 fanpu (Fan Pu Zeng) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View fanpu's full-sized avatar

Organizations

@15-411-f20

Block or report fanpu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,159 101 Updated May 8, 2024

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

364 17 Updated Oct 4, 2023

A quick guide (especially) for trending instruction finetuning datasets

3,097 204 Updated Nov 28, 2023

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,736 153 Updated May 28, 2025

A community Bash framework.

Shell 14,587 2,301 Updated Jun 3, 2025

A toolkit to run Ray applications on Kubernetes

Go 1,804 544 Updated Jun 6, 2025

A hyperparameter optimization framework

Python 12,070 1,109 Updated Jun 6, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,413 611 Updated May 27, 2025

Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models

Python 55 6 Updated Apr 26, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 8,008 514 Updated Jun 6, 2025

A distributed execution engine for cloud computing

Python 84 13 Updated May 7, 2012

Open MPI main development repository

C 2,359 901 Updated Jun 4, 2025

Sparrow scheduling platform (U.C. Berkeley).

Python 323 92 Updated Jul 25, 2020

A flexible, high-performance serving system for machine learning models

C++ 6,288 2,201 Updated May 31, 2025

A low-latency prediction-serving system

C++ 1,416 280 Updated Apr 26, 2021
Python 81 22 Updated May 24, 2021

ReduNet

Python 539 81 Updated Feb 17, 2022

Code for CRATE (Coding RAte reduction TransformEr).

Python 1,231 96 Updated Oct 23, 2024
Jupyter Notebook 131 4 Updated May 16, 2025

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 582 72 Updated Feb 29, 2024

Official Repository of Absolute Zero Reasoner

Python 1,466 243 Updated Jun 2, 2025
TypeScript 303 21 Updated Apr 28, 2025

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

132 6156 8 Updated Jun 12, 2024

The NetHack Learning Environment

C 954 113 Updated May 6, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 777 69 Updated Mar 14, 2025

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Jupyter Notebook 84 2 Updated Apr 1, 2025

Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"

Python 73 8 Updated Oct 15, 2024

Efficient Triton Kernels for LLM Training

Python 5,160 346 Updated Jun 6, 2025
Python 458 36 Updated May 29, 2025
Next
0