8000 bingzhengwei (Anonymous) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View bingzhengwei's full-sized avatar

Block or report bingzhengwei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

This API provides programmatic access to the AlphaGenome model developed by Google DeepMind.

Python 949 94 Updated Jul 1, 2025

Efficient Triton Kernels for LLM Training

Python 5,298 363 Updated Jul 4, 2025

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 224 10 Updated Apr 2, 2025

Distributed Compiler based on Triton for Parallel Systems

Python 863 67 Updated Jul 4, 2025

🌐 WebAgent for Information Seeking bulit by Tongyi Lab: WebWalker & WebDancer & WebSailor

Python 1,261 96 Updated Jul 4, 2025

kernels, of the mega variety

Python 423 21 Updated Jun 2, 2025

Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model

Python 200 11 Updated May 27, 2025
Python 307 19 Updated Jun 13, 2025

所有小初高、大学PDF教材。

Roff 43,407 9,686 Updated May 18, 2025

Scalable toolkit for efficient model alignment

Python 820 98 Updated May 31, 2025

Py D696 torch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,165 52 Updated Jun 18, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,570 330 Updated Jul 3, 2025

Distributed RL System for LLM Reasoning

Python 1,928 108 Updated Jul 4, 2025

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,011 49 Updated Jul 1, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,402 59 Updated May 11, 2025

A PyTorch native platform for training generative AI models

Python 4,003 416 Updated Jul 3, 2025

Fully open reproduction of DeepSeek-R1

Python 24,958 2,320 Updated Jul 3, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 997 67 Updated May 28, 2025

PyTorch building blocks for the OLMo ecosystem

Python 243 44 Updated Jul 4, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 2,623 164 Updated Jun 27, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,089 908 Updated Jun 17, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,823 298 Updated Mar 10, 2025

Latest Advances on System-2 Reasoning

Python 1,157 59 Updated Jun 8, 2025

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda 635 45 Updated Jun 19, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,245 835 Updated Jul 4, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 399 31 Updated May 30, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,640 873 Updated Apr 29, 2025

Official Repo for Open-Reasoner-Zero

Python 1,982 107 Updated Jun 2, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,075 161 Updated Jun 27, 2025
Python 50 1 Updated Oct 2, 2024
Next
0