popomen (Nan Zhe) / Starred · GitHub

Train transformer language models with reinforcement learning.

Python 14,399 1,999 Updated Jun 28, 2025

ByteCheckpoint: A Unified Checkpointing Library for LFMs

Python 221 10 Updated Apr 2, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 7,228 701 Updated Jun 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 10,201 1,685 Updated Jul 1, 2025

Infiniband Verbs Performance Tests

C 777 331 Updated Jun 19, 2025

RDMA core userspace libraries and daemons

C 1,841 756 Updated Jul 1, 2025

Large Context Attention

Python 716 53 Updated Jan 24, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,884 1,143 Updated Jun 26, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 29,584 6,074 Updated Jul 1, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,185 289 Updated Jun 30, 2025

A PyTorch native platform for training generative AI models

Python 3,985 411 Updated Jul 1, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,920 521 Updated Sep 25, 2024

Rotary Transformer

Python 973 56 Updated Mar 21, 2022

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,288 1,506 Updated Jun 26, 2025

A PyTorch Native LLM Training Framework

Python 824 49 Updated Dec 27, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 146,262 29,496 Updated Jul 1, 2025

Paragraph-by-paragraph close readings of classic and new deep learning papers

30,651 2,661 Updated Mar 22, 2025

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,097 355 Updated Mar 24, 2025

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,637 936 Updated Aug 21, 2024

A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.

Go 1,536 370 Updated Jun 30, 2025

An industrial deep learning framework for high-dimension sparse data

PureBasic 4,289 1,029 Updated Sep 25, 2024

Kubernetes-native Deep Learning Framework

Python 742 116 Updated Jan 26, 2024

DLRover: An Automatic Distributed Deep Learning System

Python 1,496 184 Updated Jul 1, 2025

Policy based networking for cloud native applications

720 99 Updated Apr 3, 2020

flannel is a network fabric for containers, designed for Kubernetes

Go 9,168 2,886 Updated Jun 30, 2025

gRPC to JSON proxy generator following the gRPC HTTP spec

Go 19,339 2,322 Updated Jul 1, 2025

PyTorch extensions for high performance and large scale training.

Python 3,336 288 Updated Apr 26, 2025

Making large AI models cheaper, faster and more accessible

Python 41,006 4,523 Updated Jun 30, 2025

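AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks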

Jupyter Notebook 14,169 2,045 Updated Jul 1, 2025

Giving Kubernetes Superpowers to everyone

Go 6,726 836 Updated Jun 30, 2025