- AReaL Public
Forked from inclusionAI/AReaL
Distributed RL System for LLM Reasoning
Python Apache License 2.0 Updated Apr 7, 2025
- flashinfer Public
Forked from flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 Updated Mar 5, 2025
- DeepSpeedExamples Public
Forked from deepspeedai/DeepSpeedExamples
Example models using DeepSpeed
Python Apache License 2.0 Updated Oct 5, 2024
- sphinx-action Public
Forked from ammaraskar/sphinx-action
Github action that builds docs using sphinx and places errors inline
Python Apache License 2.0 Updated Jun 20, 2024
- OpenRLHF-1 Public
Forked from OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Python Apache License 2.0 Updated Jun 19, 2024
- sphinx-pages Public
Forked from seanzhengw/sphinx-pages
Build html documentation by Sphinx, and push to branch gh-pages.
Shell Updated Jun 19, 2024
- sipo Public
Iteratively Learn Diverse Strategies with State Distance Information
- DeepSpeed Public
Forked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 Updated Mar 27, 2024
- cugae Public
CUDA implementation of Generalized Advantage Estimation (GAE)
- flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
- gpu-burn Public
Forked from wilicc/gpu-burn
Multi-GPU CUDA stress test
C++ BSD 2-Clause "Simplified" License Updated Oct 12, 2023
- Trust-Region-Methods-in-Multi-Agent-Reinforcement-Learning Public
Forked from anonymous-ICLR22/Trust-Region-Methods-in-Multi-Agent-Reinforcement-Learning
- revisiting_marl Public
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
- build_football_engine Public
Build script for Google Research Football on M1 Mac.
Shell Updated Jan 18, 2022