8000 li-plus (Jiahao Li) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View li-plus's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report li-plus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 13,783 1,885 Updated May 16, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,351 597 Updated May 16, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,662 769 Updated May 12, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,087 5,974 Updated May 16, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,110 959 Updated May 18, 2025

Fully open reproduction of DeepSeek-R1

Python 24,449 2,250 Updated May 17, 2025
Python 6,021 393 Updated May 15, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 40,968 5,246 Updated Oct 10, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 9,345 1,135 Updated Oct 9, 2024

A collection of resources and papers on Diffusion Models

HTML 11,721 976 Updated Aug 1, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 25,031 2,228 Updated May 16, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,280 248 Updated May 16, 2025

A guidance language for controlling large language models.

Jupyter Notebook 20,198 1,104 Updated May 16, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,643 1,838 Updated May 15, 2025

The Arcade Learning Environment (ALE) -- a platform for AI research.

C++ 2,274 444 Updated May 12, 2025

A Survey on Large Language Model-Based Game Agents

604 25 Updated Apr 30, 2025

Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

Python 3,381 260 Updated Jan 21, 2025

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 465 60 Updated Mar 11, 2025

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,526 991 Updated May 8, 2025

A PyTorch native platform for training generative AI models

Python 3,814 369 Updated May 17, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,550 252 Updated May 12, 2025

Efficient Triton Kernels for LLM Training

Python 5,024 324 Updated May 17, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 19,796 2,097 Updated Mar 11, 2025

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,280 4,905 Updated Aug 1, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 10,885 2,337 Updated Aug 5, 2024

StarCraft II Learning Environment

Python 8,125 1,166 Updated Jul 23, 2024

A StarCraft II bot api client library for Python 3

Python 552 168 Updated Jan 11, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 83,378 61,172 Updated Apr 19, 2025
Next
0