Lists (1)
Sort Name ascending (A-Z)
Stars
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
verl: Volcano Engine Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)
Train transformer language models with reinforcement learning.
上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。
Transformer: PyTorch Implementation of "Attention Is All You Need"
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Booking the sports places automatically.
An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git
Fast and Lightweight Observability Data Collector
The repository is for safe reinforcement learning baselines.
Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Multi-Joint dynamics with Contact. A general purpose physics simulator.
A parallel framework for population-based multi-agent reinforcement learning.
[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI
Awesome Game AI materials of Multi-Agent Reinforcement Learning
[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)
Computational framework for reinforcement learning in traffic control
Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting