10000 JiahuiSun (Sun Jiahui) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View JiahuiSun's full-sized avatar

Block or report JiahuiSun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,130 1,687 Updated Jun 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,791 1,611 Updated Jun 22, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,150 694 Updated Jun 19, 2025

Train transformer language models with reinforcement learning.

Python 14,285 1,982 Updated Jun 22, 2025

上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template

TeX 3,560 797 Updated Apr 15, 2025

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,168 768 Updated Oct 16, 2024

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Python 7,013 1,223 Updated Jun 18, 2025
Python 2 1 Updated Sep 30, 2023

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,800 546 Updated Aug 6, 2024

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,319 17,000 Updated Jun 21, 2025

Booking the sports places automatically.

JavaScript 4 2 Updated Nov 13, 2021

Unified Reinforcement Learning Framework

Python 745 72 Updated Sep 6, 2024

An elegant \LaTeX\ résumé template. 大陆镜像 https://gods.coding.net/p/resume/git

TeX 10,097 2,713 Updated Mar 15, 2024

Fast and Lightweight Observability Data Collector

C++ 1,915 417 Updated Jun 20, 2025

The repository is for safe reinforcement learning baselines.

Jupyter Notebook 651 86 Updated Apr 16, 2025

Multi-Objective Reinforcement Learning

Python 277 53 Updated Aug 10, 2021

Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20

Python 73 16 Updated Dec 8, 2022

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,348 881 Updated Mar 24, 2023

Robust-HUCRL

Python 3 1 Updated Nov 13, 2023

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Jupyter Notebook 9,837 1,037 Updated Jun 22, 2025

A parallel framework for population-based multi-agent reinforcement learning.

Python 532 63 Updated Dec 14, 2023

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | 斗地主AI

Python 4,344 618 Updated Jun 26, 2024

Awesome Game AI materials of Multi-Agent Reinforcement Learning

871 105 Updated Jun 26, 2024

[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Python 113 32 Updated Oct 9, 2020

Mac screensaver for counting down to a date

Swift 980 92 Updated Jul 16, 2018

This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)

Jupyter Notebook 33 12 Updated Apr 27, 2022

Computational framework for reinforcement learning in traffic control

Python 1,123 386 Updated Jul 27, 2024

Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting

Python 68 15 Updated Oct 27, 2020
Next
0