JiahuiSun

Sun Jiahui JiahuiSun

I am currently a Ph.D. student at Shanghai Jiao Tong University (SJTU). I received my Bachelor's degree from Tianjin University (TJU).

16 followers · 19 following

Lists (1)

offline RL

Stars

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

TypeScript 14,130 1,687 Updated Jun 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 9,791 1,611 Updated Jun 22, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agent RL)

Python 7,150 694 Updated Jun 19, 2025

deepseek-ai / DeepSeek-R1

90,219 11,653 Updated Apr 9, 2025

deepseek-ai / DeepSeek-V3

Python 97,784 15,906 Updated Jun 16, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 14,285 1,982 Updated Jun 22, 2025

sjtug / SJTUThesis

Shanghai Jiao Tong University LaTeX Thesis Template

TeX 3,560 797 Updated Apr 15, 2025

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)

HTML 8,168 768 Updated Oct 16, 2024

itcharge / AlgoNote

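⛽️ "Algorithm Clearance Handbook": a highly detailed introductory tutorial on algorithms and data structures for learning algorithms from scratch, with detailed solutions to 850+ LeetCode problems and 200 popular interview problems from major tech companies.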

Python 7,013 1,223 Updated Jun 18, 2025

TianjunChi / yolov3_pytorch

Python 2 1 Updated Sep 30, 2023

hyunwoongko / transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,800 546 Updated Aug 6, 2024

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,319 17,000 Updated Jun 21, 2025

SunicYosen / sjtu-sports

Booking the sports places automatically.

JavaScript 4 2 Updated Nov 13, 2021

OpenRL-Lab / openrl

Unified Reinforcement Learning Framework

Python 745 72 Updated Sep 6, 2024

billryan / resume

An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git

TeX 10,097 2,713 Updated Mar 15, 2024

alibaba / loongcollector

Fast and Lightweight Observability Data Collector

C++ 1,915 417 Updated Jun 20, 2025

chauncygu / Safe-Reinforcement-Learning-Baselines

The repository is for safe reinforcement learning baselines.

Jupyter Notebook 651 86 Updated Apr 16, 2025

RunzheYang / MORL

Multi-Objective Reinforcement Learning

Python 277 53 Updated Aug 10, 2021

nrhinehart / deep_imitative_models

Reimplementation (currently partial) of Deep Imitative Models paper, ICLR '20

Python 73 16 Updated Dec 8, 2022

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,348 881 Updated Mar 24, 2023

sebascuri / rhucrl

Robust-HUCRL

Python 3 1 Updated Nov 13, 2023

google-deepmind / mujoco

Multi-Joint dynamics with Contact. A general purpose physics simulator.

Jupyter Notebook 9,837 1,037 Updated Jun 22, 2025

sjtu-marl / malib

A parallel framework for population-based multi-agent reinforcement learning.

Python 532 63 Updated Dec 14, 2023

kwai / DouZero

[ICML 2021] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | DouDizhu AI

Python 4,344 618 Updated Jun 26, 2024

datamllab / awesome-game-ai

Awesome Game AI materials of Multi-Agent Reinforcement Learning

871 105 Updated Jun 26, 2024

mit-gfx / PGMORL

[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Python 113 32 Updated Oct 9, 2020

soffes / Countdown

Mac screensaver for counting down to a date

Swift 980 92 Updated Jul 16, 2018

juhyeonkim95 / TaxiSimulatorOnGraph

This is the official implementation of "Optimizing Large-Scale Fleet Management on a Road Network using Multi-Agent Deep Reinforcement Learning with Graph Neural Network" (ITSC 2021)

Jupyter Notebook 33 12 Updated Apr 27, 2022

flow-project / flow

Computational framework for reinforcement learning in traffic control

Python 1,123 386 Updated Jul 27, 2024

RobinLu1209 / STAG-GCN

Spatiotemporal Adaptive Gated Graph Convolution Network for Urban Traffic Flow Forecasting

Python 68 15 Updated Oct 27, 2020
