wyq199321

xebooe wyq199321

3 followers · 8 following

Stars

rlcode / reinforcement-learning-kr-v2

[파이썬과 케라스로 배우는 강화학습] 텐서플로우 2.0 개정판 예제

Python 131 99 Updated Mar 25, 2023

hcnoh / gail-pytorch

A simple implementation of Generative Adversarial Imitation Learning with PyTorch

Python 158 27 Updated Mar 22, 2022

RchalYang / torchrl

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

Python 221 22 Updated Jul 10, 2022

Stable-Baselines-Team / stable-baselines

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 299 60 Updated Apr 29, 2023

cookbenjamin / DDPG

Clean Python Implementation of the Deep Deterministic Policy Gradients Algorithm

Python 75 26 Updated Jan 11, 2017

toshikwa / gail-airl-ppo.pytorch

PyTorch implementation of GAIL and AIRL based on PPO.

Python 218 34 Updated Nov 22, 2020

BY571 / CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Python 136 23 Updated May 6, 2024

ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,760 836 Updated May 29, 2022

ikostrikov / pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Python 441 89 Updated Sep 13, 2018

thesouther / cnn_with_numpy

卷积神经网络（Convolutional Neural Networks, CNN），只使用python基础库搭建。

Jupyter Notebook 18 6 Updated Apr 21, 2020

thesouther / MARL

多智能体强化学习（MARL）算法复现，包括QMIX，VDN，QTRAN、MAVEN等等

Python 197 24 Updated Jun 6, 2022

sfujim / TD7

Author's PyTorch implementation of TD7 for online and offline RL

Python 143 12 Updated Sep 12, 2023

oxwhirl / wqmix

Code for Weighted QMIX

Python 136 36 Updated Nov 12, 2020

Bigpig4396 / Multi-Agent-Reinforcement-Learning-Environment

Hello, I pushed some python environments for Multi Agent Reinforcement Learning.

Python 708 128 Updated May 23, 2022

adik993 / ppo-pytorch

Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)

Python 140 28 Updated Jan 12, 2019

qqiang00 / Reinforce

Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments

Jupyter Notebook 852 480 Updated Nov 20, 2019

alirezakazemipour / DDPG-HER

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.

Python 97 17 Updated May 12, 2025

GiannisMitr / DQN-Atari-Breakout

A Deep Q-Network trained to play Breakout Atari game on OpenAI Gym environment.

Jupyter Notebook 17 3 Updated Dec 5, 2021

grantsrb / Gym-Snake

An OpenAI gym environment made for RL

Python 68 30 Updated Dec 16, 2023

DLR-RM / rl-trained-agents

A collection of pre-trained RL agents using Stable Baselines3

Python 127 27 Updated Nov 5, 2024

QuimMarset / MAI-ATCI

Jupyter Notebook 1 1 Updated Nov 3, 2023

toshikwa / sac-discrete.pytorch

PyTorch implementation of SAC-Discrete.

Python 302 35 Updated Jul 25, 2024

unionai-oss / pandera

A light-weight, flexible, and expressive statistical data testing library

Python 3,820 335 Updated May 21, 2025

dgriff777 / rl_a3c_pytorch

A3C LSTM Atari with Pytorch plus A3G design

Python 567 117 Updated Apr 18, 2023

pranz24 / Blynk-Water-Level-Sensor

Monitoring water level in a small glass using nodemcu and Blynk

C++ 8 3 Updated Aug 21, 2018

BY571 / DQN-Atari-Agents

DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN

Jupyter Notebook 123 14 Updated Dec 18, 2020

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,212 190 Updated Feb 9, 2021

modin-project / modin

Modin: Scale your Pandas workflows by changing a single line of code

Python 10,165 665 Updated May 24, 2025

openai / safety-gym

Tools for accelerating safe exploration research.

Python 538 144 Updated Apr 2, 2023

google-research / batch_rl

Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games

Python 547 74 Updated Jun 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly