Stars
[Reinforcement Learning with Python and Keras] Examples for the TensorFlow 2.0 revised edition
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
PyTorch implementation of reinforcement learning algorithms (Soft Actor-Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Clean Python Implementation of the Deep Deterministic Policy Gradients Algorithm
PyTorch implementation of GAIL and AIRL based on PPO.
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
PyTorch implementation of Trust Region Policy Optimization
Convolutional Neural Networks (CNN), built using only basic Python libraries.
Author's PyTorch implementation of TD7 for online and offline RL
Hello, I pushed some Python environments for multi-agent reinforcement learning.
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
A Deep Q-Network trained to play the Atari game Breakout in an OpenAI Gym environment.
A collection of pre-trained RL agents using Stable Baselines3
PyTorch implementation of SAC-Discrete.
A light-weight, flexible, and expressive statistical data testing library
A3C LSTM Atari with PyTorch plus A3G design
Monitoring the water level in a small glass using NodeMCU and Blynk
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Modin: Scale your Pandas workflows by changing a single line of code
Tools for accelerating safe exploration research.
Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games