wyq199321 (xebooe) / Starred · GitHub

[Reinforcement Learning with Python and Keras] Examples for the TensorFlow 2.0 revised edition

Python 131 99 Updated Mar 25, 2023

A simple implementation of Generative Adversarial Imitation Learning with PyTorch

Python 158 27 Updated Mar 22, 2022

PyTorch implementation of reinforcement learning algorithms (Soft Actor-Critic (SAC) / DDPG / TD3 / DQN / A2C / PPO / TRPO)

Python 221 22 Updated Jul 10, 2022

Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Python 299 60 Updated Apr 29, 2023

Clean Python Implementation of the Deep Deterministic Policy Gradients Algorithm

Python 75 26 Updated Jan 11, 2017

PyTorch implementation of GAIL and AIRL based on PPO.

Python 218 34 Updated Nov 22, 2020

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Python 136 23 Updated May 6, 2024

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,760 836 Updated May 29, 2022

PyTorch implementation of Trust Region Policy Optimization

Python 441 89 Updated Sep 13, 2018

Convolutional Neural Networks (CNN), built using only basic Python libraries.

Jupyter Notebook 18 6 Updated Apr 21, 2020

Reproductions of multi-agent reinforcement learning (MARL) algorithms, including QMIX, VDN, QTRAN, MAVEN, and others

Python 197 24 Updated Jun 6, 2022

Author's PyTorch implementation of TD7 for online and offline RL

Python 143 12 Updated Sep 12, 2023

Code for Weighted QMIX

Python 136 36 Updated Nov 12, 2020

Hello, I pushed some python environments for Multi Agent Reinforcement Learning.

Python 708 128 Updated May 23, 2022

Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)

Python 140 28 Updated Jan 12, 2019

Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments

Jupyter Notebook 852 480 Updated Nov 20, 2019

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.

Python 97 17 Updated May 12, 2025

A Deep Q-Network trained to play the Atari game Breakout in an OpenAI Gym environment.

Jupyter Notebook 17 3 Updated Dec 5, 2021

An OpenAI gym environment made for RL

Python 68 30 Updated Dec 16, 2023

A collection of pre-trained RL agents using Stable Baselines3

Python 127 27 Updated Nov 5, 2024
Jupyter Notebook 1 1 Updated Nov 3, 2023

PyTorch implementation of SAC-Discrete.

Python 302 35 Updated Jul 25, 2024

A light-weight, flexible, and expressive statistical data testing library

Python 3,820 335 Updated May 21, 2025

A3C LSTM Atari with Pytorch plus A3G design

Python 567 117 Updated Apr 18, 2023

Monitoring water level in a small glass using nodemcu and Blynk

C++ 8 3 Updated Aug 21, 2018

DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow, and DRQN

Jupyter Notebook 123 14 Updated Dec 18, 2020

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,212 190 Updated Feb 9, 2021

Modin: Scale your Pandas workflows by changing a single line of code

Python 10,165 665 Updated May 24, 2025

Tools for accelerating safe exploration research.

Python 538 144 Updated Apr 2, 2023

Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games

Python 547 74 Updated Jun 26, 2023