PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,757 836 Updated May 29, 2022

ml-jku / hopfield-layers

Hopfield Networks is All You Need

Python 1,803 201 Updated Apr 23, 2023

hermesdt / reinforcement-learning

Jupyter Notebook 39 24 Updated May 20, 2020

jonathan-laurent / AlphaZero.jl

A generic, simple and fast implementation of DeepMind's AlphaZero algorithm.

Julia 1,278 139 Updated Jan 5, 2025

tgangwani / RL-Indirect-imitation

PyTorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Python 19 4 Updated Feb 29, 2020

Dralliag / opera

Online Prediction by ExpeRt Aggregation

R 52 17 Updated Nov 7, 2024

Bjarten / early-stopping-pytorch

Early stopping for PyTorch

Jupyter Notebook 1,252 292 Updated Nov 11, 2024

cyberbotics / webots

Webots Robot Simulator

C++ 3,591 1,826 Updated May 20, 2025

felixmusil / ml_tools

Set of tools and utilities for machine learning of materials

Python 3 1 Updated Jan 9, 2020

qzed / irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Jupyter Notebook 286 63 Updated Apr 21, 2024

lviano

Stars

stefanoviel / SOAR-IL

aravindsiv / irl-lab

Kaixhin / imitation-learning

adriangb / scikeras

tkipf / c-swm

benelot / pybullet-gym

ASzot / rl-toolkit

werner-duvaud / muzero-general

sunblaze-ucb / rl-generalization

ythuangyt / Robust-Reinforcement-Learning-via-Adversarial-training-with-Langevin-Dynamics

ikostrikov / pytorch-a2c-ppo-acktr-gail