PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,805 839 Updated May 29, 2022

ml-jku / hopfield-layers

Hopfield Networks is All You Need

Python 1,827 207 Updated Apr 23, 2023

hermesdt / reinforcement-learning

Jupyter Notebook 39 23 Updated May 20, 2020

jonathan-laurent / AlphaZero.jl

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Julia 1,292 140 Updated Jan 5, 2025

tgangwani / RL-Indirect-imitation

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Python 19 4 Updated Feb 29, 2020

Dralliag / opera

Online Prediction by ExpeRt Aggregation

R 53 17 Updated Nov 7, 2024

Bjarten / early-stopping-pytorch

Early stopping for PyTorch

Jupyter Notebook 1,261 292 Updated Nov 11, 2024

cyberbotics / webots

Webots Robot Simulator

C++ 3,687 1,864 Updated Jul 23, 2025

felixmusil / ml_tools

set of tools and utilities for machine learning of materials

Python 3 1 Updated Jan 9, 2020

qzed / irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Jupyter Notebook 290 64 Updated Apr 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lviano

Block or report lviano

Stars

stefanoviel / SOAR-IL

aravindsiv / irl-lab

Kaixhin / imitation-learning

adriangb / scikeras

tkipf / c-swm

benelot / pybullet-gym

ASzot / rl-toolkit

werner-duvaud / muzero-general

sunblaze-ucb / rl-generalization

ythuangyt / Robust-Reinforcement-Learning-via-Adversarial-training-with-Langevin-Dynamics

ikostrikov / pytorch-a2c-ppo-acktr-gail