8000 lviano / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lviano's full-sized avatar

Block or report lviano

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 4 1 Updated Mar 30, 2025

An implementation of popular Inverse Reinforcement Learning algorithms for various tasks.

Jupyter Notebook 21 6 Updated Jul 26, 2017

Imitation learning algorithms

Python 541 43 Updated Mar 22, 2025

Scikit-Learn API wrapper for Keras.

Python 247 51 Updated Dec 12, 2024

Contrastive Learning of Structured World Models

Python 393 66 Updated Jun 3, 2020

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Python 858 122 Updated Oct 16, 2021
Python 11 4 Updated Apr 26, 2022

MuZero

Python 2,673 655 Updated Sep 3, 2024

Modifiable OpenAI Gym environments for studying generalization in RL

Python 87 14 Updated Jan 22, 2019

Author implementations of paper "Robust Reinforcement Learning via Adversarial training with Langevin Dynamics"

Python 9 1 Updated Sep 21, 2020

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,805 839 Updated May 29, 2022

Hopfield Networks is All You Need

Python 1,827 207 Updated Apr 23, 2023
Jupyter Notebook 39 23 Updated May 20, 2020

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Julia 1,292 140 Updated Jan 5, 2025

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Python 19 4 Updated Feb 29, 2020

Online Prediction by ExpeRt Aggregation

R 53 17 Updated Nov 7, 2024

Early stopping for PyTorch

Jupyter Notebook 1,261 292 Updated Nov 11, 2024

Webots Robot Simulator

C++ 3,687 1,864 Updated Jul 23, 2025

set of tools and utilities for machine learning of materials

Python 3 1 Updated Jan 9, 2020

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Jupyter Notebook 290 64 Updated Apr 21, 2024
0