10000 lviano / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View lviano's full-sized avatar

Block or report lviano

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5 1 Updated Mar 30, 2025

An implementation of popular Inverse Reinforcement Learning algorithms for various tasks.

Jupyter Notebook 21 6 Updated Jul 26, 2017

Imitation learning algorithms

Python 529 43 Updated Mar 22, 2025

Scikit-Learn API wrapper for Keras.

Python 246 50 Updated Dec 12, 2024

Contrastive Learning of Structured World Models

Python 392 66 Updated Jun 3, 2020

Open-source implementations of OpenAI Gym MuJoCo environments for use with the OpenAI Gym Reinforcement Learning Research Platform.

Python 849 122 Updated Oct 16, 2021
Python 10 4 Updated Apr 26, 2022

MuZero

Python 2,644 647 Updated Sep 3, 2024

Modifiable OpenAI Gym environments for studying generalization in RL

Python 87 14 Updated Jan 22, 2019

Author implementations of paper "Robust Reinforcement Learning via Adversarial training with Langevin Dynamics"

Python 8 1 Updated Sep 21, 2020

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,757 836 Updated May 29, 2022

Hopfield Networks is All You Need

Python 1,803 201 Updated Apr 23, 2023
Jupyter Notebook 39 24 Updated May 20, 2020

A generic, simple and fast implementation of Deepmind's AlphaZero algorithm.

Julia 1,278 139 Updated Jan 5, 2025

Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)

Python 19 4 Updated Feb 29, 2020

Online Prediction by ExpeRt Aggregation

R 52 17 Updated Nov 7, 2024

Early stopping for PyTorch

Jupyter Notebook 1,252 292 Updated Nov 11, 2024

Webots Robot Simulator

C++ 3,591 1,826 Updated May 20, 2025

set of tools and utilities for machine learning of materials

Python 3 1 Updated Jan 9, 2020

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Jupyter Notebook 286 63 Updated Apr 21, 2024
0