gymnasium_mujoco_docker

Gymnasium MuJoCo Docker

A Docker-based environment for training and evaluating the Inverted Pendulum MuJoCo environment from Gymnasium. This project supports both CPU and GPU training using NVIDIA CUDA.

Project Structure

.
├── .docker
│   ├── docker-compose.yaml
│   ├── Dockerfile
│   └── req.txt
├── eval.py
├── LICENSE
├── README.md
└── train.py

Requirements

Docker
Docker Compose

Quick Start

Building the Container

cd .docker
docker-compose build

Training the Agent

To train the agent using default parameters:

cd .docker
docker-compose run mujoco python3 train.py

With custom parameters:

cd .docker
docker-compose run mujoco python3 train.py --timesteps 2000000 --num-envs 4

To view the training progress use tensorboard in a new terminal:

cd .docker
docker-compose run -p 6006:6006 mujoco python -m tensorboard.main --logdir=./logs --bind_all

Evaluating the Trained Agent

After training, you can evaluate the agent:

cd .docker
docker-compose run mujoco python eval.py --model-path ./models/best/best_model.zip --num-episodes 2 --render --save-video

Training Options

The train.py script accepts the following command-line arguments:

--timesteps: Number of timesteps to train for (default: 1000000)
--seed: Random seed (default: 42)
--num-envs: Number of parallel environments (default: 8)
--save-dir: Directory to save models (default: ./models)
--log-dir: Directory to save logs (default: ./logs)

Evaluation Options

The eval.py script accepts the following command-line arguments:

--model-path: Path to the trained model (required)
--num-episodes: Number of episodes to evaluate (default: 10)
--render: Render the environment during evaluation
--save-video: Save a video of the evaluation

Environment Details

This Docker environment uses:

Python 3.12 (from the official python:3.12-slim-bookworm image)
Gymnasium with MuJoCo support
Stable Baselines3 for reinforcement learning algorithms
Software rendering (osmesa) for environments without GPU

Customization

To use different MuJoCo environments, simply modify the env_id variable in the training and evaluation scripts.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

gymnasium_mujoco_docker

Gymnasium MuJoCo Docker

Project Structure

Requirements

Quick Start

Building the Container

Training the Agent

Evaluating the Trained Agent

Training Options

Evaluation Options

Environment Details

Customization

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.docker		.docker
logs		logs
models		models
videos		videos
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
train.py		train.py

License

jc-cr/gymnasium_mujoco_docker

Folders and files

Latest commit

History

Repository files navigation

gymnasium_mujoco_docker

Gymnasium MuJoCo Docker

Project Structure

Requirements

Quick Start

Building the Container

Training the Agent

Evaluating the Trained Agent

Training Options

Evaluation Options

Environment Details

Customization

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages