RL in RL: Reinforcement Learning in Rocket League

Overview

Welcome to the RL in RL (Reinforcement Learning in Rocket League) project! The primary aim of this project is to train highly skilled bots using reinforcement learning techniques to challenge and ultimately beat Hazem Mansour (my GC1 friend) in Rocket League.

Frameworks Used

RLBot: This framework allows us to create and control Rocket League bots.
RLgym: A framework designed specifically for training reinforcement learning bots in Rocket League.
PPO (Proximal Policy Optimization): Used as our reinforcement learning algorithm, leveraging hyperparameters and reward systems to optimize bot performance.

Training Process

The training process was conducted using a separate repository with access to a high-performance GPU. This setup allowed for faster and more efficient training of the .pt files (model weights). Once trained, these files can be used on other machines.

Getting Started

Requirements

Before you begin, ensure you have met the following requirements:

Install Python 3.8
Install RLBot GUI

Installation

Download RLBot GUI: Follow the instructions here to download and install the RLBot GUI.
Download the repository: Download the repository zip file and extract it.
Load the project in RLBot GUI: Load the extracted bot into the RLBot GUI using "Load Folder".
Download rlgym Library as the RLBot GUI will instruct you: Click the yellow hazard icon and click install and wait for the terminal to finish installing rlgym.

Usage

Load the trained model weights (already loaded)

In RLBot GUI, ensure that the trained model weights (sam-model.pt files) are correctly loaded.
Start a match: Use the RLBot GUI to start a match.
Play against SamBotV3

Download SamBotV3 and place him in the RLBot directory to challenge the best version of SamBot.

Main Aim

The main objective of this project is to develop a highly skilled bot that can beat Hazem Mansour in Rocket League. Through the use of RLBot, RLgym, and PPO hyperparameters and reward systems, I've created a challenging and competitive bot.

Proximal Policy Optimization (PPO)

PPO is a reinforcement learning algorithm designed to improve the training process. It strikes a balance between simplicity, efficiency, and performance. PPO uses a policy gradient method to optimize the agent's actions by adjusting the policy parameters. The main concept behind PPO is to ensure that updates to the policy do not deviate too much from the previous policy, which helps maintain stability during training.

Reward System

In reinforcement learning, agents learn by interacting with the environment and receiving feedback in the form of rewards. The reward system is crucial in guiding the agent's behavior. Positive rewards encourage desired actions, while negative rewards discourage undesired actions. By designing a well-structured reward system, we can ensure that the agent learns to make decisions that lead to optimal performance in the game.

Acknowledgment

The initial bot code and example setup were forked from GoslingUtils.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
SamBotV2/SamBotV2/SamBot		SamBotV2/SamBotV2/SamBot
SamBotV3/SamBotV3		SamBotV3/SamBotV3
__pycache__		__pycache__
.gitattributes		.gitattributes
.gitignore		.gitignore
ExampleBot.cfg		ExampleBot.cfg
ExampleBot2.cfg		ExampleBot2.cfg
ExampleBot2.py		ExampleBot2.py
README.md		README.md
SamBot.py		SamBot.py
appearance.cfg		appearance.cfg
objects.py		objects.py
routines.py		routines.py
tools.py		tools.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RL in RL: Reinforcement Learning in Rocket League

Overview

Frameworks Used

Training Process

Getting Started

Requirements

Installation

Usage

Main Aim

Proximal Policy Optimization (PPO)

Reward System

Acknowledgment

About

Uh oh!

Releases

Packages

Languages

samalouty/RL-in-RL

Folders and files

Latest commit

History

Repository files navigation

RL in RL: Reinforcement Learning in Rocket League

Overview

Frameworks Used

Training Process

Getting Started

Requirements

Installation

Usage

Main Aim

Proximal Policy Optimization (PPO)

Reward System

Acknowledgment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages