Install Miniconda3, then set up your environment:
```bash
conda create --name ddrl-project python=3.10
conda activate ddrl-project
```
On CUDA CIMS, use `/scratch` for installing packages and saving checkpoints. On Greene, use `/scratch/<NET-ID>` for the same.
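For checkpoints, a minimal sketch of pointing the learner at scratch storage (the `checkpoints_save_folder` argument and the `build_rocketsim_env` name are assumptions; check the actual `Learner` call in `train.py`):

```python
# Sketch only: checkpoints_save_folder is assumed to be the rlgym-ppo Learner
# argument for the checkpoint directory, and build_rocketsim_env stands in for
# the existing env-build function in train.py.
from rlgym_ppo import Learner

learner = Learner(
    build_rocketsim_env,
    checkpoints_save_folder="/scratch/<NET-ID>/checkpoints",  # use /scratch/... on CUDA CIMS
)
learner.learn()
```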
Ensure the pip you're using points to your conda environment:
```bash
which pip
pip install --cache-dir=/scratch -r requirements.txt
```
There are two training modes: single GPU and multi-GPU with DDP.
Before training, set the authorization key as an environment variable inside `train.py` or `train_ddp.py`.
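A minimal sketch, assuming the key in question is a Weights & Biases API key (adjust if your key is something else):

```python
# Sketch only: assumes the authorization key is a Weights & Biases API key.
# Place this near the top of train.py / train_ddp.py, before training starts.
import os

os.environ["WANDB_API_KEY"] = "<your-api-key>"  # placeholder; never commit a real key
```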
- CUDA CIMS: Use `screen` to run and detach (press `Ctrl+A`, then `D`)
- Greene: Use `sbatch` to submit jobs
```bash
cd train

# Single GPU
python train.py

# Multi-GPU DDP
python train_ddp.py
```
For now, prefer using single GPU (`train.py`).

To use `cuda:1`:

```python
Learner(device="cuda:1")
```
To run multi-GPU DDP, replace the installed `rlgym_ppo` package with the local `rlgym-ppo` copy in editable mode:

```bash
pip uninstall rlgym_ppo
cd rlgym-ppo
pip install -e .
```

Then run:

```bash
python train_ddp.py
```
Reward functions are located in the `rewards` directory. To use a specific reward, import it in `train.py` or `train_ddp.py`.
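A hypothetical sketch of wiring a reward into training; the module path, class name, and `build_rocketsim_env` are placeholders for whatever actually exists in `rewards/` and `train.py`:

```python
# Hypothetical names throughout; adapt to the actual files in the rewards directory.
from rewards.speed_toward_ball import SpeedTowardBallReward  # assumed module and class

def build_rocketsim_env():
    # ... existing obs builder, action parser, and terminal conditions ...
    reward_fn = SpeedTowardBallReward()  # swap in the reward you want to train with
    # pass reward_fn wherever the env constructor expects its reward function
    ...
```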
```bash
# Install Rust from https://www.rust-lang.org/tools/install

# Clone and build
git clone https://github.com/VirxEC/rlviser.git
cd rlviser
git checkout v0.7.17
rustup install nightly
cargo +nightly build --release -Z build-std=std --target aarch64-apple-darwin

# Copy executable
cp target/aarch64-apple-darwin/release/rlviser ../eval
```
Download the weights from the training cluster to your local machine, then change lines 73-74 in `visual_bot_match.py` to point to the policy weights checkpoint directory on your local machine.
```bash
cd eval
python visual_bot_match.py
```
Used for simulating matches between two bots to compare their performance based on cumulative rewards and goals scored.
```bash
cd eval
python simulate_bot_match.py
```
You can test the reward functions by playing manually as the blue agent.
Controls:

- `W`, `A`, `S`, `D`: Movement
- `Space`: Jump
- `Left Shift`: Boost
- `Q`, `E`: Roll
- `X`: Handbrake
```bash
cd eval
python human_match_improved.py
```
Challenge any of the bots by loading its agent weights and playing against it.
```bash
cd eval
python human_vs_bot.py
```
- Add metrics to `simulate_bot_match.py` for publishing to Weights & Biases (wandb); see the sketch below
- (Srivats) Send updated `Learner` script for checkpoint save/load with wandb only
- Brainstorm and implement new reward functions to improve performance
- Train using your custom reward
- Visualize or simulate your model every 500 steps on wandb; adjust reward weights based on insights
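For the metrics item, a minimal sketch of logging per-match results to wandb; the metric names, `num_episodes`, and `run_match` are placeholders for whatever `simulate_bot_match.py` already computes:

```python
# Sketch for simulate_bot_match.py; metric names and the match loop are placeholders.
import wandb

wandb.init(project="ddrl-project", name="bot-vs-bot-eval")  # assumed project/run names

for episode in range(num_episodes):  # num_episodes: assumed existing loop bound
    # run_match is a hypothetical helper standing in for the existing match loop
    blue_reward, orange_reward, blue_goals, orange_goals = run_match(episode)
    wandb.log({
        "eval/blue_cumulative_reward": blue_reward,
        "eval/orange_cumulative_reward": orange_reward,
        "eval/blue_goals": blue_goals,
        "eval/orange_goals": orange_goals,
    })

wandb.finish()
```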