MuZero for Jass

MuZero is a model based reinforcement learning method using deep learning and therefore requires training. The data is generated either through self-play or reanalysing existing data (tfrecord format as described in jass-ml-py repo.

Setup

Install the package

$ pip install -v -e .

Run tests to verify local setup

$ sjmz (--nodocker) --test

And finally start the container hosting the baselines with

$ sjmz --baselines

Training

The MuZero training process is implemented in a distributed manner. The trainer is the master which will gather all the data, train the networks and evaluate them asynchronously on different metrics. In the folder resources/data_collectors there are different compose files for different machines to host data collectors. They all assume that the master container is running on the ws03 machine (IP: 10.180.39.13). If this would not be the case, the IP in the files must be adapted. To start the training process first run

$ sjmz --attach train --file experiments/experiment-1.json

and wait until the flask server started hosting. Then start the data collectors on the respective machines

$ sjmz collect --machine (gpu03|e01|...)

available configurations are stored at resources/data_collectors. The collectors should then register them on the master container and start to collect data. Once the replay buffer has been filled, the optimization procedure will start and the corresponding metrics will be logged to wandb.ai at the configured location.

Evaluate

To evaluate run

$ sjmz eval --files experiment-1/dmcts.json experiment-1/policy-B.json

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
extern		extern
jass_mu_zero		jass_mu_zero
resources		resources
test		test
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
entrypoint.sh		entrypoint.sh
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MuZero for Jass

Setup

Training

Evaluate

About

Uh oh!

Releases

Packages

Languages

andrinbuerli/jass-mu-zero

Folders and files

Latest commit

History

Repository files navigation

MuZero for Jass

Setup

Training

Evaluate

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages