Contextual DMControl Benchmark

Benchmark for generalization in continuous control from pixels, based on DMControl Generalization Benchmark. It is adapted to allow for direct control of the contexts by supplying at the start of an experiment a training set and testing set of colours, video backgrounds and initial physics engine states.

Contexts

The DMControl Generalization Benchmark provides full control for creating benchmarks for visual generalization to random colors, video backgrounds and initial states.

Example colors and video backgrounds can be found in cdmc/env/data/ and cdmc/generate_contexts.py shows an example of how to create training and testing sets.

Using an empty context set (in empty.json) will run on the default colors and background and sample initial physics states from the full distribution (default DMC behaviour).

Algorithms

This repository contains implementations of the following algorithms in a unified framework:

using standardized architectures and hyper-parameters, wherever applicable.

Setup

First install mujoco dependency:

Download old version of mujoco and paste it in your home .mujoco folder
Download free mujoco license and paste it in your home .mujoco folder

Then, we assume that you have access to a GPU with CUDA >=9.2 support. All dependencies can then be installed with the following commands:

conda env create -f setup/conda.yaml
conda activate dmcgb
sh setup/install_envs.sh

Datasets

Part of this repository relies on external datasets. SODA uses the Places dataset for data augmentation, which can be downloaded by running

wget http://data.csail.mit.edu/places/places365/places365standard_easyformat.tar

The video_easy data was proposed in PAD, and the video_hard data uses a subset of the RealEstate10K dataset for background rendering. All test environments (including video files) are included in this repository, namely in the cdmc/env/ directory.

Training & Evaluation

The scripts directory contains training and evaluation bash scripts for all the included algorithms. Alternatively, you can call the python scripts directly, e.g. for training call

python3 cdmc/train.py \
  --algorithm sac \
  --seed 0 \
  --train_context_file empty.json \
	--test_context_file empty.json

to run SAC on the default task, walker_walk.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
cdmc		cdmc
figures		figures
scripts		scripts
setup		setup
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
empty.json		empty.json
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Contextual DMControl Benchmark

Contexts

Algorithms

Setup

Datasets

Training & Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

MWeltevrede/contextual-dmcontrol

Folders and files

Latest commit

History

Repository files navigation

Contextual DMControl Benchmark

Contexts

Algorithms

Setup

Datasets

Training & Evaluation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages