Experimental Evaluation of the Generalizability of the Soft Actor-Critic + Behavioral Cloning Algorithm
This repository is part of the Research Project CSE3000 in 2024 @ TU Delft. Other projects can be found here.
The research questions were:
- Can SAC combined with BC effectively generalize to new tasks within a multi-task reinforcement learning environment?
- What characteristics of the offline dataset are critical for the success or failure of SAC+BC in such settings?
pip install -r requirements.txt
Follow the instructions in the main() function in results.py.
The environment used is MiniGrid Four-Room, which contains 3 configurations: one is the training configuration, and the other two are testing configurations for reachable and unreachable tasks. Every configuration contains 40 tasks.
The datasets have been created with an Optimal, Suboptimal and Mixed policy.
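As a rough illustration of the Mixed dataset, the sketch below combines transitions from an optimal and a suboptimal source into one shuffled dataset. It is not taken from this repository: the function name `mix_datasets`, the `optimal_fraction` parameter, and the default 50/50 split are all assumptions made for the example, and the actual mixing ratio used in the project may differ.

```python
import random


def mix_datasets(optimal, suboptimal, optimal_fraction=0.5, seed=0):
    """Blend two lists of transitions into one shuffled list.

    Hypothetical helper: the name, the parameters, and the default
    50/50 ratio are illustrative assumptions, not the project's code.
    """
    if not 0.0 <= optimal_fraction <= 1.0:
        raise ValueError("optimal_fraction must be between 0 and 1")

    rng = random.Random(seed)  # local generator keeps the result reproducible

    # Keep the mixed dataset no larger than the smaller source so that
    # sampling without replacement is always possible.
    size = min(len(optimal), len(suboptimal))
    n_optimal = round(size * optimal_fraction)

    mixed = rng.sample(optimal, n_optimal) + rng.sample(suboptimal, size - n_optimal)
    rng.shuffle(mixed)  # avoid all optimal transitions coming first
    return mixed
```

Calling `mix_datasets(optimal_transitions, suboptimal_transitions)` with two equally sized lists would return a list of the same size, half drawn from each source. Fixing the seed keeps the sampled dataset identical across runs, which helps when comparing training results.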