We apply MAPPO in Overcooked environment using Gym API. Each strategy we found takes 30~40 steps to deliver a soup in each layout.
Run 'main.py' and the results are rendered in console. Set the 'layout_opt' to be 1~5 at the beginning of 'main.py', which correspond to the 5 layouts we've trained.
- cramped room
- asymmetric advantages
- coordination ring
- forced coordination
- counter circuit
Featurization coding and shaped reward are given in 'overcooked-mappo/overcooked_ai_py/mdp/overcooked_mdp.py'.