You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## Reproducing Advantage Actor Critic (A2C) Algorithm Results
2
+
3
+
This repository contains scripts that enable training agents using the Advantage Actor Critic (A2C) Algorithm on MuJoCo and Atari environments. We follow the original paper [Asynchronous Methods for Deep Reinforcement Learning](https://arxiv.org/abs/1602.01783) by Mnih et al. (2016) to implement the A2C algorithm but fix the number of steps during the collection phase.
4
+
5
+
6
+
## Examples Structure
7
+
8
+
Please note that each example is independent of each other for the sake of simplicity. Each example contains the following files:
9
+
10
+
1.**Main Script:** The definition of algorithm components and the training loop can be found in the main script (e.g. a2c_atari.py).
11
+
12
+
2.**Utils File:** A utility file is provided to contain various helper functions, generally to create the environment and the models (e.g. utils_atari.py).
13
+
14
+
3.**Configuration File:** This file includes default hyperparameters specified in the original paper. Users can modify these hyperparameters to customize their experiments (e.g. config_atari.yaml).
15
+
16
+
17
+
## Running the Examples
18
+
19
+
You can execute the A2C algorithm on Atari environments by running the following command:
20
+
21
+
```bash
22
+
python a2c_atari.py
23
+
```
24
+
25
+
You can execute the A2C algorithm on MuJoCo environments by running the following command:
0 commit comments