The goal of this project is to train an agent to control a robotic arm so that it stays close to a floating ball.
A reward of +0.1 is given for each time step that the arm is within the region of the floating ball. The environment is considered solved when the agent earns an average reward greater than 30.0 over 100 consecutive episodes of 1000 steps each.
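For reference, here is a minimal sketch of how the solved check could be implemented, assuming per-episode scores are collected in a list (the names here are illustrative, not taken from the repo):

```python
import numpy as np
from collections import deque

def is_solved(episode_scores, window=100, threshold=30.0):
    """Return True once the mean score over the last `window` episodes exceeds `threshold`."""
    scores_window = deque(maxlen=window)
    for score in episode_scores:
        scores_window.append(score)
        if len(scores_window) == window and np.mean(scores_window) > threshold:
            return True
    return False
```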
The environment is provided by Unity, a company that specializes in building worlds that can be used for video game development, simulation, animation, and architecture/design. The following is the description of the state space and actions available to the agent:
The observation space consists of 33 variables corresponding to the position, rotation, velocity, and angular velocity of the arm. Each action is a vector of four numbers, corresponding to the torque applied to the arm's two joints. Every entry in the action vector must be a number between -1 and 1.
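Because the environment expects every action component in [-1, 1], a policy's raw output is typically clipped before being passed to the environment. A minimal sketch using NumPy (the repo may instead clip inside the agent):

```python
import numpy as np

def to_valid_action(raw_action):
    """Clip a raw 4-dimensional action vector into the [-1, 1] range the environment expects."""
    return np.clip(np.asarray(raw_action, dtype=np.float32), -1.0, 1.0)

print(to_valid_action([1.7, -0.2, 0.05, -3.1]))  # -> [ 1.   -0.2   0.05 -1.  ]
```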
- Download the x64 Windows environment for the single agent from here.
- The code expects the `Reacher.exe` file to be located in the following directory of the repo: `./Reacher_Windows_x86_64_SingleAgent/Reacher.exe` (see the smoke-test sketch after this list).
- Create a new conda environment with the provided `requirements.txt` file, e.g. `conda create --name <env-name> --file requirements.txt` (replace `<env-name>` with a name of your choice).
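To verify the setup, a quick smoke test along these lines should work. It assumes `requirements.txt` installs the `unityagents` package, whose `UnityEnvironment` loader is used in the Udacity deep-reinforcement-learning projects; adapt if the repo loads the environment differently:

```python
from unityagents import UnityEnvironment

# Point at the downloaded executable, at the path the repo expects.
env = UnityEnvironment(file_name="./Reacher_Windows_x86_64_SingleAgent/Reacher.exe")
brain_name = env.brain_names[0]

# Reset once and confirm the sizes match the description above.
env_info = env.reset(train_mode=True)[brain_name]
print("observation size:", env_info.vector_observations.shape[1])       # expected: 33
print("action size:", env.brains[brain_name].vector_action_space_size)  # expected: 4
env.close()
```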
The code may be run using the command `python continuous_control.py <config.json file> [network_file.pth]`. The network file is an optional parameter; if provided, the network weights are loaded from it before the run.
Example: `python continuous_control.py config.json`
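As an illustration only (the actual `continuous_control.py` may differ), such an entry point could parse its arguments like this:

```python
import json
import sys

def parse_args(argv):
    """Read the required config path and the optional network checkpoint path."""
    if len(argv) < 2:
        sys.exit("usage: python continuous_control.py <config.json file> [network_file.pth]")
    with open(argv[1]) as f:
        config = json.load(f)
    network_file = argv[2] if len(argv) > 2 else None  # optional pre-trained weights
    return config, network_file

if __name__ == "__main__":
    config, network_file = parse_args(sys.argv)
    print("train_mode:", config.get("train_mode"), "| weights:", network_file)
```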
The arguments to the program are provided using a .json file. See the `utilities/config.py` file for the default parameters. Setting `train_mode: true` will train the agent; setting `train_mode: false` will simply run the agent for a single episode without training the networks.
```json
{
    "env_name": "reacher",
    "train_mode": true,
    "device": "cuda",
    "actor_learning_rate": 1e-4,
    "critic_learning_rate": 1e-4,
    "batch_size": 8,
    "gamma": 0.99,
    "tau": 1e-3
}
```
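The separate actor and critic learning rates together with `tau` suggest an actor-critic agent with target networks, such as DDPG (an assumption; the repo defines the actual algorithm). In that setting `gamma` is the reward discount factor and `tau` controls how slowly the target networks track the local networks. A PyTorch-style sketch of the soft update that `tau` governs:

```python
import torch.nn as nn

def soft_update(local_net, target_net, tau):
    """Soft update: theta_target <- tau * theta_local + (1 - tau) * theta_target."""
    for target_param, local_param in zip(target_net.parameters(), local_net.parameters()):
        target_param.data.copy_(tau * local_param.data + (1.0 - tau) * target_param.data)

# With tau = 1e-3 from the config, the target network drifts very slowly toward the local one.
local, target = nn.Linear(33, 4), nn.Linear(33, 4)
soft_update(local, target, tau=1e-3)
```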