A summary that briefly touches upon Q-learning
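For background: tabular Q-learning maintains a table of state-action values and nudges it after every step. Below is a minimal, illustrative sketch of the update rule only; the actual training loop, hyperparameters, and exploration schedule live in `main.py` and may differ.

```python
import numpy as np

def q_update(Q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning update on a (states x actions) numpy table.
    The alpha/gamma defaults here are illustrative, not the project's values."""
    best_next = np.max(Q[next_state])                      # value of the greedy next action
    td_target = reward + gamma * best_next                 # bootstrapped target
    Q[state, action] += alpha * (td_target - Q[state, action])
    return Q
```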
Requires Python 3
$ pip install -r requirements.txt
$ python main.py
The training generates a CSV containing the Q-table, named `alpha_{alpha_value}_gamma_{gamma_value}_score_{score}__{timestamp}.csv`.
An example is included in the examples folder of this repository.
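For illustration, a Q-table could be written out with that naming scheme roughly as follows. This is a sketch only; the use of pandas and the column layout are assumptions, and the real saving code in `main.py` may differ.

```python
import time
import pandas as pd

def save_qtable(Q, alpha, gamma, score):
    """Hypothetical helper that mirrors the filename pattern above."""
    timestamp = int(time.time())
    filename = f"alpha_{alpha}_gamma_{gamma}_score_{score}__{timestamp}.csv"
    pd.DataFrame(Q).to_csv(filename, index=False)
    return filename
```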
$ python test.py <path-to-qtable.csv>
- The taxi agent generally trains to a best average reward of ~9 within 50k episodes.
- The taxi agent achieves an average reward of 8.5+ over a test of 100 episodes (a sketch of such an evaluation follows).
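A minimal sketch of such an evaluation, loading a saved Q-table and acting greedily for 100 episodes, is shown below. The environment id, the old Gym reset/step API, and the CSV layout are assumptions; `test.py` may be implemented differently.

```python
import gym
import numpy as np
import pandas as pd

def evaluate(qtable_path, episodes=100):
    Q = pd.read_csv(qtable_path).to_numpy()   # assumes a (states x actions) table
    env = gym.make("Taxi-v2")                 # "Taxi-v1" in older Gym releases
    total = 0.0
    for _ in range(episodes):
        state, done = env.reset(), False
        while not done:
            action = int(np.argmax(Q[state]))            # greedy action from the Q-table
            state, reward, done, _ = env.step(action)
            total += reward
    return total / episodes

# average = evaluate("<path-to-qtable.csv>")  # compare against Gym's 9.7 "solved" threshold
```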
+---------+
|R: | : :G|
| : : : : |
| : : : : |
| | : | : |
|Y| : |B: |
+---------+
- The map is a `5x5` gridworld.
- The letters `R`, `G`, `B`, `Y` mark 4 locations.
- A passenger can be at any of the 4 locations.
- A passenger's destination can be any of the remaining 3 locations.
- The pipe symbol `|` denotes a wall.
- The colon symbol `:` denotes a passage.
- The taxi can pass through `:` but not `|`.
- The environment rewards `20` points when the passenger is dropped off at their destination.
- The environment gives a reward of `-10` if a pickup is attempted on a cell with no passenger.
- The environment gives a reward of `-10` if a drop-off is attempted when no passenger has boarded the taxi.
- The environment gives a reward of `-1` for every other action (see the interaction sketch after this list).
- At the start, the taxi will be at any of the 25 positions on the map (from Environment description[1]).
- The passenger will be at one of the `R`, `G`, `B`, `Y` locations.
- The destination will be one of the `R`, `G`, `B`, `Y` locations.
- The taxi must reach the passenger by travelling the shortest path.
- The taxi must pick up the passenger.
- The taxi must find the shortest path to the passenger's destination.
- The taxi must drop the passenger off at their destination, traversing the shortest path.
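The rules above can be checked interactively with a few lines of Gym code. A minimal sketch follows; the environment id is an assumption (older Gym releases expose `Taxi-v1`, newer ones `Taxi-v2`/`Taxi-v3` with the same rules).

```python
import gym

env = gym.make("Taxi-v2")   # "Taxi-v1"/"Taxi-v3" depending on the Gym version
state = env.reset()         # taxi position, passenger location and destination are randomized
env.render()                # prints the 5x5 map shown above

# Actions: 0=south, 1=north, 2=east, 3=west, 4=pickup, 5=dropoff
state, reward, done, info = env.step(4)
print(reward)               # -10 if no passenger is at the taxi's cell, otherwise -1
```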
OpenAI Gym defines "solving" the Taxi-v1 task as getting an average return of 9.7 over 100 consecutive trials.