This repository contains two projects:
- Using SARSA with linear function approximation to solve the Mountain Car problem.
- Using Actor Critic to solve the Continuous Mountain Car problem.
Mountain Car is one of the most popular reinforcement learning test environments. The agent must learn to use the momentum gained by rolling down the hills to reach the goal. It has a continuous state space with a discrete set of actions (left, right, and do nothing). I used a linear combination of a feature vector and a set of weights to approximate the state-action value function Q. The state samples were transformed into a higher-dimensional space using an approximation of an RBF kernel, allowing for a non-linear value function.
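A minimal sketch of this setup, assuming the classic gym `MountainCar-v0` environment and scikit-learn's `RBFSampler` for the approximate RBF feature map (the hyperparameters and helper names below are illustrative, not necessarily the ones used in mountaincar.py):

```python
import numpy as np
import gym
from sklearn.kernel_approximation import RBFSampler
from sklearn.preprocessing import StandardScaler

env = gym.make("MountainCar-v0")

# Fit the scaler and approximate RBF feature map on sampled states
# (sample count, gamma, and n_components are hypothetical choices).
samples = np.array([env.observation_space.sample() for _ in range(10000)])
scaler = StandardScaler().fit(samples)
rbf = RBFSampler(gamma=1.0, n_components=100).fit(scaler.transform(samples))

def featurize(state):
    """Map a raw (position, velocity) state to the RBF feature space."""
    return rbf.transform(scaler.transform([state]))[0]

# One weight vector per discrete action; Q(s, a) is a dot product.
weights = np.zeros((env.action_space.n, rbf.n_components))

def q(phi_s, action):
    return weights[action] @ phi_s

# SARSA update for the linear approximator (alpha and gamma are example values).
alpha, gamma = 0.01, 0.99

def sarsa_update(phi_s, a, r, phi_s_next, a_next, done):
    target = r if done else r + gamma * q(phi_s_next, a_next)
    td_error = target - q(phi_s, a)
    weights[a] += alpha * td_error * phi_s
```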
Run mountaincar.py; comment out (or restore) the env.render() call to toggle visualization. I have also included optional methods for gradient checking and for plotting action choices, the value function, and rewards.
It would take little effort to turn this into a Q-Learning solution: only the TD target changes, as sketched below.
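Reusing the names from the sketch above, the off-policy variant would replace the on-policy SARSA target (which uses the action actually taken next) with a max over actions:

```python
def q_learning_update(phi_s, a, r, phi_s_next, done):
    # Q-Learning target: bootstrap from the greedy action rather than the one taken.
    best_next = max(q(phi_s_next, b) for b in range(env.action_space.n))
    target = r if done else r + gamma * best_next
    td_error = target - q(phi_s, a)
    weights[a] += alpha * td_error * phi_s
```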
For the continuous environment, the agent tends to converge to the local optimum of choosing not to move. This is because the agent receives a negative reward for each action taken, and a reward of 0 is better than a reward of -99. The agent only succeeds when it happens to discover the optimal policy of reaching the flag in the first episode. I will have to come back to this project when I have a better exploration strategy.
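One possible direction for such a strategy (an illustration only, not part of the repo's current code): add zero-mean Gaussian noise to the actor's output and decay its scale over episodes, so early episodes explore widely enough to occasionally reach the flag. The function name and parameters below are hypothetical; the action range of [-1, 1] matches `MountainCarContinuous-v0`.

```python
import numpy as np

def noisy_action(actor_mean, episode, sigma0=0.5, decay=0.995,
                 low=-1.0, high=1.0):
    """Gaussian exploration around the actor's mean action,
    with a noise scale that decays as training progresses."""
    sigma = sigma0 * (decay ** episode)
    action = actor_mean + np.random.normal(0.0, sigma)
    return np.clip(action, low, high)
```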