Zap Q-learning with Nonlinear Function Approximation

This code is an implementation of our paper: "Zap Q-learning with Nonlinear Function Approximation. S. Chen, A. Devraj, F. Lu, A. Busic and S. P. Meyn."

It uses neural network to approximate the optimal Q-function and applies our Zap Q-learning algorithm to solve the Cartpole problem. Adaptations of the code to solve other examples in OpenAI gym are straightforward.

Requirements

To install requirements:

pip install -r requirements.txt

📋 Our code is based on Python 2.7, Pytorch 1.4, numpy, OpenAI gym and etc.

Training

To train the model(s) in the paper, run this command:

python zapNN.py

📋 You can modifiy the network structure inside the code. It also provides two types of step-size schedules: decreasing step-sizes and constant step-sizes. Detailed definitions can be found in the paper.

Evaluation

To reproduce the plots in our paper, simply run the following command following Training:

python eval_plot.py

Contributing

📋 All content is licensed under the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
NRF.pdf		NRF.pdf
NRF.png		NRF.png
README.md		README.md
config.py		config.py
eval_plot.py		eval_plot.py
requirements.txt		requirements.txt
zapNN.py		zapNN.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Zap Q-learning with Nonlinear Function Approximation

Requirements

Training

Evaluation

Contributing

About

Uh oh!

Releases

Packages

Languages

License

shuhangchen/ZapQ-NN

Folders and files

Latest commit

History

Repository files navigation

Zap Q-learning with Nonlinear Function Approximation

Requirements

Training

Evaluation

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages