Find the paper here: Simple random search provides a competitive approach to reinforcement learning
-
‘Simple random search provides a competitive approach to reinforcement learning’, by Horia Mania, Aurelia Guy, and Benjamin Recht, University of California, Berkeley, March 20, 2018.
-
Two different directions have been proposed for simplifying RL:
- Evolution Strategies: a derivative-free policy optimization method, used to train policies faster (see the sketch after this list)
- Training linear policies via natural policy gradients to solve continuous control problems
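For intuition, here is a minimal sketch of the evolution-strategies-style, derivative-free idea: perturb the policy weights randomly, evaluate the resulting rewards, and step along the reward-weighted perturbations. The names `reward_fn`, `theta`, `sigma`, `lr`, and `n_dirs` are illustrative, not taken from the paper or the notebook.

```python
import numpy as np

# Minimal sketch of an evolution-strategies-style, derivative-free update.
# All names (reward_fn, theta, sigma, lr, n_dirs) are illustrative.
def es_update(reward_fn, theta, sigma=0.05, lr=0.02, n_dirs=8):
    """One gradient-free update: move theta toward high-reward perturbations."""
    deltas = [np.random.randn(*theta.shape) for _ in range(n_dirs)]
    rewards = [reward_fn(theta + sigma * d) for d in deltas]
    # A reward-weighted average of the sampled directions approximates an
    # ascent direction on the (smoothed) reward landscape.
    step = sum(r * d for r, d in zip(rewards, deltas)) / (n_dirs * sigma)
    return theta + lr * step
```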
-
ARS combines the best of both to obtain the simplest model-free RL method yet: a derivative-free optimization algorithm for training linear policies, sketched below.
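The update itself fits in a few lines. This sketch follows the paper's ARS V1 update (antithetic perturbations, step scaled by the standard deviation of the collected rewards), assuming a hypothetical `rollout_reward(weights)` helper that runs one episode and returns its total reward; the hyperparameter values are illustrative, not the paper's.

```python
import numpy as np

# Sketch of one ARS (V1) update step, assuming a hypothetical
# rollout_reward(weights) that runs an episode and returns its total reward.
def ars_step(rollout_reward, M, step_size=0.02, noise=0.03, n_directions=16):
    """Update the linear policy matrix M using +/- random perturbations."""
    deltas = [np.random.randn(*M.shape) for _ in range(n_directions)]
    # Evaluate each direction in both senses (antithetic sampling).
    r_pos = np.array([rollout_reward(M + noise * d) for d in deltas])
    r_neg = np.array([rollout_reward(M - noise * d) for d in deltas])
    # Scale the step by the standard deviation of all collected rewards,
    # as in the paper's ARS V1.
    sigma_r = np.concatenate([r_pos, r_neg]).std() + 1e-8
    step = sum((rp - rn) * d for rp, rn, d in zip(r_pos, r_neg, deltas))
    return M + step_size / (n_directions * sigma_r) * step
```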
-
This simple random search method can match or exceed state-of-the-art sample efficiency on the MuJoCo locomotion benchmarks.
-
ARS is 15 times more computationally efficient than Evolution Strategies, the fastest competing method, which itself employs several complicated algorithmic elements.
-
The method learns static, linear policies that achieve high rewards on all MuJoCo tasks. No neural networks are used, yet state-of-the-art performance is still uniformly achieved. A sketch of such a policy follows.
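Concretely, a linear policy is just a matrix-vector product applied to a normalized observation (the paper's V2 variant maintains running mean and variance statistics of the visited states). This is a minimal sketch; `M`, `mean`, and `std` are illustrative names for quantities produced during training.

```python
import numpy as np

# Minimal sketch of a static linear policy with state normalization
# (as in the paper's ARS V2). M, mean, and std come from training;
# the names are illustrative.
def linear_policy(M, state, mean, std):
    """Map an observed state to an action with a single linear map."""
    return M @ ((state - mean) / (std + 1e-8))
```

In the standard Gym HalfCheetah environment, for instance, the observation is 17-dimensional and the action 6-dimensional, so M is a 6×17 matrix: roughly a hundred parameters in total, versus thousands for even a small neural network.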
-
Although ARS successfully trains policies for the locomotion tasks when hyperparameters and random seeds are varied, it exhibits large variance across runs, and the learned policies do not uniformly yield high rewards.
-
A detailed explanation, with a presentation, can be found in the file 'Augmented Random Search(1).ppt'.
-
The code is thoroughly commented for easy understanding in the file 'ARS- Half Cheetah.ipynb'.