- Modified the original PPO implementation created by Nikhil Barhate to accommodate the Franka Emika Panda robot.
- Installed Panda-Gym
- Adapted the classes to account for the difference in the values returned after each environment step (see the sketch after this list).
- Open `PPO_colab.ipynb` in Google Colab to see the original PPO implementation for Roboschool.
- See this link to review the details (learning rates, episode logging, utils, etc.) of this implementation of PPO.
- New modifications include changes to the reward figure, the rendering, etc.
- No changes to the PPO algorithm itself.
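As a rough sketch of the step-return adaptation mentioned above, the snippet below shows one way the dict observations returned by panda-gym could be flattened, and the two possible step-return formats normalised, before the data reaches the PPO agent. The helper names (`flatten_obs`, `step_env`) and the loop are illustrative assumptions rather than the exact code in this repository; only the `PandaReachDense-v2` environment id comes from the results table below.

```python
import gym
import numpy as np
import panda_gym  # noqa: F401  (registers the Panda environments with gym)


def flatten_obs(obs_dict):
    """panda-gym observations are dicts; concatenate the pieces into one flat vector."""
    return np.concatenate([obs_dict["observation"],
                           obs_dict["achieved_goal"],
                           obs_dict["desired_goal"]])


def step_env(env, action):
    """Normalise the step return across gym API versions (4-tuple vs 5-tuple)."""
    out = env.step(action)
    if len(out) == 5:                      # newer gym: obs, reward, terminated, truncated, info
        obs, reward, terminated, truncated, info = out
        done = terminated or truncated
    else:                                  # classic gym: obs, reward, done, info
        obs, reward, done, info = out
    return flatten_obs(obs), reward, done, info


env = gym.make("PandaReachDense-v2")
reset_out = env.reset()
obs = flatten_obs(reset_out[0] if isinstance(reset_out, tuple) else reset_out)

for _ in range(10):
    action = env.action_space.sample()     # stand-in for the PPO policy's action
    obs, reward, done, info = step_env(env, action)
    if done:
        reset_out = env.reset()
        obs = flatten_obs(reset_out[0] if isinstance(reset_out, tuple) else reset_out)
env.close()
```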
- To train a new network: run `train.py`
- To test a preTrained network: run `test.py`
- To plot graphs using log files: run `plot_graph.py`
- To save images for a gif and make the gif using a preTrained network: run `make_gif.py`
- All parameters and hyperparameters to control training / testing / graphs / gifs are in their respective `.py` files.
- `PPO_colab.ipynb` combines all the files in a Jupyter notebook.
- All the hyperparameters used for training the preTrained policies are listed in the `README.md` in the `PPO_preTrained` directory.
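For orientation, the block below sketches the kind of hyperparameter variables typically defined at the top of a PPO training script in this family of implementations. The names and values are illustrative assumptions only; the authoritative settings are the ones listed in `PPO_preTrained/README.md` and in the `.py` files themselves.

```python
# Illustrative PPO hyperparameters (assumed names and values, not this repository's exact settings)
env_name = "PandaReachDense-v2"    # environment from the results table below
has_continuous_action_space = True

max_ep_len = 1000                  # max timesteps per episode
max_training_timesteps = int(3e6)  # total environment steps for the run
update_timestep = max_ep_len * 4   # run a PPO update every n timesteps

K_epochs = 80                      # optimisation epochs per PPO update
eps_clip = 0.2                     # PPO clipping parameter
gamma = 0.99                       # discount factor
lr_actor = 0.0003                  # learning rate for the actor network
lr_critic = 0.001                  # learning rate for the critic network
action_std = 0.6                   # initial std of the continuous action distribution
```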
- If the environment runs on the CPU, use the CPU as the device; training CPU-bound environments on the GPU is typically slower because of the per-step data transfer between devices.
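A minimal sketch of how the training device might be selected in PyTorch; the `use_cpu_only` flag and variable names are assumptions for illustration, not this repository's exact code:

```python
import torch

# Force the CPU for CPU-bound simulators (e.g. the PyBullet-based panda-gym tasks);
# otherwise fall back to CUDA when a GPU is available.
use_cpu_only = True

if not use_cpu_only and torch.cuda.is_available():
    device = torch.device("cuda:0")
else:
    device = torch.device("cpu")

print("Training device:", device)
```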
Please use this bibtex if you want to cite this repository in your publications:

    @misc{ppo_panda,
        author = {Lobbezoo, Andrew},
        title = {PyTorch Implementation of Proximal Policy Optimization for the OpenAI Panda},
        year = {2022},
        publisher = {GitHub},
        journal = {GitHub repository},
        howpublished = {\url{https://github.com/alobbezoo/PPO-Panda}},
    }
PPO Continuous PandaReachDense-v2 | PPO Continuous PandaReachDense-v2
---|---
![]() | ![]()
Trained and Tested on:

- Python 3
- PyTorch
- NumPy
- gym

Training Environments:

- gym

Graphs and gifs:

- pandas
- matplotlib
- Pillow
- pyvirtualdisplay
- python-opengl