Transfer-Learning-in-RL

I did this project under the guidance of Srijita Das and Prof. Matthew Taylor in the Intelligent Robot Learning Laboratory (https://irll.ca/team/) at the University of Alberta. I work on improving RL through prior knowledge and on transfer learning in RL. A major bottleneck of RL algorithms is their sample inefficiency. To make RL algorithms deployable in practical settings, we need sample-efficient algorithms that generalise well in the real world.

Setup and Installation

Run bash setup.sh to create the directories needed to store experiment results.
Install the pip dependencies with pip install -r req.txt to run the experiments on a local computer. If you prefer to run the experiments on Colab, this step is not needed.

Experiments Done

All source code is made available in Jupyter Notebooks, so that plots may be visualized.

DQN CartPole
DQN LunarLander
DDPG ContinuousCartPole
DDPG LunarLanderContinuous
REINFORCE CartPole
REINFORCE LunarLander

I tried to cover three types of RL algorithms: value-based (DQN), policy-gradient (REINFORCE), and actor-critic (DDPG), to show that transfer learning works in all three algorithmic settings. To check that the results do not depend on a single environment, I ran each algorithm on two standard benchmark environments, OpenAI Gym CartPole and OpenAI Gym LunarLander. In these experiments the transfer is between identical MDPs, which makes the process of transfer straightforward. I have also tried one experiment in which I transferred the policy from the CartPole env to the LunarLander env, which have different MDPs governing their dynamics.
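To illustrate what transfer between identical MDPs means in the simplest value-based case, here is a minimal sketch that warm-starts tabular Q-learning from a previously learned Q-table. It is not the code from the notebooks, which use neural networks on Gym environments: the toy chain environment and the names make_chain and q_learning are assumptions made up for this example.

```python
import random

def make_chain(n_states=6):
    """Toy chain MDP (hypothetical): action 1 moves right, action 0 moves left.
    Reaching the rightmost state gives reward 1 and ends the episode."""
    def step(state, action):
        nxt = min(state + 1, n_states - 1) if action == 1 else max(state - 1, 0)
        done = nxt == n_states - 1
        return nxt, (1.0 if done else 0.0), done
    return n_states, step

def q_learning(n_states, step, episodes, q_init=None, alpha=0.5,
               gamma=0.9, eps=0.1, seed=0, max_steps=10_000):
    """Epsilon-greedy tabular Q-learning. Passing q_init warm-starts the table,
    which is the transfer step when source and target MDPs are identical."""
    rng = random.Random(seed)
    if q_init is not None:
        q = [row[:] for row in q_init]          # copy so the source table is untouched
    else:
        q = [[0.0, 0.0] for _ in range(n_states)]
    steps_per_episode = []
    for _ in range(episodes):
        s, t, done = 0, 0, False
        while not done and t < max_steps:
            # Explore with probability eps, or when the two actions are tied.
            if rng.random() < eps or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1
            s2, r, done = step(s, a)
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s, t = s2, t + 1
        steps_per_episode.append(t)
    return q, steps_per_episode

n, step = make_chain()
source_q, _ = q_learning(n, step, episodes=200)             # train on the source task
_, scratch = q_learning(n, step, episodes=1, eps=0.0, seed=1)
_, warm = q_learning(n, step, episodes=1, q_init=source_q, eps=0.0, seed=1)
print("first-episode steps from scratch:", scratch[0])
print("first-episode steps with transfer:", warm[0])
```

With the transferred table the agent already prefers moving right in every state, so its first episode takes the shortest path, while the agent trained from scratch starts with a random walk.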

More work on transfer between non-identical MDPs coming soon...
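When the two MDPs differ, as with CartPole (4 observation dimensions, 2 actions) and LunarLander (8 observation dimensions, 4 discrete actions), the network shapes no longer match at the input and output. The README does not say how the CartPole-to-LunarLander transfer was done, so the following is only one possible approach, sketched under assumptions: copy the layers whose shapes agree and leave the rest freshly initialised. The hidden width of 64 and the helper names init_mlp and transfer_matching_layers are hypothetical.

```python
import random

def init_mlp(sizes, seed=0):
    """Build a list of layers; each layer holds a weight matrix W (out x in) and a bias b."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        bound = 1.0 / n_in ** 0.5
        W = [[rng.uniform(-bound, bound) for _ in range(n_in)] for _ in range(n_out)]
        layers.append({"W": W, "b": [0.0] * n_out})
    return layers

def shape(layer):
    """Return (rows, columns) of a layer's weight matrix."""
    return (len(layer["W"]), len(layer["W"][0]))

def transfer_matching_layers(source, target):
    """Copy source layers into the target wherever the shapes agree.
    Returns the indices of the layers that were copied."""
    copied = []
    for i, (src, tgt) in enumerate(zip(source, target)):
        if shape(src) == shape(tgt):
            tgt["W"] = [row[:] for row in src["W"]]
            tgt["b"] = src["b"][:]
            copied.append(i)
    return copied

# Assumed architectures: only the input and output sizes come from the environments.
cartpole_net = init_mlp([4, 64, 64, 2], seed=0)
lander_net = init_mlp([8, 64, 64, 4], seed=1)

copied = transfer_matching_layers(cartpole_net, lander_net)
print("layers copied from CartPole to LunarLander:", copied)
```

Here only the middle hidden layer is shared, since the first and last layers depend on the observation and action dimensions. The uncopied layers would still need to be trained on the target task.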

For the saved model checkpoints please go to the drive link: https://drive.google.com/drive/folders/1bYG2X95UOsF8mbZyJ-0fVLhd01FtIZLz?usp=sharing

Folders and files in the repository:

Plots
README.md
ddpg(cart_env).ipynb
ddpg(lander_env).ipynb
dqn(cart_env).ipynb
dqn(lander_env).ipynb
reinforce(cart+lander_env).ipynb
req.txt
setup.sh

abhranilchandra/Transfer-Learning-in-RL
