Reinforcement Learning Playground

Goal and Purpose

This repository is a collection of Reinforcement Learning (RL) experiments designed for educational purposes. It aims to provide practical examples of how popular RL algorithms perform in different environments. This project is ideal for anyone looking to:

Learn about fundamental RL concepts.
Experiment with different RL algorithms.
Test RL models in various environments.

The primary goal is to offer a clear and accessible introduction to RL, showcasing how different algorithms can be applied to solve various control problems. It's a starting point for understanding the core mechanics of RL and exploring its potential.

What's Inside

This project includes implementations of the following RL algorithms and environments:

CartPole-v1: Solved using Proximal Policy Optimization (PPO). PPO is a policy gradient algorithm that optimizes the policy directly.
MountainCar-v0: Solved using Deep Q-Network (DQN). DQN is a value-based algorithm that learns to estimate the optimal action-value function.
MountainCarContinuous-v0: Solved using Soft Actor-Critic (SAC). SAC is an off-policy actor-critic algorithm that aims to maximize both reward and entropy.
Acrobot-v1: Solved using Deep Q-Network (DQN). This demonstrates the application of DQN to a more complex control problem.
LunarLander-v3: Solved using Deep Q-Network (DQN). This example shows how DQN can be applied to a classic control problem with continuous state space and discrete actions.
MiniHack-Room-5x5-v0: Solved using Proximal Policy Optimization (PPO). This environment is a grid-based game that requires navigation and decision-making skills.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
acrobot-scripts		acrobot-scripts
cartpole-scripts		cartpole-scripts
lunarlander-scripts		lunarlander-scripts
minihack-scripts		minihack-scripts
mountaincar-continuous-scripts		mountaincar-continuous-scripts
mountaincar-scripts		mountaincar-scripts
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
usage.md		usage.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Reinforcement Learning Playground

Goal and Purpose

What's Inside

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

devarashs/implementation-for-stable-baselines3-gym

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Playground

Goal and Purpose

What's Inside

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages