# policy-improvement

Here are 10 public repositories matching this topic...

Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, and Value Iteration, from Udacity's Deep Reinforcement Learning Nanodegree program.

  • Updated Apr 3, 2019
  • Jupyter Notebook
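
For readers new to the topic, the policy improvement step named above can be sketched roughly as follows. This is a minimal illustration, not code from any listed repository: the transition table `P`, the helper names `q_from_v` and `policy_improvement`, and the two-state MDP are all hypothetical, and the model is assumed to be fully known.

```python
def q_from_v(P, V, s, gamma=0.9):
    """Estimate action values for state s from state values V.

    P[s][a] is assumed to be a list of (probability, next_state, reward)
    tuples; this layout is an illustrative choice, not a fixed standard.
    """
    return {
        a: sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
        for a, outcomes in P[s].items()
    }


def policy_improvement(P, V, gamma=0.9):
    """Return the policy that acts greedily with respect to V."""
    policy = {}
    for s in P:
        q = q_from_v(P, V, s, gamma)
        # Pick an action with the highest estimated value.
        policy[s] = max(q, key=q.get)
    return policy


# Hypothetical two-state MDP: "go" from state 0 earns a reward of 1.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(1.0, 1, 1.0)]},
    1: {"stay": [(1.0, 1, 0.0)], "go": [(1.0, 0, 0.0)]},
}
V = {0: 0.0, 1: 0.0}
greedy = policy_improvement(P, V)
```

Alternating this step with policy evaluation gives policy iteration; here `greedy[0]` selects `"go"` because it is the only action with a positive one-step return.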

A web-based interactive Grid World environment for learning and visualizing reinforcement learning algorithms, including policy evaluation, policy improvement, and value iteration. Built with a Flask backend that implements the RL algorithms and a JavaScript frontend for grid visualization.

  • Updated Mar 13, 2025
  • JavaScript

Based on the book Reinforcement Learning: An Introduction (2nd ed., 2018) by Sutton and Barto. Written for Assignment 2 of the Reinforcement Learning course (see Gridworld Problem 1.pdf) at Memorial University of Newfoundland, Jul. 18, 2024.

  • Updated Aug 12, 2024
  • Jupyter Notebook

This repository contains two reinforcement learning projects: "Treasure Hunt in the Frozen Lake," which navigates a modified FrozenLake using dynamic programming, and "Optimizing Movie Recommendations," which employs Multi-Armed Bandits to enhance user satisfaction.

  • Updated Feb 27, 2025
  • Jupyter Notebook

A reinforcement learning agent trained using Q-Learning to solve OpenAI Gym’s FrozenLake environment. The project demonstrates value-based learning, policy improvement, and exploration strategies in a slippery gridworld setting.

  • Updated May 7, 2025
  • Python
