This repository implements a single-agent Coverage Path Planning (CPP) environment in OpenAI Gym, designed to investigate the phenomenon of loss of plasticity in deep reinforcement learning. In CPP, an agent must systematically cover every accessible tile in a map without unnecessary overlap. We compare popular RL algorithms (DQN, PPO) under a curriculum-learning protocol, and benchmark their performance against an A* planner as a gold-standard baseline.
```
projeto-intermediario-p-j-cpp-main/
├── assets/                 # Sample figures and plots
├── coverage_env.py         # Gym environment for CPP tasks
├── pygame_renderer.py      # Pygame-based visualization of the agent
├── logs/                   # TensorBoard logs for training runs
├── requirements.txt        # Python dependencies
├── a_star_test.ipynb       # Notebook: A* baseline evaluation
├── dqn_mlp_test.ipynb      # Notebook: training and evaluation of the DQN agent
├── ppo_mlp_test.ipynb      # Notebook: training and evaluation of the PPO agent
├── ppo_mlp_l2_test.ipynb   # Notebook: PPO agent with L2 regularization
├── model_per_level.ipynb   # Notebook: per-level training (no curriculum)
├── conclusion.ipynb        # Report on results and analysis
└── README.md               # Project overview and usage instructions
```
- Python 3.8 or higher
- Git (to clone this repository)
- Clone the repository:

  ```bash
  git clone https://github.com/insper-classroom/projeto-intermediario-p-j-cpp.git
  cd projeto-intermediario-p-j-cpp
  ```
- Create and activate a virtual environment:

  ```bash
  python3 -m venv venv
  source venv/bin/activate   # on Windows: venv\Scripts\activate
  ```
- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
All experiments and visualizations are organized as Jupyter notebooks. Open the notebook of interest:
- A* Baseline: `a_star_test.ipynb`
- DQN Agent: `dqn_mlp_test.ipynb`
- PPO Agent: `ppo_mlp_test.ipynb`
- PPO with L2 Regularization: `ppo_mlp_l2_test.ipynb`
- Individual Level Learning (no curriculum): `model_per_level.ipynb`
- Results Summary: `conclusion.ipynb`
Each notebook contains step-by-step code cells to train agents, log metrics, and plot performance.
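For orientation, the cell below sketches what a typical training cell might look like. It assumes the notebooks use stable-baselines3 (suggested by the `*_mlp_*` notebook names) and that `CoverageEnv` can be constructed with default arguments; the actual hyperparameters and training budgets are defined in the notebooks themselves.

```python
# Minimal training sketch, assuming stable-baselines3 and a default CoverageEnv
# constructor; consult the notebooks for the hyperparameters actually used.
from stable_baselines3 import PPO
from coverage_env import CoverageEnv

env = CoverageEnv()  # assumed default constructor; see coverage_env.py for options

model = PPO(
    "MlpPolicy",             # MLP policy, matching the *_mlp_* notebook names
    env,
    verbose=1,
    tensorboard_log="logs",  # runs then appear under `tensorboard --logdir logs`
)
model.learn(total_timesteps=100_000)  # illustrative budget, not the project's setting
model.save("ppo_coverage")
```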
Training logs are saved under the `logs/` directory. To inspect learning curves:

```bash
tensorboard --logdir logs
```

Then navigate to http://localhost:6006 in your browser.
You can adapt hyperparameters or extend the environment by editing:

- `coverage_env.py`: modify the grid layouts, reward function, or action space (see the reward-shaping sketch below).
- `pygame_renderer.py`: adjust visualization settings.
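As one way to experiment with the reward without editing the environment itself, the sketch below wraps `CoverageEnv` in the standard `gym.RewardWrapper`. The per-step penalty term here is hypothetical; the environment's actual reward terms are defined inside `coverage_env.py`.

```python
# Sketch of reward shaping via the standard Gym wrapper API, as an alternative
# to editing coverage_env.py directly. The penalty term is hypothetical.
import gym
from coverage_env import CoverageEnv

class ShapedCoverageEnv(gym.RewardWrapper):
    def __init__(self, env, step_penalty=0.01):
        super().__init__(env)
        self.step_penalty = step_penalty

    def reward(self, reward):
        # Subtract a small per-step penalty to discourage redundant moves.
        return reward - self.step_penalty

env = ShapedCoverageEnv(CoverageEnv())
```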
Import `CoverageEnv` in your own scripts or notebooks:

```python
from coverage_env import CoverageEnv
```
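A minimal smoke test, assuming the environment follows the classic OpenAI Gym API (`reset()` returning an observation, `step()` returning a 4-tuple) and can be constructed with default arguments:

```python
# Random rollout to exercise the environment end to end.
from coverage_env import CoverageEnv

env = CoverageEnv()          # assumed default constructor
obs = env.reset()
done, total_reward = False, 0.0
while not done:
    action = env.action_space.sample()            # random policy
    obs, reward, done, info = env.step(action)    # classic Gym 4-tuple
    total_reward += reward
print("episode return:", total_reward)
```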
The analysis of our work is presented in the `conclusion.ipynb` notebook. It includes:
- A summary of the results obtained from the experiments.
- A discussion of the implications of the findings.
- Suggestions for future work and improvements.