Deep Learning project for binary and multiclass semantic segmentation on drone imagery.
This repository provides a complete pipeline for semantic segmentation on drone-acquired datasets using state-of-the-art models like UNet, SegFormer, and UFormer.
It supports training, evaluation, prediction visualization, and distributed training (e.g. on the Gricad cluster).
```
semantic-segmentation-drone-data/
├── doc/
│   └── slides.pdf
├── src/
│   ├── python/
│   │   ├── droneDataset.py    # Dataset and preprocessing logic
│   │   ├── metrics.py         # Metrics: PA, MPA, IoU, mIoU
│   │   ├── model.py           # Model definitions (UNet, SegFormer, UFormer)
│   │   ├── trainer.py         # Training, validation, and testing logic
│   │   └── vizualization.py   # Visualization utilities
│   ├── get_curves.py          # Plot training curves from CSV logs
│   ├── main.py                # Train/validate/test a model
│   └── predict.py             # Generate predictions from a trained model
├── outputs/
│   ├── MultiUnet/
│   │   └── predictions.zip
│   └── SegFormer/
│       └── predictions.zip
├── config.yaml                # Main configuration file
└── README.md                  # Project documentation
```
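As a reference for the metrics listed above, here is a minimal, dependency-free sketch of per-class IoU and mIoU; the function names and signatures are illustrative, not the actual API of `metrics.py`:

```python
def iou_per_class(pred, target, num_classes):
    """Compute intersection-over-union for each class from flat label lists."""
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        ious.append(inter / union if union > 0 else float("nan"))
    return ious

def mean_iou(pred, target, num_classes):
    """Average IoU over classes that appear in prediction or ground truth."""
    ious = [i for i in iou_per_class(pred, target, num_classes) if i == i]  # drop NaNs
    return sum(ious) / len(ious) if ious else 0.0
```

For example, with `pred = [0, 0, 1, 1]` and `target = [0, 1, 1, 1]`, class 0 gets IoU 0.5 and class 1 gets 2/3.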
```
git clone https://github.com/your-username/semantic-segmentation-drone-data.git
cd semantic-segmentation-drone-data
```

Make sure you are using Python ≥3.8 and a virtual environment:

```
python -m venv venv
source venv/bin/activate   # or venv\Scripts\activate on Windows
pip install -r requirements.txt
```

Update the `config.yaml` file to modify:

- Dataset paths
- Model architecture (UNet, SegFormer, etc.)
- Training hyperparameters
- Output paths
- Distributed training settings
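A hypothetical `config.yaml` layout consistent with these options (key names are illustrative; refer to the file shipped with the repository for the real schema):

```yaml
dataset:
  images_dir: data/images
  masks_dir: data/masks
model:
  name: UNet          # or SegFormer, UFormer
  num_classes: 24
training:
  epochs: 50
  batch_size: 8
  lr: 1.0e-4
output:
  dir: outputs/
distributed:
  active: 0           # set to 1 for multi-GPU runs
```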
```
python ./src/main.py
```

Make sure `distributed: active` is disabled in `config.yaml` if running locally.
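The local-run check might look like the following sketch, assuming PyYAML and the `distributed: active` key; the repository's actual gating logic lives in `src/main.py`:

```python
import yaml

def is_distributed(config_path="config.yaml"):
    """Return True when the config enables distributed training."""
    with open(config_path) as f:
        cfg = yaml.safe_load(f)
    return bool(cfg.get("distributed", {}).get("active", 0))
```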
```
python ./src/predict.py
```

Update the model checkpoint path in `config.yaml`.
```
python ./src/get_curves.py
```
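A minimal sketch of what a curve-plotting step can look like, assuming the CSV logs carry `epoch`, `train_loss`, and `val_loss` columns (the column names and the `load_curves` helper are assumptions, not the repository's actual code):

```python
import csv

def load_curves(csv_path):
    """Read a training-log CSV and return per-column lists of floats."""
    curves = {}
    with open(csv_path, newline="") as f:
        for row in csv.DictReader(f):
            for key, value in row.items():
                curves.setdefault(key, []).append(float(value))
    return curves

# Plotting (requires matplotlib):
# import matplotlib.pyplot as plt
# curves = load_curves("outputs/log.csv")
# plt.plot(curves["epoch"], curves["train_loss"], label="train")
# plt.plot(curves["epoch"], curves["val_loss"], label="val")
# plt.legend(); plt.savefig("curves.png")
```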
- Enable distributed training in `config.yaml`:

  ```yaml
  distributed:
    active: 1
  ```

- Create a shell script with the following:

  ```
  export CUDA_VISIBLE_DEVICES=0,1,2,3
  torchrun --nproc_per_node=4 src/main.py
  ```

- Use `localhost` as the master node (Gricad allocates it). If port conflicts occur, change the port manually.
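Under `torchrun`, each worker reads its identity from environment variables. A pure-stdlib sketch of collecting them (the `ddp_env` helper is illustrative; these values would typically be passed to `torch.distributed.init_process_group`):

```python
import os

def ddp_env():
    """Collect the process-group settings torchrun exports for each worker."""
    return {
        "rank": int(os.environ.get("RANK", 0)),
        "local_rank": int(os.environ.get("LOCAL_RANK", 0)),
        "world_size": int(os.environ.get("WORLD_SIZE", 1)),
        "master_addr": os.environ.get("MASTER_ADDR", "localhost"),
        "master_port": os.environ.get("MASTER_PORT", "29500"),
    }
```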
All dependencies are listed in requirements.txt.
Key packages include:
- PyTorch
- torchvision
- segmentation_models_pytorch
- transformers
- matplotlib / plotly
- PyYAML / pandas / Pillow (PIL)
- Predictions and curves must be generated with distributed mode off.
- Some models require custom install steps depending on your PyTorch version (e.g. for the Hugging Face SegFormer).
- Set `TF_ENABLE_ONEDNN_OPTS=0` for compatibility when using CPU backends.
Below is a sample prediction result from the MultiUNet model on the drone dataset:
The UNet model successfully segments large and clearly defined classes such as moving objects and landable areas. However, it shows limitations with smaller or less contrasted elements, sometimes misclassifying obstacles or blending class boundaries. This result highlights UNet's solid performance in general structure recognition, but also its relative weakness in fine-grained or context-dependent segmentation tasks.
Below is a sample prediction result from the SegFormer model:
This SegFormer prediction shows robust performance on large and structured regions, especially the water body, which is segmented with high precision. The model also correctly identifies many surrounding obstacles and patches of nature, even in complex, cluttered urban scenery. While some minor confusion persists between moving and obstacle classes in dense zones, the overall segmentation is consistent and well-aligned with the ground truth. This reflects SegFormer's strong ability to model long-range dependencies and handle heterogeneous scenes with fine details.

