SegClarity is a comprehensive framework for semantic segmentation with explainable AI capabilities, supporting both document segmentation and urban scene understanding tasks.
This project provides:
- Document Segmentation: Models trained on UTP and splitAB1 datasets for document layout analysis
- Urban Scene Segmentation: Models trained on Cityscapes dataset for street scene understanding
- Explainable AI: Attribution methods for understanding model decisions
- Visualization Tools: Comprehensive visualization of predictions and attributions
Project structure:

```
SegClarity/
├── Modules/                  # Core framework modules
│   ├── Architecture/         # Model architectures (UNet, LUNet)
│   ├── Dataset/              # Dataset handling utilities
│   ├── CityscapeDataset/     # Cityscapes-specific dataset tools
│   ├── ModelXAI/             # Explainable AI methods
│   ├── Attribution/          # Attribution computation
│   ├── Visualization/        # Visualization utilities
│   └── ...
├── Notebooks/                # Jupyter notebooks for experiments
│   ├── 01_Model_predictions_on_documents.ipynb
│   ├── 02_Model_predictions_on_cityscapes.ipynb
│   ├── 03_Attributions_on_documents.ipynb
│   └── 04_Attributions_on_cityscapes.ipynb
├── models/                   # Pre-trained model weights
├── datasets/                 # Dataset storage
└── requirements.txt          # Python dependencies
```
Clone the repository:

```
git clone https://github.com/iheb-brini/SegClarity.git
cd SegClarity
```

Create a virtual environment and install the required packages:
```
# Create a virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

Note: The requirements include PyTorch with CUDA 12.6 support. If you don't have CUDA or need a different version, modify the PyTorch installation in requirements.txt.
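For example, if you do not have CUDA at all, one common adjustment (taken from PyTorch's official install instructions, not from this repository) is to install the CPU-only wheels instead of the CUDA 12.6 build:

```shell
# Example: install CPU-only PyTorch/TorchVision instead of the CUDA 12.6 build.
# Remove or edit the torch/torchvision lines in requirements.txt first.
pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu
```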
Key Dependencies:
- PyTorch & TorchVision: Deep learning framework with CUDA support
- Captum: Model interpretability and attribution methods
- Albumentations: Advanced image augmentation library (used for Cityscapes)
- Scikit-image: Image processing utilities (used for Otsu thresholding, resizing)
- OpenCV: Computer vision operations
- Matplotlib: Visualization and plotting
- PIL/Pillow: Image loading and processing
- Pytest: Testing framework (for evaluation modules)
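As a quick sanity check after installation, you can verify that the key dependencies are importable. This short stdlib-only script is a suggestion, not part of the repository; note that Pillow imports as `PIL`, OpenCV as `cv2`, and scikit-image as `skimage`:

```python
import importlib.util

def missing_packages(modules):
    """Return the subset of module names that cannot be imported."""
    return [m for m in modules if importlib.util.find_spec(m) is None]

# Import names assumed for each dependency (Pillow -> "PIL", OpenCV -> "cv2",
# scikit-image -> "skimage")
REQUIRED = ["torch", "torchvision", "captum", "albumentations",
            "skimage", "cv2", "matplotlib", "PIL"]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All key dependencies are importable.")
```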
Download the pre-trained model weights from the releases:

```
# Create the models directory if it doesn't exist
mkdir -p models

# Download the model weights from GitHub releases:
# visit https://github.com/iheb-brini/SegClarity/releases/tag/model_weights,
# then download the model weights archive and extract it into the models/ folder.
```

Expected model structure after download:
```
models/
├── cityscapes/
│   └── unet/
│       └── best_model.pth
├── splitAB1/
│   ├── lunet/
│   │   ├── finetuned_models_minloss/
│   │   └── from_scratch_models/
│   └── unet/
│       ├── finetuned_models_minloss/
│       └── from_scratch_models/
└── UTP/
    ├── lunet/
    │   └── from_scratch_models/
    └── unet/
        └── from_scratch_models/
```
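To confirm the extraction matches the tree above, you can run a small stdlib-only check. The directory list simply mirrors the expected structure; the script itself is a suggestion, not part of the repository:

```python
from pathlib import Path

# Sub-directories expected after extracting the release archive
# (mirrors the tree shown above)
EXPECTED_DIRS = [
    "cityscapes/unet",
    "splitAB1/lunet/finetuned_models_minloss",
    "splitAB1/lunet/from_scratch_models",
    "splitAB1/unet/finetuned_models_minloss",
    "splitAB1/unet/from_scratch_models",
    "UTP/lunet/from_scratch_models",
    "UTP/unet/from_scratch_models",
]

def missing_model_dirs(models_root="models"):
    """Return expected sub-directories that are absent under models_root."""
    root = Path(models_root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]

if __name__ == "__main__":
    missing = missing_model_dirs()
    print("Missing:", missing or "none")
```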
The document datasets (UTP and splitAB1) are already included in the repository under the datasets/ folder.
Download the Cityscapes dataset for urban scene segmentation:

- Register and log in: visit the Cityscapes dataset website (https://www.cityscapes-dataset.com)
- Download the following packages:
  - leftImg8bit_trainvaltest.zip (11 GB): training, validation, and test images
  - gtFine_trainvaltest.zip (241 MB): fine annotations
- Extract the downloaded files to datasets/cityscapes/
Expected Cityscapes structure:
```
datasets/cityscapes/
├── leftImg8bit/
│   ├── train/
│   ├── val/
│   └── test/
└── gtFine/
    ├── train/
    ├── val/
    └── test/
```
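Cityscapes pairs each image with its fine annotation purely by file naming: the top-level folder changes from leftImg8bit to gtFine, and the `_leftImg8bit` suffix becomes `_gtFine_labelIds`. A minimal sketch of that mapping, assuming the standard file names (this helper is illustrative, not part of SegClarity):

```python
def label_path_for(image_path: str) -> str:
    """Map a leftImg8bit image path to its gtFine labelIds path using
    the standard Cityscapes naming convention, e.g.
    leftImg8bit/train/aachen/aachen_..._leftImg8bit.png
      -> gtFine/train/aachen/aachen_..._gtFine_labelIds.png
    """
    return (image_path
            .replace("leftImg8bit/", "gtFine/", 1)   # swap top-level folder
            .replace("_leftImg8bit.png", "_gtFine_labelIds.png"))
```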
Install Jupyter (if missing):

```
pip install jupyter notebook
```

Start Jupyter and run the experiment notebooks:

```
jupyter notebook
```

- 01_Model_predictions_on_documents.ipynb: evaluates document segmentation models (LUNet, UNet)
  - Works with the UTP and splitAB1 datasets
  - Visualizes predictions vs. ground truth
- 02_Model_predictions_on_cityscapes.ipynb: evaluates urban scene segmentation models
  - Works with the Cityscapes dataset
  - Provides semantic segmentation results
- 03_Attributions_on_documents.ipynb: computes and visualizes attributions for document models
  - Uses various XAI methods (GradCAM, Integrated Gradients, etc.)
  - Analyzes model decision-making on document layouts
- 04_Attributions_on_cityscapes.ipynb: computes and visualizes attributions for urban scene models
  - Explains model predictions on street scenes
  - Provides insights into what the model focuses on
Each notebook allows you to configure:
- Dataset type: Choose between available datasets
- Model architecture: Select UNet or LUNet
- Model variant: Choose from-scratch or fine-tuned models
- Device: CPU or GPU (if available)
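In practice, such options usually appear as plain variables in the first cells of a notebook. The cell below is a hypothetical illustration (the variable and function names are invented, not the notebooks' actual identifiers), including a CUDA-to-CPU fallback for the device option:

```python
# Hypothetical configuration cell; names are illustrative only.
DATASET = "splitAB1"        # or "UTP", "cityscapes"
ARCHITECTURE = "unet"       # or "lunet"
VARIANT = "from_scratch"    # or "finetuned"

def resolve_device(requested: str) -> str:
    """Fall back to CPU when CUDA is requested but not available."""
    if requested == "cuda":
        try:
            import torch
            if torch.cuda.is_available():
                return "cuda"
        except ImportError:
            pass
        return "cpu"
    return requested

DEVICE = resolve_device("cuda")
```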
- Python: >=3.10, <3.13
- CUDA: 12.6 (optional, for GPU acceleration)
- RAM: 8GB minimum, 16GB recommended
- Storage: 15GB for datasets and models
- CUDA Out of Memory: Reduce batch size or use CPU
- Missing Models: Ensure model weights are downloaded and placed in correct directories
- Dataset Not Found: Verify dataset paths and structure
- Import Errors: Check that all dependencies are installed correctly
- Check the notebook documentation for specific usage instructions
- Verify file paths and directory structures match the expected layout
- Ensure all dependencies are properly installed