Welcome to our Road Anomaly Detection project. We've been working on using computer vision, specifically YOLOv8 models, to automatically spot issues like cracks and potholes on road surfaces. This repository contains the dataset details, the models we trained and used, evaluation results, and the demo applications we built.
We put together a custom dataset specifically for training our main detection model. The whole process, from sourcing data to annotation, is documented if you're curious about the nitty-gritty details.
- Dataset Creation Documentation: Read the full process here
The dataset structure within this repository looks like this:
```
dataset/
├── test/
│   ├── images/
│   └── labels/
├── train/
│   ├── images/
│   └── labels/
└── valid/
    ├── images/
    └── labels/
```
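The splits above follow the standard YOLOv8 layout, so a `data.yaml` along the lines of the sketch below can point training at them. This is a hypothetical example, not the exact file we ship: the class names are taken from the evaluation tables later in this README, and the index order is an assumption (the pipeline below also computes class weights intended for this file).

```yaml
# Hypothetical data.yaml -- paths match the tree above; class index order is assumed.
path: dataset
train: train/images
val: valid/images
test: test/images

nc: 7
names:
  0: Heavy-Vehicle
  1: Light-Vehicle
  2: Pedestrian
  3: Crack
  4: Crack-Severe
  5: Pothole
  6: Speed-Bump
```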
```mermaid
---
config:
theme: neo
themeVariables:
fontSize: 13px
layout: fixed
---
flowchart TD
subgraph sg0["1️⃣Initial Data Collection"]
direction LR
DS1["RAD Dataset<br>(~8.4k img)"]
DS2["Indian Roads<br>(~5.1k img)"]
DS3["Humps/Bumps/Potholes<br>(~3.2k img)"]
DS4["HighRPD Dataset<br>(~11.7k img)"]
end
subgraph sg1["2️⃣HighRPD Preprocessing"]
PreprocHighRPD["Preprocess HighRPD<br>(XML->YOLO, Map Classes, Split 70/20/10)"]
PreprocOutput["Preprocessed HighRPD<br>(Train/Valid/Test Splits)"]
end
subgraph sg2["3️⃣Label Standardization"]
InitialCollect["Combined Other Datasets<br>(RAD, Indian, HBP)"]
Standardize["Standardize All Labels<br>(Define 7 Unified Classes)"]
end
subgraph sg3["4️⃣Merging & Initial Split"]
Merge["Merge Datasets<br>(HighRPD Splits + Standardized Others)<br>Add Prefixes, Verify Pairs"]
InitialMerged["Initial Merged Dataset<br>Train: 18,005 | Valid: 4,518 | Test: 2,846<br>(Total: 25,369 Images)"]
end
subgraph sg4["5️⃣Balancing via Augmentation (Train Set)"]
AnalyzeImbalance["Analyze Train Set Imbalance<br>(Low: HV, Ped, SB)"]
Augment["Augment Minority Classes<br>(Flips, Brightness, Rotations)"]
AugmentedImages["Generated Augmented Images<br>(+5,316 Train: HV, Ped, SB)"]
end
subgraph sg5["6️⃣Class Weight Calculation"]
CalcWeights["Calculate Class Weights<br>(Based on Final Train Dist.)"]
WeightsOutput["Class Weights<br>(for data.yaml)"]
end
subgraph sg6["7️⃣Final Dataset"]
FinalDataset["Final Unified & Balanced Dataset<br>Total: 30,685 Images<br><b>Train: 23,321</b> (Orig+Aug)<br>Valid: 4,518 | Test: 2,846<br>(YOLOv8 Format + Weights)"]
end
DS4 --> PreprocHighRPD
PreprocHighRPD --> PreprocOutput
DS1 --> InitialCollect
DS2 --> InitialCollect
DS3 --> InitialCollect
InitialCollect -- Data & Labels --> Standardize
PreprocOutput -- HighRPD Data & Labels --> Standardize
Standardize -- Standardized Data --> Merge
PreprocOutput -- "Pre-split HighRPD Data" --> Merge
Merge --> InitialMerged
InitialMerged -- Train Split --> AnalyzeImbalance
AnalyzeImbalance --> Augment
Augment --> AugmentedImages
Augment -- "Post-Augmentation Train Dist." --> CalcWeights
CalcWeights --> WeightsOutput
InitialMerged -- Original Train, Valid, Test Splits --> FinalDataset
AugmentedImages -- Augmented Train Images --> FinalDataset
WeightsOutput -- Class Weights --> FinalDataset
DS1:::dataset
DS2:::dataset
DS3:::dataset
DS4:::dataset
PreprocHighRPD:::process
PreprocOutput:::output
InitialCollect:::output
Standardize:::process
Merge:::process
InitialMerged:::output
AnalyzeImbalance:::importantNote
Augment:::process
AugmentedImages:::output
CalcWeights:::process
WeightsOutput:::output
FinalDataset:::output
classDef dataset fill:#f9d,stroke:#333,stroke-width:1px
classDef process fill:#cde,stroke:#333,stroke-width:1px
classDef output fill:#dfd,stroke:#333,stroke-width:2px
classDef importantNote fill:#ffc,stroke:#e7b400,stroke-width:1px,color:black
```
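Step 6 above derives class weights from the final training distribution. The exact formula we used isn't spelled out here, but a common inverse-frequency scheme looks like this sketch (the `compute_class_weights` helper is ours for illustration, not part of the repo):

```python
from collections import Counter
from pathlib import Path

def compute_class_weights(labels_dir: str, num_classes: int = 7) -> list[float]:
    """Inverse-frequency weights from YOLO-format label files (class id is the first token per line)."""
    counts = Counter()
    for label_file in Path(labels_dir).glob("*.txt"):
        for line in label_file.read_text().splitlines():
            if line.strip():
                counts[int(line.split()[0])] += 1
    total = sum(counts.values())
    # weight_c = total / (num_classes * count_c): rarer classes get larger weights
    return [total / (num_classes * counts.get(c, 1)) for c in range(num_classes)]

weights = compute_class_weights("dataset/train/labels")
print([round(w, 3) for w in weights])
```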
This is the primary model we trained from scratch using our custom dataset.
- Model Architecture: YOLOv8m
- Training Epochs: 120
- Training Time: Approx. 27.8 hours
- Hardware: NVIDIA GeForce RTX 3060 Laptop GPU (6GB)
- Best Weights File: `RoadDetectionModel/RoadModel_yolov8m.pt_rounds120_b9/weights/best.pt` (Size: 52.0 MB)
- Repository: Based on this structure
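The run folder name (`rounds120_b9`) suggests 120 epochs at batch size 9. Assuming those settings, the training call via the Ultralytics Python API would look roughly like this sketch:

```python
from ultralytics import YOLO

# Initialize from the yolov8m checkpoint in the repo root.
model = YOLO("yolov8m.pt")
model.train(
    data="data.yaml",   # dataset config (paths + class names)
    epochs=120,
    batch=9,            # small batch to fit the 6GB RTX 3060 Laptop GPU
    project="RoadDetectionModel",
    name="RoadModel_yolov8m.pt_rounds120_b9",
)
```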
These metrics reflect the performance on the validation set using the best weights saved during the training process.
| Class | Precision | Recall | mAP@.5 | mAP@.5:.95 |
|---|---|---|---|---|
| Overall | 0.738 | 0.726 | 0.733 | 0.443 |
| Heavy-Vehicle | 0.921 | 0.976 | 0.979 | 0.764 |
| Light-Vehicle | 0.894 | 0.965 | 0.967 | 0.659 |
| Pedestrian | 0.838 | 0.903 | 0.910 | 0.494 |
| Crack | 0.553 | 0.430 | 0.454 | 0.219 |
| Crack-Severe | 0.526 | 0.467 | 0.471 | 0.265 |
| Pothole | 0.595 | 0.432 | 0.432 | 0.171 |
| Speed-Bump | 0.842 | 0.911 | 0.919 | 0.530 |
- Validation results saved in: `RoadDetectionModel/RoadModel_yolov8m.pt_rounds120_b9`
We ran a final evaluation on a dedicated test set using the `best.pt` model.
| Class | Precision | Recall | mAP@.5 | mAP@.5:.95 |
|---|---|---|---|---|
| Overall | 0.736 | 0.740 | 0.745 | 0.448 |
| Heavy-Vehicle | 0.913 | 0.978 | 0.981 | 0.763 |
| Light-Vehicle | 0.892 | 0.951 | 0.961 | 0.649 |
| Pedestrian | 0.822 | 0.915 | 0.918 | 0.522 |
| Crack | 0.576 | 0.484 | 0.505 | 0.240 |
| Crack-Severe | 0.548 | 0.503 | 0.493 | 0.273 |
| Pothole | 0.597 | 0.440 | 0.468 | 0.198 |
| Speed-Bump | 0.804 | 0.908 | 0.885 | 0.487 |
- Average Inference Speed: ~12.0 ms per image
- Test results saved in: `runs/detect/val3`
- Sample Processed Video: Watch a sample here
- Precision: 0.736
- Recall: 0.740
- mAP@0.5: 0.745
- mAP@0.5:0.95: 0.448
- F1-Score: 0.738 (Calculated as 2 * (P * R) / (P + R))
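If you want to reproduce the test-set numbers yourself, the Ultralytics API makes it a two-liner. This sketch assumes the hypothetical `data.yaml` shown earlier:

```python
from ultralytics import YOLO

model = YOLO("RoadDetectionModel/RoadModel_yolov8m.pt_rounds120_b9/weights/best.pt")
metrics = model.val(data="data.yaml", split="test")  # writes outputs under runs/detect/
print(metrics.box.map50, metrics.box.map)            # mAP@.5 and mAP@.5:.95
```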
Charts generated during the final test set evaluation are saved in `runs/detect/val3`.
We also incorporated a second, pre-trained model for comparison and potential fusion.
- Model File: `YOLOv8_Small_2nd_Model.pt`
- Model Architecture: YOLOv8s
- Source Repository: oracl4/RoadDamageDetection
- Training Data: CRDDC2022 Dataset
- Detected Classes: `Longitudinal Crack`, `Transverse Crack`, `Alligator Crack`, `Potholes`
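Before wiring the second model into the demos, it's worth loading it once and checking its class map against the list above (the index order shown in the comment is a guess):

```python
from ultralytics import YOLO

m2 = YOLO("YOLOv8_Small_2nd_Model.pt")
print(m2.names)  # e.g. {0: 'Longitudinal Crack', 1: 'Transverse Crack', ...} -- order assumed
```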
We've built a couple of interfaces to showcase the models in action.
This is our main demo app, allowing you to test the models easily.
- Functionality: Detect anomalies in uploaded images, videos, or a live camera feed ("Dash Cam").
```mermaid
---
config:
layout: fixed
theme: mc
look: neo
---
flowchart TD
U["User"] --> SB["Streamlit Sidebar"]
SB --> MS["Model Selection"] & CT["Confidence Thresholds"] & IS["Input Source"]
IS --> FileInput["Uploaded File (Image)/(Video)"] & CamInput["Camera Feed"]
FileInput --> Frame["Input Frame/Image"]
CamInput --> Frame
Frame --> PROC["Processing Engine"]
MS --> PROC
CT --> PROC
PROC --> YOLO["YOLOv8 Inference"] & AnnotatedFrame["Annotated Frame/Image"]
YOLO --> PROC
AnnotatedFrame --> MA["Streamlit Main Area"]
MA --> U
U:::user
SB:::ui
MS:::config
CT:::config
IS:::config
FileInput:::data
CamInput:::data
Frame:::data
PROC:::process
YOLO:::model
AnnotatedFrame:::data
AnnotatedFrame:::Class_01
MA:::ui
classDef user fill:#f9d,stroke:#333,stroke-width:2px
classDef ui fill:#add,stroke:#333,stroke-width:2px
classDef config fill:#ffeb99,stroke:#333,stroke-width:1px
classDef data fill:#cceeff,stroke:#333,stroke-width:1px
classDef process fill:#ccffcc,stroke:#333,stroke-width:2px
classDef model fill:#ffcc99,stroke:#333,stroke-width:2px
classDef Class_01 stroke-width:4px, stroke-dasharray: 0, stroke:#D50000
style AnnotatedFrame color:#000000
```
- Features:
  - Choose between Model 1 (`M1`) and Model 2 (`M2`), or use both.
  - Adjust the confidence threshold for each model independently.
  - View detections overlaid on the input: M1 detections are in RED, M2 detections are in BLUE (see the sketch after this list).
- Live Demo: Try it out here!
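For a feel of how the pieces in the diagram fit together, here is a minimal Streamlit sketch of the two-model image path. It is not our actual `main.py`: the `load_models` / `draw` helpers and the exact widget layout are illustrative assumptions.

```python
import cv2
import numpy as np
import streamlit as st
from ultralytics import YOLO

M1_COLOR, M2_COLOR = (0, 0, 255), (255, 0, 0)  # BGR: red for M1, blue for M2

@st.cache_resource
def load_models():
    m1 = YOLO("RoadDetectionModel/RoadModel_yolov8m.pt_rounds120_b9/weights/best.pt")
    m2 = YOLO("YOLOv8_Small_2nd_Model.pt")
    return m1, m2

def draw(frame, results, color):
    """Overlay one model's bounding boxes on the frame."""
    for box in results[0].boxes:
        x1, y1, x2, y2 = map(int, box.xyxy[0])
        cv2.rectangle(frame, (x1, y1), (x2, y2), color, 2)
    return frame

m1, m2 = load_models()
choice = st.sidebar.radio("Models", ["M1", "M2", "Both"])
conf1 = st.sidebar.slider("M1 confidence", 0.0, 1.0, 0.25)
conf2 = st.sidebar.slider("M2 confidence", 0.0, 1.0, 0.25)
upload = st.sidebar.file_uploader("Image", type=["jpg", "jpeg", "png"])

if upload is not None:
    frame = cv2.imdecode(np.frombuffer(upload.read(), np.uint8), cv2.IMREAD_COLOR)
    if choice in ("M1", "Both"):
        frame = draw(frame, m1.predict(frame, conf=conf1), M1_COLOR)
    if choice in ("M2", "Both"):
        frame = draw(frame, m2.predict(frame, conf=conf2), M2_COLOR)
    st.image(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
```

Video and the "Dash Cam" feed follow the same pattern, looping this per-frame logic over frames pulled from `cv2.VideoCapture`.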
We started building a Flask-based interface as well.
- Location: `interface-app/`
- Status: This app is currently under development and not fully functional yet.
Here's a glance at how the project files are organized:
```
C:.
│   .gitignore
│   do this setup.md
│   main.py
│   packages.txt
│   README.md
│   requirements.txt
│   run.py
│   run2model.py
│   train.ipynb
│   visualize_annotations_data.py
│   web interface.md
│   yolo11n.pt
│   yolov8m.pt
│   YOLOv8_Small_2nd_Model.pt
│
├───.devcontainer
│       devcontainer.json
│
├───.streamlit
│       config.toml
│
├───.vscode
│       settings.json
│
├───dataset
│   ├───test
│   │   ├───images
│   │   └───labels
│   ├───train
│   │   ├───images
│   │   └───labels
│   └───valid
│       ├───images
│       └───labels
│
├───inference_output
│       India_000884_jpg.rf.7d8d1739a4debaece30cbe543980de9c_annotated.jpg
│
├───inference_output_two_models
│       v_annotated_2models.mp4
│
├───interface-app
│   │   app.py
│   │   requirements.txt
│   │   ... (static, templates folders)
│
├───RoadDetectionModel
│   └───RoadModel_yolov8m.pt_rounds120_b9
│       │   args.yaml
│       │   confusion_matrix.png
│       │   ... (other training/validation outputs)
│       │
│       └───weights
│               best.pt
│               last.pt
│
└───runs
    └───detect
        ├───val
        │       ... (older Test Set Evaluation Metrics)
        │
        └───val3
                ... (new Test Set Evaluation Metrics)
```
Want to run this project on your own machine? Great! We've put together a guide to help you set up the environment and get things running.
Please follow the instructions in the `do this setup.md` file located in the root of this repository.
Thanks for checking out our project!