Skip to content

savannahgong/Automation-Anywhere-1A

Repository files navigation

Automation-Anywhere-1A - Vision Model for Signature Detection

Overview

This project implements a vision-based data extraction system designed to detect signatures on documents. Using state-of-the-art object detection models, YOLO and Detectron2, the system processes a dataset of various documents to identify and localize signatures with high accuracy.

Project Workflow

  1. Dataset Preparation

The dataset consists of a diverse set of documents with labeled signature regions. Each document has annotations in a format compatible with both YOLO and Detectron2 (e.g., COCO JSON for Detectron2 and TXT files for YOLO).

The dataset was split into three subsets:

Train: Used for training the models.

Validation: Used to tune hyperparameters and monitor model performance during training.

Test: Used for final evaluation of the trained models.

  1. Model Selection

Two object detection frameworks were selected for the task:

YOLO (You Only Look Once):

A single-stage object detection model that predicts bounding boxes and class probabilities directly.

Chosen for its speed and suitability for real-time applications.

Used the YOLOv5 variant for training due to its improved accuracy and efficiency for detecting objects of varying sizes.

Detectron2:

A two-stage region-based convolutional neural network (R-CNN) framework.

Detectron2 first identifies regions of interest and then classifies and refines bounding boxes in these regions.

Chosen for its high accuracy, especially in detecting smaller objects, albeit at a higher computational cost.

  1. Model Training

Both models were trained on the Train subset of the dataset.

Hyperparameters such as learning rate, batch size, and the number of epochs were tuned based on performance on the Validation subset.

The COCO-Detection/faster_rcnn_R_50_FPN_1x model was selected for Detectron2 due to its effectiveness in detecting small to medium-sized objects.

  1. Model Evaluation

Performance was evaluated using metrics such as Precision, Recall, F1-score, and mAP (mean Average Precision) on the Test subset.

Comparisons between YOLO and Detectron2 were made to analyze trade-offs between speed and accuracy.

  1. Deployment

The trained models were deployed in a pipeline to process new documents and detect signatures.

Results included the bounding box coordinates and confidence scores for each detected signature.

Overall Results

YOLO:

Speed: High

Accuracy: Moderate (best for medium to large signatures)

Detectron2:

Speed: High

Accuracy: High (particularly effective for smaller signatures)

Data Visualization

Screenshot 2024-12-13 at 6 19 57 PM Screenshot 2024-12-13 at 6 19 57 PM

Detectron:

Sample Output

Yolo:

Screenshot 2024-12-13 at 6 22 04 PM

Detectron:

Screenshot 2024-12-13 at 6 21 54 PM

Individual Contributions

Data Preprocessing: Harini, Savannah, Ananya
Detectron2 Model: Harini, Makayla
YOLOV8 Model: Savannah, Ananya
Streamlit Application: Savannah, Harini

Shoutout to our Challenge Advisor, Anish and our AI Studio TA, Aditya for guiding us along the way

Potential Next Steps

  • Increasing AP scores for both models
  • utilizing synthetic data to better train each model
  • analyzing wrong predictions to find patterns in mistakes

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •