Bounding Box Extraction

Contact Details

Name: Pratik Ghute
Contact No./ Whatsapp: +91 7057033662
Email: pratikghute54@gmail.com

Implemented Features

Bounding Box Extraction Function

Implemented the get_bounding_box function to extract precise bounding boxes from mask images. This function utilized contour detection after thresholding the images to binary, leveraging the clear contrast between the object and background. Bounding box coordinates (xmin, ymin, xmax, ymax) were stored in a JSON file as ground truth.

Performance Observation

Evaluated SAM2’s performance by tracking objects across subsequent images using bounding boxes from the first image of each object. Assessed its segmentation ability under various conditions such as lighting and angles.

Model Limitations

Observed that SAM2 occasionally failed to track or mask objects, even with multiple occurrences in the image. This pointed to limitations in the model’s consistency in object tracking and detection.

Function Modification for Multi-threading

Enhanced the track_item_boxes function to support multi-threading, enabling dynamic path and variable management across threads. This optimization significantly improved the efficiency and speed of bounding box predictions for multiple objects.

Test Set Construction

Exclusion of Training Images

Excluded the first image of each object from the test set to avoid bias, as these images were used for training.

Handling Multiple Instances

For images with multiple masks, only the mask associated with the tracked object was considered valid. Given the single-shot tracking task not object detection(so only one instance per image must be ground truth), only one mask per image was relevant to ensure focused evaluation.

Handling No Detections

For images where SAM2 did not detect any object, only one mask image was included in the test set to represent the ground truth. This approach aligned with the expectation of having a single prediction per image.

Construction of Test Set Dictionary

Constructed the test set dictionary based on these criteria and generated ground_truth.json and predictions.json in COCO format for evaluation.

Performance Evaluation

Overall Precision and Recall

SAM2 demonstrated moderate precision and recall. Precision metrics indicated challenges in accurate localization, particularly at higher alignment levels. Recall metrics showed SAM2 detected approximately 23% of objects in the test set.

Performance by Object Size

Small Objects: No small objects were present in the test set, resulting in metrics for this category being reported as -1.000.
Medium Objects: Achieved a precision of 0.343 and recall of 0.348, indicating reasonable performance on medium-sized objects.
Large Objects: Precision and recall were lower, at 0.138 and 0.225 respectively, suggesting difficulties in accurately detecting and tracking large objects.

Summary

The evaluation reveals that SAM2 performs reasonably well with medium-sized objects but shows limitations with large objects and lacks assessment for small objects due to their absence in the test set. The results indicate that while SAM2 is functional, further improvements are necessary to enhance its precision and tracking capabilities across varying object sizes.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Explaining approach for sam2_paralleldots.pdf		Explaining approach for sam2_paralleldots.pdf
README.md		README.md
ground_truth.json		ground_truth.json
image_info.json		image_info.json
predictions.json		predictions.json
sam2_mask_tracking.ipynb		sam2_mask_tracking.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bounding Box Extraction

Contact Details

Implemented Features

Bounding Box Extraction Function

Performance Observation

Model Limitations

Function Modification for Multi-threading

Test Set Construction

Exclusion of Training Images

Handling Multiple Instances

Handling No Detections

Construction of Test Set Dictionary

Performance Evaluation

Overall Precision and Recall

Performance by Object Size

Summary

About

Uh oh!

Releases

Packages

Languages

Pratik-Ghute/Sam2-performance-analysis-object-tracking

Folders and files

Latest commit

History

Repository files navigation

Bounding Box Extraction

Contact Details

Implemented Features

Bounding Box Extraction Function

Performance Observation

Model Limitations

Function Modification for Multi-threading

Test Set Construction

Exclusion of Training Images

Handling Multiple Instances

Handling No Detections

Construction of Test Set Dictionary

Performance Evaluation

Overall Precision and Recall

Performance by Object Size

Summary

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages