suyogparkhi/IISF_Bug_bunnies

IISF_Bug_bunnies

This is the repository of Bug Bunnies, Group 3, for the IISF Space Hackathon 2023.

Problem Statement :


# Group 3, Topic 1: Optimized Geospatial Data Management for Bhuvan Platform.

# Our Approach :
We implemented a modified MinHash algorithm on the given unstructured geospatial data. In the preprocessing step, binary and text-convertible files are converted to text files and shingled, and the result is saved as a pickle file. We then run the MinHash algorithm on this pickle file. MinHash outputs sets of the nearest redundant files; for example, if file1, file2 and file3 are similar, the output is the set {file1, file2, file3}. This output is integrated with the frontend, which displays the redundant files and supports adding and deleting files. At that point the data is ready for geospatial processing and storage, which we handle with PostgreSQL and DuckDB. We are also implementing a cataloging system for this data based on filtering techniques.
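
The shingling and MinHash steps described above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the function names (`shingles`, `minhash_signature`, `redundancy_sets`), the shingle length `k`, the number of hash functions `num_perm`, and the similarity `threshold` are all assumptions chosen for the example, and the pickle step is left out.

```python
import hashlib
from itertools import combinations


def shingles(text, k=5):
    """Return the set of overlapping k-character shingles of `text`."""
    text = " ".join(text.split()).lower()  # normalise whitespace and case
    if len(text) < k:
        return {text} if text else set()
    return {text[i:i + k] for i in range(len(text) - k + 1)}


def _hash(shingle, seed):
    """Deterministic 64-bit hash of a shingle, varied by `seed`."""
    data = f"{seed}:{shingle}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(shingle_set, num_perm=64):
    """One minimum per seeded hash function; empty input maps to all zeros."""
    if not shingle_set:
        return [0] * num_perm
    return [min(_hash(s, seed) for s in shingle_set) for seed in range(num_perm)]


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching signature slots, an estimate of Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def redundancy_sets(docs, threshold=0.8, num_perm=64, k=5):
    """Group file names whose estimated similarity reaches `threshold`.

    `docs` maps a file name to its text. Groups are built with a small
    union-find, so similarity is treated as transitive (an assumption).
    """
    sigs = {name: minhash_signature(shingles(text, k), num_perm)
            for name, text in docs.items()}
    parent = {name: name for name in docs}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for a, b in combinations(docs, 2):
        if estimated_jaccard(sigs[a], sigs[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in docs:
        groups.setdefault(find(name), set()).add(name)
    # Only groups with more than one member are redundant.
    return [group for group in groups.values() if len(group) > 1]
```

Calling `redundancy_sets({"file1": text_a, "file2": text_b, ...})` returns a list of sets of file names, matching the `{file1, file2, file3}` shape described above. Comparing every pair of signatures is quadratic in the number of files; for large collections, locality-sensitive hashing over the signatures is the usual way to avoid that.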

# Tech Stacks :
Python, Flask, DuckDB

Updates (timestamp: 17th Jan, 10 pm):
1) Data preprocessing and MinHash algorithm completed
2) Frontend 50% completed
3) MinHash yet to be integrated with the frontend
4) Sample cataloging completed
5) Generic cataloging code and its frontend integration yet to be written

Installations:

    pip install flask
    pip install duckdb leafmap
    conda create -n geo python=3.11
    conda activate geo
    conda install -c conda-forge mamba
    mamba install -c conda-forge python-duckdb duckdb-engine jupysql leafmap

Working Model :
Frontend: (screenshot)

# MinHash :
Preprocessing (conversion to txt): (screenshot)
List of redundant files: (screenshot)
Jaccard similarity: (screenshot)
Redundancy set: (screenshot)

# Geospatial configuration using DuckDB :
(3 screenshots)

# Sample Cataloging :
(3 screenshots)
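
To illustrate what filter-based cataloging could look like, here is a small sketch in plain Python. It is not the project's implementation, which uses DuckDB. The `CatalogEntry` fields, the bounding-box convention `(min_lon, min_lat, max_lon, max_lat)`, and the `filter_catalog` function are all hypothetical and only show the idea of narrowing a catalog by file format and spatial extent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical catalog record for one geospatial file."""
    name: str
    fmt: str     # e.g. "geojson" or "tif"
    bbox: tuple  # (min_lon, min_lat, max_lon, max_lat)


def bbox_intersects(a, b):
    """True if two (min_lon, min_lat, max_lon, max_lat) boxes overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def filter_catalog(entries, fmt=None, bbox=None):
    """Return names of entries matching the optional format and bbox filters."""
    names = []
    for entry in entries:
        if fmt is not None and entry.fmt != fmt:
            continue
        if bbox is not None and not bbox_intersects(entry.bbox, bbox):
            continue
        names.append(entry.name)
    return names
```

A call such as `filter_catalog(entries, fmt="tif", bbox=(72.0, 18.0, 74.0, 20.0))` would list only the raster files whose extent overlaps that box. In a database-backed catalog the same filters would typically become `WHERE` clauses.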
