This repository contains the code for the AI-enabled pipeline for borehole BA1B of the Oman Drilling Project Multi-borehole Observatory.
The core cut images can be found here: https://www.icdp-online.org/projects/by-continent/asia/oodp-oman/public-data-2
Go to Public Images and choose BA1B in the Cores row; the zip folder should be around 180 MB.
The segmented images are available at the following link; please download the folder "core_images_all", which contains images for all core sections: https://drive.google.com/drive/folders/1fvtC4qP-yYlxWZGLyOqsKw76l6tssY-c?usp=sharing
In this parent folder you can find greyscale and segmented images for all core sections. The TIF images with names ending in "_Simple Segmentation_3" are the segmented images used for image analysis and for calculating the statistical microstructure descriptors (SMDs).
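As a quick sanity check on the download, a minimal sketch along these lines can list the segmented TIFs and open one of them (the local path and the use of Pillow are illustrative assumptions, not part of the pipeline scripts):

from pathlib import Path
from PIL import Image  # assumes Pillow is installed

# Hypothetical local path to the downloaded parent folder
root = Path(r"D:\data\core_images_all")

# Segmented images follow the naming convention described above
segmented = sorted(root.rglob("*_Simple Segmentation_3.tif"))
print(f"Found {len(segmented)} segmented core images")

if segmented:
    img = Image.open(segmented[0])
    print(segmented[0].name, img.size, img.mode)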
The SMD calculations have already been done for the ROI images selected from the core images, and the sums of the first 50 values are included as columns in "Datasets\Dataset_BA1B.xlsx" (see below if you want to redo the SMD calculations from scratch or run them on your own images). Therefore, to reproduce the results presented in the manuscript, you can run the pipeline simply with:
bash run_pipeline.sh
Assuming your data is in the expected folders, this runs the pipeline and will:
- run the ChatGPT text categorization if you have provided an API key stored in api-key.txt
- assemble the data set
- run the CatBoost models (a minimal sketch of this step follows the list)
- generate the figures from the paper
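For orientation, here is a minimal sketch of the kind of CatBoost fit performed on the assembled dataset (the feature prefix "sum_", the target column name, and the hyperparameters below are illustrative assumptions; the actual configuration is in the pipeline scripts):

import pandas as pd
from catboost import CatBoostRegressor
from sklearn.model_selection import train_test_split

# Assembled dataset described above
df = pd.read_excel(r"Datasets\Dataset_BA1B.xlsx")

# Hypothetical feature/target choice, for illustration only
feature_cols = [c for c in df.columns if c.startswith("sum_")]  # assumed column prefix
X, y = df[feature_cols], df["target_property"]                  # assumed target name

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = CatBoostRegressor(iterations=500, depth=6, learning_rate=0.05, verbose=100)
model.fit(X_train, y_train, eval_set=(X_test, y_test))
print("R^2 on held-out data:", model.score(X_test, y_test))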
If you want to perform the SMD calculations on your own images or on a similar dataset, run the following scripts one by one:
Step 1: Select a region of interest (ROI) from the segmented images for each core:
python ROI_selection.py --path_imgs D:\Hamed\SerpAIpipeline\data\core_images_all --image_size 512 --stride 128 --path_output D:\Hamed\SerpAIpipeline\data\cropped_test
This script reads the segmented images from 'path_imgs', the path to the parent folder 'core_images_all' described above. ROIs are selected by sliding a window of size 'image_size' over each image with the given stride and choosing the region with the maximum fracture fraction. The ROI images are saved both inside each core folder in 'core_images_all' and in the user-specified 'path_output'.
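The core of the ROI search can be sketched as follows (a minimal sketch, assuming the fracture phase is stored as a single label value in the segmented array; the label value and function name are illustrative, not taken from ROI_selection.py):

import numpy as np

def max_fracture_window(seg, image_size=512, stride=128, fracture_label=1):
    """Return the top-left corner and fraction of the window with the
    highest fraction of pixels labelled as fracture (illustrative sketch)."""
    best_frac, best_rc = -1.0, (0, 0)
    for r in range(0, seg.shape[0] - image_size + 1, stride):
        for c in range(0, seg.shape[1] - image_size + 1, stride):
            window = seg[r:r + image_size, c:c + image_size]
            frac = np.mean(window == fracture_label)
            if frac > best_frac:
                best_frac, best_rc = frac, (r, c)
    return best_rc, best_frac

# Example usage on a segmented image loaded as a numpy array:
# seg = np.array(Image.open("some_core_Simple Segmentation_3.tif"))
# (r, c), frac = max_fracture_window(seg)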
Step 2: Calculate SMDs from the ROI core images. Here, you can calculate all SMDs on your ROI images of size 512 by 512 pixels. If your images have a different size, the C++ code in the folder 'Cpp_source_512' must be recompiled with different parameters. See the docstring for more detail.
python calculate_smds.py --path_ROI_imgs D:\Hamed\SerpAIpipeline\data\cropped_ROIs --cpathPn D:\Hamed\SerpAIpipeline\SerpAI_Github\AI-pipeline-boreholes\Cpp_source_512\Cpp_source\Polytope --runtimePn D:\Hamed\SerpAIpipeline\SerpAI_Github\AI-pipeline-boreholes\Cpp_source_512\runtime --outputPn D:\Hamed\SerpAIpipeline\SerpAI_Github\AI-pipeline-boreholes\Cpp_source_512\runtime\output --path_output D:\Hamed\SerpAIpipeline\smd_outputs_test
This script saves a dictionary (as a pickle file, .pkl) for each image in the output folder specified by the user ('path_output'). In each dictionary, the polytope functions (s2, p3, p4, ..., pL, f2, f3, f4, ..., fL) are the keys and the values are the probabilities at each distance r. The file name of each dictionary corresponds to the core image name.
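A minimal sketch for inspecting one of these dictionaries (the file name is a placeholder; the key/value layout follows the description above):

import pickle
from pathlib import Path

# Placeholder path: any .pkl file written by calculate_smds.py
pkl_path = Path(r"D:\Hamed\SerpAIpipeline\smd_outputs_test") / "example_core_image.pkl"

with open(pkl_path, "rb") as f:
    smds = pickle.load(f)

# Keys are the polytope functions; values are probabilities per distance r
for name, values in smds.items():
    print(name, "first 5 values:", list(values)[:5])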
Step 3: Calculate the sum of the first 50 points of the SMD functions by running the following:
python calculate_smds_sum.py --path_smds D:\Hamed\SerpAIpipeline\smd_outputs --path_output D:\Hamed\SerpAIpipeline\smd_outputs_test
Here, the code goes to 'path_smds' and reads all SMD pickle files (calculated and saved in the previous step). It then calculates the sum of the first 'num_points' points of these functions. The results are saved in a csv file named 'dataset_smd_sum.csv' in the 'Dataset' folder. The script also creates a new spreadsheet named 'Dataset_BA1B_updated.xlsx' with the updated SMD sum values. Use this file as the input when running meow.py.
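For reference, the sum over the first 'num_points' values can be sketched as follows (the output file name here is illustrative; calculate_smds_sum.py itself writes 'dataset_smd_sum.csv' as described above):

import pickle
from pathlib import Path
import pandas as pd

smd_dir = Path(r"D:\Hamed\SerpAIpipeline\smd_outputs")
num_points = 50  # sum of the first 50 values of each SMD function

rows = []
for pkl_path in sorted(smd_dir.glob("*.pkl")):
    with open(pkl_path, "rb") as f:
        smds = pickle.load(f)
    # One row per core image: the sum of the first num_points of each function
    row = {"core_image": pkl_path.stem}
    row.update({name: float(sum(list(values)[:num_points])) for name, values in smds.items()})
    rows.append(row)

pd.DataFrame(rows).to_csv("dataset_smd_sum_sketch.csv", index=False)  # illustrative output name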