Why?

This program is meant for sanitizing and plotting data exported from the X-LIMS system. Inherently, X-LIMS tends to export CSV files that look something like this:

SampledDate	Parameter	SWPPRCalc	ReportedResult	Unit
6/1/2024	Biochemical Oxygen Demand	586	586	mg/L
6/1/2024	pH	6.66	6.66	pH Units
6/1/2024	% Volatile Suspended Solids	85	85	%
6/1/2024	Total Suspended Solids	814	814	mg/L
6/1/2024	TSS - Change in Wt	0.68	0.68	%
6/1/2024	Volatile Suspended Solids	690	690	mg/L

Notice that the Parameter column has multiple parameters in it, and that the SampledDate column has redundant values. Let's use pandas to get this CSV into wide format, with a column for each parameter and one row per time stamp! This will help with plotting curves, analyzing trends, and making our lives generally easier.

Organization:

src: Python source code
exports: CSV and PNG files generated during the run
imports: CSV files downloaded from X-LIMS. It is assumed that three lines will be skipped. The file names will be cleaned up to be used as chart titles, and the CSV import filenames will also be altered to make the export PNG and CSV filenames.
configs: Users can dictate which data is ignored.

Configuration:

Edit /configs/exclude_parameters.toml to control which variables are shown (and not) in the PNG export charts, by commenting the ones you want to see.
To print easy copy-and-paste variable lists from each file, edit /configs/show_parameter_lists.toml, so that show_parameter_lists = true; The default is false, as in show_parameter_lists = false.

Quick Start

Windows:

git clone https://github.com/City-of-Memphis-Wastewater/xlimsprep

cd xlimsprep

cat main.ps1

.\main

Linux:

git clone https://github.com/City-of-Memphis-Wastewater/xlimsprep

cd xlimsprep

cat main.sh

chmod +x main.sh

./main.sh

Step-by-step

Windows:

Clone this repository, generating a folder in your current folder.

git clone https://github.com/City-of-Memphis-Wastewater/xlimsprep

Navigate into the newly cloned directory.

cd xlimsprep

Check the contents of the /imports/ directory.

ls imports

Generate a directory called .venv, to hold a Python virtual environment. This virtual environment will use the same version of Python as your system installation unless otherwise specifed. Alterantively you can use pyenv (not shown). If you already have an environment, running this again should not hurt anything.

python -m venv .venv

Activate the virtual environment. If you already have an environment active, running this again should not hurt anything.

.venv\Scripts\activate

Prepare your local virtual environment with the packages necessary to run this software. This is preferred to pip installing packges to your system Python. Why? Ask a friend. Ask a chatbot. Ask your father.

pip install -r requirements.txt

Run the main script to import your CSV files. Export columnar CSV files (without ignoring any parameters). Export PNG charts, which will ignore any parameters listed in /configs/exclude_parameters.toml.

python -m src.main

Launch an image viewer to see the images in your /exports/ directory.

python -m src.view

Launch a tkinter window to enjoy navigating multiple CSV export files quickly. Small column titles? Tooltips to the rescue. Hover that mouse.

python -m src.xplor

Turn off your venv virtural environment. It is easy to forget to turn your venv on and off, which is a major argument in favor of Poetry.

Deactivate

See which parameters are suppressed from plotting to the PNG charts. If a parameters is commented out using a pound sign in the skip_parameters list, it will not be suppressed. Confusing, right? The list is of items that will be suppressed. So, if you prevent an item from being in the list by commenting it out, it will be kept in the plots.

cat .\configs\exclude_variables.toml
notepad .\configs\exclude_variables.toml

Okay, good luck.

Rollout:

My projects typically use poetry and pyenv, but this one does not.

This one just uses venv and a requirements.txt file.

Why? Because Termux has my attention right now.

Design Implications

Import and export files have not been added to the .gitignore, so that users will be able to test the software with useful files.
To run your own new files, add them to the /imports/ directory.
You can delete existing files in the /imports/ and /exports/ directories, if you would like.
To run your own CSV files exported from X-LIMS, you may paste them into the /imports/ folder, with any file naming convention that you like.
It is expected that 3 lines will be skipped in the imported CSV files, though this can be changed in /src/importer.py in the skip_rows value assigned in the pd.read_csv funtion.
The keys (column names) that are relied upon in data sanitization are "SampledDate","Parameter", and "SWPPRCalc". You could change these if you wanted, to suit your raw data.
You'll need to edit the TOML parameters in /configs/ to be useful to your specific files.
Files in the /exports/ directory will be overwritten with easy run of src.main, if they are assigned an existing filename.

Maintenance goals:

Add configuration TOML file in /configs/ for controlling the raw column names time, numeric data, and mixed parameter labels. These are currently:
- time: "SampledDate"
- numeric data: "SWPPRCalc"
- mixed parameter labels: "Parameter
Add configuration TOML file in /configs/ for controlling the skip row value for data frame CSV import.
Add /src/wipe.py file for wiping exisiting example data in /imports/, /exports/, and /page/.
Drive the contents in /page/ based on export when running /src/main.py. Currently /page/ is built manually.

These above goals will modularize the code to be useful for sanitizing data (into a wide format) beyond X-LIMS downloads.

AI Disclaimer:

Artificial Intelligence was used for partial code generation and troubleshooting. Any Python file with my name in it was written by me. The design decisions were made by me and the flow of the program function-to-function was decided based on my experience. Artifical intelligence was and is hugely informative on building the most modular and futureproof functions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Why?

Organization:

Configuration:

Quick Start

Windows:

Linux:

Step-by-step

Windows:

Rollout:

Design Implications

Maintenance goals:

AI Disclaimer:

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
configs		configs
exports		exports
imports		imports
page		page
src		src
webpage		webpage
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
main.ps1		main.ps1
main.sh		main.sh
requirements.txt		requirements.txt

License

City-of-Memphis-Wastewater/xlimsprep

Folders and files

Latest commit

History

Repository files navigation

Why?

Organization:

Configuration:

Quick Start

Windows:

Linux:

Step-by-step

Windows:

Rollout:

Design Implications

Maintenance goals:

AI Disclaimer:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages