This repo focuses on reproducible ETL pipelines and the application of reproducible analysis.
To find out more my article is available on Medium:
To clone this repo:
git clone https://github.com/josephlewisjgl/ReproducibleETLA.git
To run the code you can just run main.py with the name of a file you are looking to upload:
python main.py -f 'SharkAttacks_2022.csv'