This repository showcases extracting, cleaning, and uploading files to Google BigQuery, based on a project for the MSBA program at the University of Montana.

MeganAlbee/TheWedge

The Wedge Co-Op

Data engineering project with the University of Montana.


About

The Wedge Co-Op, located in Minneapolis, MN, is the largest co-operative grocery store in the United States. Through a partnership with the co-op, we have data from the point-of-sale (POS) system that the Wedge developed, covering January 1, 2010 through January 2017. The system logs every row of every receipt. The dataset is complex, and it was produced by a manual export process that was not consistent.

Goal and requirements

The goal is to create a data pipeline that extracts zipped transaction records, cleans them, and uploads them to Google BigQuery, and then to develop business reports that would serve a manager, owner, or operations manager.
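The extract-and-clean stage of such a pipeline can be sketched as follows. This is a minimal illustration, not the project's actual script: it uses only the Python standard library instead of pandas, and the column names (`card_no`, `total`), the file names, and the kinds of inconsistency handled (varying delimiters, optional header rows) are all assumptions made for the example. The upload step is left out because it needs credentials; one option for it would be `Client.load_table_from_dataframe` from the `google-cloud-bigquery` package.

```python
import csv
import io
import zipfile


def sniff_delimiter(sample: str) -> str:
    """Guess the delimiter of a line (assumed candidates: comma, semicolon, tab)."""
    counts = {d: sample.count(d) for d in (",", ";", "\t")}
    return max(counts, key=counts.get)


def read_zipped_rows(zip_bytes: bytes, header: list[str]) -> list[dict]:
    """Read every file in a zip archive into one list of row dictionaries."""
    rows = []
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            lines = archive.read(name).decode("utf-8").splitlines()
            if not lines:
                continue
            # Each export may use a different delimiter, so detect it per file.
            reader = csv.reader(lines, delimiter=sniff_delimiter(lines[0]))
            for i, fields in enumerate(reader):
                fields = [f.strip() for f in fields]
                # Some exports include a header row and some do not.
                if i == 0 and fields == header:
                    continue
                rows.append(dict(zip(header, fields)))
    return rows


def build_demo_zip() -> bytes:
    """Build a small in-memory archive with two inconsistent files (made-up data)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as archive:
        archive.writestr("a.csv", "card_no,total\n101,4.50\n102,3.25\n")
        archive.writestr("b.csv", "103;7.00\n")
    return buf.getvalue()


demo_rows = read_zipped_rows(build_demo_zip(), ["card_no", "total"])
```

Detecting the delimiter and header per file, rather than once for the whole archive, is what lets the same function cope with exports that were produced inconsistently.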


Key learnings

  • Writing Python scripts to extract and clean data with pandas
  • Working with large data sets
  • Using Python, SQL, and Google BigQuery (GBQ) to extract a sample and write it to a text file
  • Creating SQL summary tables in a single SQLite database via Python
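As an illustration of the last point, here is a small sketch of building a summary table in SQLite from Python. The table name, column names, and rows are hypothetical and chosen only for the example; the project's real schema and summaries may differ.

```python
import sqlite3

# An in-memory database keeps the example self-contained;
# a file path would be used to persist a single database on disk.
conn = sqlite3.connect(":memory:")

# Hypothetical transaction table with made-up rows.
conn.execute("CREATE TABLE transactions (date TEXT, card_no INTEGER, total REAL)")
conn.executemany(
    "INSERT INTO transactions VALUES (?, ?, ?)",
    [
        ("2015-01-03", 101, 4.50),
        ("2015-01-03", 102, 3.25),
        ("2015-01-04", 101, 7.00),
    ],
)

# Summary table: distinct card numbers and total sales per day.
conn.execute(
    """
    CREATE TABLE sales_by_date AS
    SELECT date,
           COUNT(DISTINCT card_no) AS owners,
           ROUND(SUM(total), 2) AS sales
    FROM transactions
    GROUP BY date
    """
)

summary = conn.execute(
    "SELECT date, owners, sales FROM sales_by_date ORDER BY date"
).fetchall()
conn.close()
```

Materializing the aggregate with `CREATE TABLE ... AS SELECT` means reports can read the small summary table instead of rescanning every receipt row.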

Data Visualization

In addition to this project, I created a dashboard with Looker Studio that gives a manager an overview of people, products and profits.

Preview online here.
