In this repository, I'm analyzing a dataset containing a company's online retail transactions. The dataset for this analysis can be found at UCI Machine Learning Repository. The original data set is stored in an excel file, with a total of 541,909 records and 8 fields. The file is imported into the Jupyter notebook and a subset is extracted during data prep.
This project requires Python (>= 3.5) and the following Python libraries installed:
This notebook covers the following:
- Exploratory Data Analysis
- Data Cleaning
- K-Means Clustering
- Further Analysis