Data cleaning project using Excel/Power Query on Audible audiobook data. Standardized formats, fixed inconsistencies, and prepared structured datasets for analysis. Great practice for data transformation and preprocessing in Excel.

farook8090/Amazon-audible

📊 Audible Dataset Cleaning: Excel Case Study

📌 Project Overview

This case study uses the Power Query Editor in Excel to clean and standardize an Audible dataset. The goal is a dataset that is consistent, accurate, and ready for analysis.

📝 Case Study Description

The dataset required multiple cleaning and formatting tasks to prepare it for meaningful analysis. Using Power Query, the following steps were performed:

🔹 Data Cleaning & Transformation Tasks

Name Standardization

Converted all entries in the name column to consistent title case.
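
The project performs this step in Power Query; as a rough illustration of the same logic outside Excel, here is a minimal Python sketch. The function name `standardize_name` is hypothetical, and the sketch assumes that "title case" means capitalizing the first letter of every whitespace-separated word and lowercasing the rest.

```python
import string


def standardize_name(name: str) -> str:
    """Return the title with each whitespace-separated word capitalized.

    string.capwords also trims the ends and collapses repeated spaces,
    and it leaves letters after an apostrophe lowercase.
    """
    return string.capwords(name)


print(standardize_name("the HOBBIT's  tale"))
```

`string.capwords` is used instead of `str.title()` because `str.title()` would capitalize the letter after an apostrophe.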

Author Column Split

Separated combined names in the author column into individual authors.
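
The README does not show how combined authors are delimited in the raw data. The Python sketch below assumes a comma separator, which may not match the actual dataset; `split_authors` is a hypothetical helper that only illustrates the splitting logic, not the Power Query step itself.

```python
def split_authors(raw: str, delimiter: str = ",") -> list[str]:
    """Split a combined author string into individual, trimmed names.

    The comma delimiter is an assumption; pass a different one if the
    data uses another separator. Empty fragments are dropped.
    """
    return [part.strip() for part in raw.split(delimiter) if part.strip()]


print(split_authors("Jane Doe, John Roe"))
```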

Release Date Formatting

Ensured all entries in the releasedate column follow a DD-MM-YYYY format.

Duration Conversion

Converted the time column from text to a recognized duration format.

Price Column Standardization

Ensured the price column is numeric.

Identified and handled non-numeric values.

Applied consistent currency formatting with two decimal places.

Ratings Transformation

Converted text-based ratings in the stars column into numeric values.

Narrators Column Split

Split the narratedby column into multiple columns when multiple narrators are listed.
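
Splitting into multiple columns differs from producing a list, because every row must end up with the same set of columns. The Python sketch below pads shorter rows with `None`. It assumes a comma delimiter and represents rows as dictionaries; the function name `split_narrators` and the output column names `narratedby.1`, `narratedby.2`, and so on are illustrative choices.

```python
def split_narrators(rows, column="narratedby", delimiter=","):
    """Replace `column` with numbered columns, one per narrator.

    The number of new columns equals the largest narrator count in any
    row; rows with fewer narrators are padded with None.
    """
    parts_per_row = [
        [p.strip() for p in row[column].split(delimiter) if p.strip()]
        for row in rows
    ]
    width = max((len(parts) for parts in parts_per_row), default=0)

    result = []
    for row, parts in zip(rows, parts_per_row):
        new_row = {key: value for key, value in row.items() if key != column}
        for i in range(width):
            new_row[f"{column}.{i + 1}"] = parts[i] if i < len(parts) else None
        result.append(new_row)
    return result


sample = [
    {"name": "A", "narratedby": "Ann Lee, Bo Kim"},
    {"name": "B", "narratedby": "Cy Orr"},
]
print(split_narrators(sample))
```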

Release Info Column Creation

Merged releasedate and language columns into a new column:

releaseinfo → "DD-MM-YYYY, Language"
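
The merge itself is a simple concatenation with a comma and a space. A minimal Python sketch, assuming each row is a dictionary whose releasedate value is already in DD-MM-YYYY form (the helper name `add_release_info` is hypothetical):

```python
def add_release_info(row: dict) -> dict:
    """Return a copy of the row with a releaseinfo column added.

    The input row is not modified.
    """
    merged = dict(row)
    merged["releaseinfo"] = f"{row['releasedate']}, {row['language']}"
    return merged


print(add_release_info({"releasedate": "04-08-2008", "language": "English"}))
```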

🛠️ Tools Used

Microsoft Excel

Power Query Editor

🚀 Outcome

After cleaning and standardization, the Audible dataset is:

Consistent and error-free.

Properly formatted for analysis.

Ready to be used for reporting and visualization.
