This repository showcases a complete data analysis workflow across three projects, built on real-world datasets and Python-based visualizations. It demonstrates data cleaning, exploratory data analysis (EDA), statistical insight, and business presentation.
- Dataset: `AB_NYC_2019.csv`
- Analysis split into 5 parts: loading, exploration, visualization, outliers, and summary
- Charts: price distributions, room types, availability trends, outliers
- Deliverables: Python scripts, summary reports, PowerPoint presentation
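
The outlier step listed above could be sketched as follows. This is a minimal illustration under stated assumptions, not the repository's actual script: it uses the common 1.5 × IQR rule, assumes the dataset has `price` and `room_type` columns, and runs on a few made-up rows so that it works without the CSV. The helper name `iqr_outlier_mask` is hypothetical.

```python
import pandas as pd


def iqr_outlier_mask(values: pd.Series, k: float = 1.5) -> pd.Series:
    """Return a boolean mask that is True where a value lies outside the IQR fences."""
    q1 = values.quantile(0.25)
    q3 = values.quantile(0.75)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return (values < lower) | (values > upper)


# Made-up rows for illustration only (column names are assumptions).
# With the dataset on disk you would instead use:
#   listings = pd.read_csv("AB_NYC_2019.csv")
listings = pd.DataFrame(
    {
        "room_type": [
            "Private room", "Entire home/apt", "Private room",
            "Shared room", "Entire home/apt", "Entire home/apt",
        ],
        "price": [60, 150, 75, 40, 180, 5000],
    }
)

# Flag rows whose price falls outside the fences, then show only those rows.
listings["is_outlier"] = iqr_outlier_mask(listings["price"])
print(listings[listings["is_outlier"]])
```

Keeping the rule in a small function makes it easy to reuse on other numeric columns or to tighten the fences by changing `k`.
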
- Basic data exploration
- Heatmaps, scatterplots, distribution plots
- Dataset: `titanic.csv`
- Visualizing survival rates by class and gender
- Bar charts and survival distributions
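
Survival rates by class and gender can be computed with a grouped mean. The sketch below is illustrative rather than the repository's own code: it assumes Kaggle-style column names (`Pclass`, `Sex`, `Survived`) and uses a handful of made-up rows, so the printed rates say nothing about the real dataset.

```python
import pandas as pd

# Made-up rows for illustration only (column names are assumptions).
# With the dataset on disk you would instead use:
#   passengers = pd.read_csv("titanic.csv")
passengers = pd.DataFrame(
    {
        "Pclass": [1, 1, 1, 2, 2, 3, 3, 3],
        "Sex": ["female", "female", "male", "female",
                "male", "male", "male", "female"],
        "Survived": [1, 1, 0, 1, 0, 0, 1, 0],
    }
)

# The mean of a 0/1 column is the share of survivors in each group;
# unstack turns the Sex level into columns, giving a class-by-gender table.
survival_rate = (
    passengers.groupby(["Pclass", "Sex"])["Survived"].mean().unstack("Sex")
)
print(survival_rate)
```

If matplotlib is installed, `survival_rate.plot(kind="bar")` draws the resulting table as a grouped bar chart.
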
- Data wrangling with `pandas`
- Visualizations using `matplotlib` and `seaborn`
- Outlier detection
- Reporting and storytelling with data
- Structured Python scripting
- Clone the repo:

      git clone https://github.com/arun-data-analyst/Data-Analysis-Case-Study.git
      cd Data-Analysis-Case-Study

Arun Acharya
Data Analyst in training | Willis College