Titanic EDA – Exploratory Data Analysis

Author: Tirtha Dutta
Date: 24 June 2025
Dataset: Kaggle – Yasser H Titanic Dataset

Objective

Explore the Titanic dataset using visual and statistical techniques to uncover relationships, trends, and anomalies.
This step helps lay the foundation for building accurate machine learning models.

Key EDA Steps Performed

Summary statistics for all numerical features
Histograms to visualize distributions
Boxplots to inspect outliers and spread
Correlation heatmap to check relationships
Markdown cell summarizing key findings
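
The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code. It assumes pandas and matplotlib are installed, which this README does not confirm. The function names (numeric_summary, iqr_outlier_counts, correlation_matrix, save_plots) are hypothetical, and the output file names are borrowed from the images/ folder listed under Folder Structure.

```python
from pathlib import Path

import pandas as pd


def numeric_summary(df: pd.DataFrame) -> pd.DataFrame:
    """Summary statistics (count, mean, std, min, quartiles, max) per numeric column."""
    return df.select_dtypes(include="number").describe()


def iqr_outlier_counts(df: pd.DataFrame) -> pd.Series:
    """Count values more than 1.5 * IQR outside the quartiles.

    This is the rule that default boxplot whiskers use, so the counts
    correspond to the points a boxplot would draw as outliers.
    """
    num = df.select_dtypes(include="number")
    q1, q3 = num.quantile(0.25), num.quantile(0.75)
    iqr = q3 - q1
    outside = (num < q1 - 1.5 * iqr) | (num > q3 + 1.5 * iqr)
    return outside.sum()


def correlation_matrix(df: pd.DataFrame) -> pd.DataFrame:
    """Pearson correlation between numeric columns (the data behind a heatmap)."""
    return df.select_dtypes(include="number").corr()


def save_plots(df: pd.DataFrame, out_dir: str = "images") -> None:
    """Save histograms, boxplots, and a correlation heatmap as PNG files."""
    # Imported lazily so the numeric helpers above work without matplotlib.
    import matplotlib
    matplotlib.use("Agg")  # headless backend, so this also runs outside a notebook
    import matplotlib.pyplot as plt

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    num = df.select_dtypes(include="number")
    n = len(num.columns)

    # One histogram per numeric column.
    num.hist(bins=20, figsize=(10, 8))
    plt.tight_layout()
    plt.savefig(out / "histograms.png")
    plt.close("all")

    # One boxplot per numeric column, each on its own axis scale.
    num.plot(kind="box", subplots=True, layout=(1, n), figsize=(3 * n, 4))
    plt.tight_layout()
    plt.savefig(out / "boxplots.png")
    plt.close("all")

    # Correlation heatmap drawn with plain matplotlib.
    corr = num.corr()
    fig, ax = plt.subplots(figsize=(6, 5))
    im = ax.imshow(corr.to_numpy(), cmap="coolwarm", vmin=-1, vmax=1)
    ax.set_xticks(range(n))
    ax.set_xticklabels(corr.columns, rotation=45, ha="right")
    ax.set_yticks(range(n))
    ax.set_yticklabels(corr.columns)
    fig.colorbar(im, ax=ax)
    fig.tight_layout()
    fig.savefig(out / "correlation_heatmap.png")
    plt.close("all")
```

With the repository layout described below, a call might look like df = pd.read_csv("data/titanic_cleaned.csv"), followed by numeric_summary(df) and save_plots(df). The column names in that file are not stated here, so the helpers select numeric columns by dtype rather than by name.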

How to run locally?

git clone https://github.com/tirtha103/titanic-eda.git
cd titanic-eda

Create and activate virtual environment (Windows)

python -m venv venv
venv\Scripts\activate

Install dependencies

pip install -r requirements.txt

Launch the notebook

jupyter lab notebooks/01_eda_walkthrough.ipynb

Folder Structure

titanic-eda/
│
├── data/
│   └── titanic_cleaned.csv                # Cleaned dataset (from Task 1)
│
├── images/
│   ├── histograms.png                     # Histograms of numeric features
│   ├── boxplots.png                       # Boxplots for outlier inspection
│   └── correlation_heatmap.png            # Correlation heatmap
│
├── notebooks/
│   └── 01_eda_walkthrough.ipynb           # Full EDA notebook
│
├── report/
│   └── titanic_eda_report.pdf             # Optional PDF report
│
├── requirements.txt                       # Python package requirements
├── LICENSE                                # MIT License
└── .gitignore                             # Ignored files
