Skip to content

bvslahari007/EDA-beginner-learning-notebook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Learning EDA from Scratch: A Beginner's Notebook

This is a beginner-friendly notebook where I practiced Exploratory Data Analysis (EDA) on the Titanic dataset using Python.
The goal was to understand EDA step-by-step in a hands-on, fun, and understandable way and to help other beginners do the same.


Tools Used

  • pandas
  • numpy
  • seaborn
  • matplotlib
  • Kaggle Notebooks

Dataset Used

Titanic: Machine Learning from Disaster
View Dataset on Kaggle


What’s Inside

  • Understanding the dataset
  • Types of data & summary stats
  • Handling missing values & duplicates
  • Univariate & bivariate analysis
  • Correlation & heatmaps
  • My personal notes and beginner-level thoughts

How to Add a Dataset in Kaggle Notebook

  1. Open your notebook
  2. Click on Add Data on the sidebar
  3. Search and add the dataset (e.g., "Titanic")
  4. Access the file in your code like this:
df = pd.read_csv("../input/titanic/train.csv")

Feedback Welcome

This notebook is a part of my learning journey, so if you have any suggestions, corrections, or ideas —
feel free to let me know! I’d love to improve this further and keep learning.

License

This project is shared for educational purposes only.
Please don’t copy or repost it without credit.
© Vinaya Sangeeta Lahari Baswa, 2025.

About

Beginner-friendly EDA practice on the Titanic dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published