I dove into a 75-year span of movie data ππΏ. Using a Kaggle dataset of 16 IMDb CSVs (genre-based), I unified and cleaned the data to explore:
π₯ Trends in ratings, votes, and box office gross π§ Correlations across genres, certificates, and runtime π Top actors, directors, and blockbuster years
- Key tools: Pandas, Seaborn, Data Cleaning, Python
- Key lessons: Data wrangling is everything, .loc is your friend, and good plots need good questions!
- Biography films run the longest
- Adventure genre dominates box office π°
- Christopher Nolan rules the votes β but not the ratings π€
- Check out the code, charts, and insights inside!