Project for "Applied Data Science with Python" course at OTH Regensburg - summer 2022
- Project Description Create a simple application that stores and extracts information about top 50 Hollywood actors and actresses on IMDb.
- Requirements:
- List of all available actors and actresses
- About the actor/actresses
- All-time movie names and years
- Awards to actor/actresses in different years
- Movie genre of actor/actresses
- Average rating of their movies (overall and each year)
- Top 5 movies, their respective years and genre
- Development Tool and Packages
- Programming Language: Python 3.8.x
- Modules & packages:
- BeautifulSoup, requests (crawling and scraping data)
- pandas, numpy, matplotlib (data processing & visualization)
- PyQt5 (GUI)
- IDE: PyCharm Professional 2021.3.1
- Database: SQLite
- How to clone & run app
Install requirements:
pip3 install -f requirements.txt
Run:
python src/view/main_window.py
- References: https://github.com/sebikolon/IMDB