Welcome to my Auto Ria porfolio project! From data scraping to cleaning and creating dashboard
You can try interactive dashboard Auto Ria Dashboard and i will also attach screenshots below
In this project i wanted to use my skills and knowledge to analyze the secondary market of cars in Ukraine based on the cars that are sold on the AutoRia website. The project involved scraping the data from AutoRia website (216.520 rows), loading the data, cleaning and creating visualization in PowerBI. The dataset includes: manufacturer, model_name, release_year, mileage, fuel_type, fuel, transmission, accident, price(USD), price(UAH), city, region, link.
auto_ria_cleaned.csv
: The primary datatset that was scraped and used in this project, dataset contains detailed information about cars. The file weights more than 25mb so it is inside of a auto_ria_cleaned.rar
.
city_region.csv
: This dataset was scraped and used to add region names into primary dataset, because we only had city or village names and for PowerBI visualization i needed regions
Auro Ria Data Scraping.ipynb
: To begin, I scraped data from the AutoRia website, which contains information about cars that are on sale and saved as auto_ria.csv
.
Auto Ria Cleaning.ipynb
: In this file i did data cleaning where i worked with: handling Null values; changing column names; reordering columns; changing data type; making 2 different columns from a single one; dropping unneeded columns. After cleaning up the data i saved cleaned file as a auto_ria_cleaned.csv
.
City&Regions Data Scraping.ipynb
: In this file i scraped city, village and regions of Ukraine from Wikipedia and saved as a city_region.csv
to add regions into primary dataset
Web scraping, creating dataset, data cleaning, data analysis, PowerBI visualization
Python, BeautifulSoup, requests, Pandas, PowerBI
PowerBI dashboard with 4 pages containig all necessary information about secondary car market in Ukraine, which will help you make certain decisions to solve problems associated with used cars
I would like to give special thanks to AutoRia for the data i collected