League-of-Legends-LoL-Match-Analysis-using-PySpark-6GB-Dataset

League of Legends (LoL) Match Analysis using PySpark | 6GB Dataset This repository contains an analysis of League of Legends (LoL) match data using PySpark. The analysis is performed on a large-scale dataset of approximately 6GB in size. The dataset includes information about matches, champions, items, and various statistics.

Dataset The dataset used in this analysis consists of LoL match data, which includes information about individual matches, such as the participating teams, champions selected by each player, item builds, match outcomes, and more. The dataset size is approximately 6GB.

Analysis The analysis performed in this project utilizes PySpark, a powerful framework for big data processing and analysis. The PySpark framework enables efficient processing and analysis of the large-scale LoL match dataset.

Key components of the analysis include:

Calculating champion win rates based on match outcomes Determining ban rates for champions Analyzing item synergies and their impact on win rates Examining pick rates for champions and items Investigating champion synergies and their win rates Exploring country-specific champion win rates Repository Structure

The repository is organized as follows:

data/: Folder containing the large-scale dataset (not included in the repository due to its size) analysis.ipynb: Jupyter Notebook containing the PySpark code for the analysis Riot_match_collector.ipynb: Jupyter Notebook containing the code to download the data set from RIOT api items.json: JSON file containing information about LoL items champions_with_highest_win_rate_per_country.csv: CSV file containing champions with the highest win rates per country item_synergies.csv: CSV file containing item synergies and their win rates item_pick_rate.csv: CSV file containing item pick rates item_win_rate.csv: CSV file containing item win rates champion_synergies.csv: CSV file containing champion synergies and their win rates champion_pick_rate.csv: CSV file containing champion pick rates champion_ban_rate.csv: CSV file containing champion ban rates champion_win_rate.csv: CSV file containing champion win rates Usage To replicate the analysis or explore the results, follow these steps:

Obtain the LoL match dataset y executing the Riot_match_collector.ipynb Jupyter Notebook (approximately 6GB in size) and place it in the data/ folder. Install the required libraries and dependencies as specified in the requirements.txt file. Execute the code in the analysis.ipynb Jupyter Notebook to perform the analysis. Explore the generated CSV files to access the results of the analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
graphs		graphs
results		results
README.md		README.md
Riot_match_collector.ipynb		Riot_match_collector.ipynb
analysis.ipynb		analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

League-of-Legends-LoL-Match-Analysis-using-PySpark-6GB-Dataset

About

Uh oh!

Releases

Packages

Languages

kareemfarahat/League-of-Legends-LoL-Match-Analysis-using-PySpark-6GB-Dataset

Folders and files

Latest commit

History

Repository files navigation

League-of-Legends-LoL-Match-Analysis-using-PySpark-6GB-Dataset

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages