Skip to content

Ioannis-Raptis/PySpark_Practice_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

PySpark_Practice_Project

A short project using PySpark. We used the bank marketing dataset from Kaggle. We handle missing values, label and encode categorical data. We scale numeric data and create a Random Forest model.

Dataset available at: https://www.kaggle.com/janiobachmann/bank-marketing-dataset

About

A short practice project using Pyspark.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages