Clean and process email data stored in Google Cloud Storage using PySpark. Identify and save the top 10 spam and ham emails based on a spam indicator. Calculate and save TF-IDF values for these top emails to analyze their content.

ananyadas06/cloud-Technology--detecting-spam-and-ham-mails
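
As a rough illustration of the pipeline described above, here is a minimal sketch in plain Python rather than PySpark, so it runs without a cluster or a Cloud Storage bucket. It is not the repository's code. It assumes each email arrives as a hypothetical `(text, spam_score)` pair in which a higher score means more spam-like. The function names `tokenize`, `top_emails`, and `tf_idf` and the sample emails are invented for the example. The IDF uses the smoothed form log((N + 1) / (df + 1)), which is the formula documented for Spark ML's `IDF` estimator.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only (a simple cleaning step)."""
    return re.findall(r"[a-z]+", text.lower())


def top_emails(emails, n=10):
    """Return the n highest-scoring (spam) and n lowest-scoring (ham) emails.

    `emails` is a list of (text, spam_score) pairs; the score is an assumed
    stand-in for the "spam indicator" mentioned in the description.
    """
    ranked = sorted(emails, key=lambda e: e[1], reverse=True)
    top_spam = ranked[:n]
    top_ham = ranked[::-1][:n]
    return top_spam, top_ham


def tf_idf(documents):
    """Compute one {term: weight} dict per document.

    Term frequency is the raw count; IDF is log((N + 1) / (df + 1)).
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)
        weights.append(
            {term: count * math.log((n_docs + 1) / (df[term] + 1))
             for term, count in tf.items()}
        )
    return weights


# Hypothetical sample data, for illustration only.
emails = [
    ("Win a free prize now", 0.97),
    ("Meeting moved to 3pm", 0.02),
    ("Free free offer, claim your prize", 0.91),
    ("Lunch tomorrow?", 0.05),
]

top_spam, top_ham = top_emails(emails, n=2)
spam_weights = tf_idf([text for text, _ in top_spam])
ham_weights = tf_idf([text for text, _ in top_ham])

for (text, score), weights in zip(top_spam, spam_weights):
    print(score, text, weights)
```

With only two spam documents, a word that appears in both (such as "free") gets weight 0, while a word unique to one of them gets a positive weight. In a PySpark job the same steps would typically map onto DataFrame sorting plus `pyspark.ml.feature` transformers such as `Tokenizer`, `HashingTF`, and `IDF`. Whether the repository does it that way is not stated here.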
