ntua-el20069/advancedDB-PySpark-project

PySpark Project for LA Crime Data Analysis

This repository was created as an assignment for the course Advanced Database System Concepts (NTUA), 2024-25. The project focuses on analyzing Los Angeles crime data from 2010 to 2024, primarily based on

Execution Instructions

The Introduction section of the notebook contains the execution instructions.

Prerequisites

  • Store the datasets required for the analysis, and update the links in the notebook that refer to them so they point to your copies.
  • Install PySpark and Apache Sedona.
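
Before opening the notebook, it may help to confirm that both libraries can be imported. The following is a minimal sketch, not part of the project itself: the helper name `missing_packages` is hypothetical, and it assumes the libraries are installed from PyPI as `pyspark` and `apache-sedona` (imported as `pyspark` and `sedona`).

```python
from importlib.util import find_spec

# Import name -> PyPI package name (assumed installation route: pip).
REQUIRED = {
    "pyspark": "pyspark",
    "sedona": "apache-sedona",
}


def missing_packages(required=REQUIRED):
    """Return the PyPI names of required packages that cannot be imported."""
    return [
        pip_name
        for module_name, pip_name in required.items()
        if find_spec(module_name) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages, try: pip install " + " ".join(missing))
    else:
        print("PySpark and Sedona are importable.")
```

The check only verifies that the modules can be located; it does not start a Spark session or validate the dataset links, which still need to be updated by hand in the notebook.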
