Skip to content

franceslinyc/Big-Data-Analysis-of-NYC-Trip-data-2021

Repository files navigation

Big-Data-Analysis-of-NYC-Trip-data-2021

Final project for CS512 Data Science Tools and Programming (Big Data) at OSU

Description

Wrangle, explore and analyze the NYC Taxi & Limousine Commission's trip records of 2019-2020 (~ 35.26 GB) using cloud computing services (Google Cloud Platform's Compute Engine, BigQuery, Cloud Dataproc), R (Tidyverse), Python (PySpark / Apache Spark) and query language.

Documentation

Lin_CS512_Final_Project.pdf contains the full report.

Lin_Plotting contains codes that join datasets and produce plots.

About

Final project for CS512 Data Science Tools and Programming (Big Data) at OSU

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages