Final project for CS512 Data Science Tools and Programming (Big Data) at OSU
Wrangle, explore and analyze the NYC Taxi & Limousine Commission's trip records of 2019-2020 (~ 35.26 GB) using cloud computing services (Google Cloud Platform's Compute Engine, BigQuery, Cloud Dataproc), R (Tidyverse), Python (PySpark / Apache Spark) and query language.
Lin_CS512_Final_Project.pdf contains the full report.
Lin_Plotting contains codes that join datasets and produce plots.