Skip to content

The Data-Engineering-Project-Taxi-data involves analyzing and processing taxi data to extract valuable insights for business optimization. It includes building ETL pipelines, data modeling, and real-time analytics.

Notifications You must be signed in to change notification settings

Lucky-akash321/Data-Engineering-Taxi-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data-Engineering-Project-Taxi-data

This project aims to securely manage, streamline, and perform analysis on the structured data of USA Taxi service

Introduction

This project focus on performing data analysis on Taxi data using various tools like GCP Storage, Python, Compute Instance, Mage Data Pipeline Tool, BigQuery, and Looker Studio

Project Flow

Technology used

Python, Modern Data Pipeine Tool - https://www.mage.ai/, Google Cloud Platform like Google Storage, Compute Instance,BigQuery, and Looker Studio

Dataset used

Website - https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
Data Dictionary https://www.nyc.gov/assets/tlc/downloads/pdf/data_dictionary_trip_records_yellow.pdf

Data Model

About

The Data-Engineering-Project-Taxi-data involves analyzing and processing taxi data to extract valuable insights for business optimization. It includes building ETL pipelines, data modeling, and real-time analytics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published