Skip to content

johbry17/Cyclistic-Analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

70 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cyclistic Analytics

Google Data Analytics Capstone

What drives a casual rider to become a member? An R-powered look into bikeshare behavior.

🔗 Final Report (2024)
🔗 Final Report (2019)
📊 EDA (2024)
📊 EDA (2019)

Table of Contents

Project Overview

Cyclistic is a hypothetical bikeshare company in Chicago. This project investigates differences between casual riders and annual members to support a key business goal: increase member conversions.

Using public data provided by Divvy (via Lyft), the analysis:

  • Cleans and preprocesses raw CSVs from 2024 and 2019
  • Explores ride patterns by time, location, and bike type
  • Visualizes data with ggplot2, leaflet, and R Markdown
  • Provides strategic recommendations based on real insights

Originally built as the final project for the Google Data Analytics Certificate, it follows the six-phase analytics process:

Six Phases of Data Analytics

Features

  • 📄 R Markdown reports knitted to interactive HTML
  • 🗺️ Interactive Leaflet maps showing ride density by user type
  • 📈 Time-of-day usage patterns (hourly ride histograms)
  • 📆 Weekday vs. weekend trends by user type
  • 📊 Bike type and ride volume by month
  • 🧭 Year-over-year comparisons to spot post-pandemic trends
  • 📌 Top stations mapped for casuals vs. members
  • 💡 Strategic recommendations for converting casual riders

Tools & Technologies

  • Language: R, R Markdown
  • Packages: tidyverse, sf, ggspatial, leaflet, leaflet.extras, fontawesome
  • Cleaning Scripts: data_cleaning_v2.R, data_cleaning_v3.R
  • Deployment: GitHub Pages (HTML reports)

Usage

  1. View the final summary reports:

  2. Explore the exploratory data analysis:

  3. Run cleaning scripts:

    • Download trip data from Divvy Data Portal
    • Use provided R scripts to clean and process the data locally:
      • data_cleaning_v2.R for 2019
      • data_cleaning_v3.R for 2024

Gallery

Time & Day Plots:

Rides per Hour, by Member Type

Rides per Hour and Day of Week, by Member Type

Map Visualizations:

2019 comparison map

2024 casual stations map

2024 member stations map

Bike Type by Month:

2024 Rides per Month by Bike and Member Type

Certificate

Final capstone project for Google Data Analytics Professional Certificate

Google Data Analytics Certificate

References

Divvy and Lyft provided the data to Google and Coursera.

Licenses

Acknowledgements

Thanks to:

  • Google and Coursera for the learning platform
  • Divvy, Lyft, and Motivate International Inc. for providing the data
  • Everyone working on sustainable mobility solutions

Author

Bryan Johns, March 2025
bryan.johns.official@gmail.com | LinkedIn | GitHub | Portfolio

AI logo AI generated mock logo

About

Exploratory and statistical analysis of ride data for a fictional bikeshare company, completed in R and R Markdown.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages