An end-to-end data engineering and visualization project that fetches real-time cricket data from the Cricbuzz API and builds insightful dashboards using Snowflake and Power BI.
This project captures live and historical cricket data โ including player stats, match outcomes, series insights, partnerships, and rankings โ to build a robust analytical platform for cricket lovers, analysts, and fantasy players.
- Data Extraction: Python + RapidAPI (Cricbuzz API)
- Data Warehouse: Snowflake
- Data Transformation & Cleaning: Python (pandas), SQL (Snowflake)
- Dashboard & Visualization: Microsoft Power BI
Cricbuzz API
โ
Python ETL Scripts
โ
Cleaned & Transformed Data
โ
Snowflake Data Warehouse
โ
Power BI Dashboards
Table Name | Description |
---|---|
icc_rankings_main |
ICC player & team rankings (Test, ODI, T20) |
team_players_main |
Player roster and details per team |
player_batting_stats_main |
Historical batting performance |
player_bowling_stats_main |
Historical bowling performance |
series_main |
Series-level metadata |
matches_main |
Match details including teams, results |
innings_main |
Inning-wise stats like score, RR, wickets |
battingperformance_main |
Player-wise batting performance per innings |
bowlingperformance_main |
Over-by-over bowling breakdown |
partnerships_main |
Batting partnerships analysis |
wickets_main |
Fall of wickets detail |
- Top players by format and role
- Filter by country and position
- Batting and bowling performance
- Player career trends and match impact
- Win/loss summaries
- Toss impact vs match results
- Match-wise performance drilldown
- Ball-by-ball impact
- Top partnerships
- Fall of wickets patterns
git clone https://github.com/AkshSurani/cricbuzz-cricket-data-pipeline-dashboard.git
cd cricbuzz-dashboard
pip install -r requirements.txt
Create a .env
file:
RAPIDAPI_KEY=your_cricbuzz_api_key
SNOWFLAKE_ACCOUNT=your_account
SNOWFLAKE_USER=your_username
SNOWFLAKE_PASSWORD=your_password
SNOWFLAKE_DATABASE=cricket_db
SNOWFLAKE_SCHEMA=public
SNOWFLAKE_WAREHOUSE=compute_wh
Run python Files.
Configure your Snowflake connection and execute the load script using Python or SQL.
- Open
Cricbuzz_Cricket_Dashboard.pbix
- Connect to Snowflake
- Refresh visuals to see the latest data
- Top Run Scorers (avg, SR, consistency)
- Top Wicket Takers (econ, avg, impact)
- Player Rankings (format-wise, role-wise)
- Win/Loss records
- Toss decision impact
- Series-wide score summaries
- Batting Partnerships
- Powerplay & Death Over performance
- Fall of Wickets Timeline
- Squad composition by roles
- Margin of victory
- Country-wise dominance
- โฐ Real-time Auto Refresh using GitHub Actions or Airflow
- ๐ฑ Fantasy Cricket Insights & Player Selection Advisor
- ๐งฎ Predictive Analysis using ML models (Win Predictor, Player Form)
Aksh Surani โ Data Engineer LinkedIn
Divy Kaila โ Data Engineer LinkedIn
This project is intended for educational and portfolio use.
API access is bound by Cricbuzz and RapidAPI terms of service.
Give it a โญ on GitHub!
Or share it with your cricket analytics buddies. ๐