This repository contains processed Toronto budget data for 2019-2024 and tools for data manipulation.
toronto_budget_combined_2024_to_2019-v2.csv
- Combined budget data from all years (2024-2019) stacked vertically- Total rows: 112,765
- Order: 2024 data at top, down to 2019 at bottom
- All columns properly aligned across years
append_budget_data_csv.py
- Working script using Python's CSV module to combine budget filesappend_budget_data.py
- Alternative script using pandas (requires compatible NumPy version)
All files contain the following columns:
- Program
- Service
- Activity
- Expense/Revenue
- Category Name
- Sub-Category Name
- Commitment item
- Amount
- Year
- Fixed header corruption in 2020 budget file (missing "Program" column name)
- All files now have consistent column structure
- Data stacked chronologically from 2024 (top) to 2019 (bottom)
To recreate the combined dataset:
python append_budget_data_csv.py
- 2024: 18,585 rows
- 2023: 20,128 rows
- 2022: 19,715 rows
- 2021: 17,127 rows
- 2020: 17,779 rows
- 2019: 19,428 rows