This repository contains raw data for the PGLearn Midwest24k dataset.
The original Matpower and time series data files are part of the Texas A&M University Electric Grid Datasets, namely the 23,643-bus Midwest synthetic grid. If you use this dataset in your work, please cite the appropriate papers.
- Make sure you have julia installed
- Instantiate the current environment
julia --project=. -e 'using Pkg; Pkg.instantiate()'
Data processing is not required if you cloned the repository.
This code is included for reproducibility.
- Download raw demand data files from TAMU (see here)
- Copy the following files into the
data/
folderMidwest24k_20220923.m
MISOSPP2020MWtimeseries.csv
MISOSPP2020MVARtimeseries.csv
- Rename matpower file
mv data/Midwest24k_20220923.m data/Midwest24k_TAMU_20220923.m
- Execute the data processing script
julia --project=. process.jl
It is recommended to execute this script with multiple threads
julia --project=. --threads=4 interpolate.jl
By default, the demand data is interpolated down to 5-min granularity.
While the underlying code supports other granularities (see interpolate_demand
function in interpolate.jl
), this functionality is not exposed from the command line.
Please open an issue if you'd like to request this feature.