ts-deepar-pipeline

A flexible pipeline for modeling time series data with DeepAR, generating predictions, and post-processing the results. It has four stages:

Initial grouping of the data to the desired level for forecasting
Processing into the DeepAR JSON format
Hyperparameter tuning job
Batch Transform and post-processing of results
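As a rough illustration of the second stage, the sketch below serializes a few series into the JSON Lines layout that DeepAR reads, with one JSON object per series holding a "start" timestamp and a "target" list. The helper name to_deepar_lines and the sample counts are hypothetical and are not taken from deepar_prep.py.

```python
import json


def to_deepar_lines(series_by_name, start):
    """Serialize {name: [values]} into JSON Lines text, one series per line."""
    lines = []
    for name in sorted(series_by_name):
        # Each record carries the series start time and its observed values.
        record = {"start": start, "target": list(series_by_name[name])}
        lines.append(json.dumps(record))
    return "\n".join(lines)


# Made-up weekly counts for two categories, for illustration only.
example = {
    "auto-theft": [12, 15, 9],
    "theft-from-motor-vehicle": [30, 28, 35],
}
jsonl = to_deepar_lines(example, "2016-01-04 00:00:00")
print(jsonl)
```

Series names are not part of the record in this sketch, so the sorted order of the names is what ties each line back to its category.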

Data quality and Seasonality

Crime data is quite stochastic, but seasonal patterns prevail. Daily counts are too noisy to forecast effectively, so predicting at the weekly level is a more tractable problem.
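Moving from daily to weekly granularity can be sketched with the standard library alone. The example below assumes each incident is available as a calendar date and that weeks start on Monday; both are assumptions for illustration, and the function names are hypothetical rather than taken from init_data_grouping.py.

```python
from collections import Counter
from datetime import date, timedelta


def week_start(day):
    """Return the Monday of the week containing `day`."""
    return day - timedelta(days=day.weekday())


def weekly_counts(incident_dates):
    """Count incidents per week, keyed by the Monday that starts the week."""
    counts = Counter(week_start(d) for d in incident_dates)
    return dict(sorted(counts.items()))


# Made-up incident dates: three fall in the week of 2016-01-04, one in the next.
incidents = [date(2016, 1, 4), date(2016, 1, 6), date(2016, 1, 10), date(2016, 1, 11)]
weekly = weekly_counts(incidents)
print(weekly)
```

Weeks with no incidents do not appear in the result, so a real series would need those gaps filled with zeros before being passed to a model.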

An interesting correlation between two Categories, given the global pandemic of 2020:

With fewer cars on the roads, more are stolen or looted.

Training and Test Data

The data runs from January 2016 through early January 2021. To assess performance on a sizeable portion of the data, I chose to train through December 2019 and predict the 52 weeks of 2020.
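A minimal sketch of such a date-based holdout is shown below. It assumes the weekly series is a mapping from week-start date to count and that a week belongs to the training set when it starts before 1 January 2020; how the repository handles the week that straddles the year boundary is not stated, so that rule is an assumption.

```python
from datetime import date


def split_by_cutoff(weekly, cutoff):
    """Split {week_start: count} into training and held-out value lists."""
    ordered = sorted(weekly.items())
    train = [count for week, count in ordered if week < cutoff]
    test = [count for week, count in ordered if week >= cutoff]
    return train, test


# Made-up counts around the year boundary, for illustration only.
weekly = {
    date(2019, 12, 16): 5,
    date(2019, 12, 23): 7,
    date(2019, 12, 30): 4,
    date(2020, 1, 6): 6,
}
train, test = split_by_cutoff(weekly, cutoff=date(2020, 1, 1))
print(train, test)
```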

The maximum context length, in weeks, is the training set length minus the prediction length:

(Training Set length) - (Prediction Period) = 209 - 52 = 157
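The same arithmetic can be written as a small check. The hyperparameter names time_freq, context_length, and prediction_length are DeepAR hyperparameters; the helper function and the way the values are grouped into a dictionary here are only an illustrative sketch, not the contents of tuning_job.py.

```python
def max_context_length(train_length, prediction_length):
    """Largest context window that still leaves room for one prediction window."""
    if prediction_length >= train_length:
        raise ValueError("prediction_length must be shorter than the training series")
    return train_length - prediction_length


TRAIN_WEEKS = 209       # weekly observations in the training set
PREDICTION_WEEKS = 52   # forecast horizon covering 2020

context_weeks = max_context_length(TRAIN_WEEKS, PREDICTION_WEEKS)

# Hyperparameters are passed as strings in this sketch.
hyperparameters = {
    "time_freq": "W",
    "context_length": str(context_weeks),
    "prediction_length": str(PREDICTION_WEEKS),
}
print(hyperparameters)
```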

Modeling results

After a few iterations of hyperparameter tuning jobs, the results are not as good as expected. Things to consider that may affect model performance:

Data Considerations

Immediate drop in crime as a result of the Denver COVID-19 shutdown in March 2020
Civil unrest and rioting in late May and early June 2020
Lower crime overall due to COVID-19 lockdowns across 2020

Further Modeling Considerations

Add categorical features for each series (e.g. a broader group that combines similar categories such as "auto-theft" and "theft-from-motor-vehicle")
Add dynamic features (e.g. binary indicators for scheduled political events and holidays)
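Both ideas map onto optional fields of the DeepAR input record: "cat" holds integer category codes for the series, and "dynamic_feat" holds one list per feature, each as long as the target. The sketch below is only an illustration of that shape; the group mapping, the "traffic-accident" name, and the holiday flags are hypothetical values, not data from this repository.

```python
import json

# Hypothetical broader groups: both theft categories share code 0.
GROUPS = {
    "auto-theft": 0,
    "theft-from-motor-vehicle": 0,
    "traffic-accident": 1,
}


def build_record(name, start, target, holiday_flags):
    """Build one training record with a category code and one dynamic feature."""
    if len(holiday_flags) != len(target):
        raise ValueError("each dynamic feature must match the target length")
    return {
        "start": start,
        "target": list(target),
        "cat": [GROUPS[name]],
        "dynamic_feat": [list(holiday_flags)],
    }


# Made-up counts and a made-up holiday indicator for three weeks.
record = build_record("auto-theft", "2016-01-04 00:00:00", [12, 15, 9], [0, 0, 1])
print(json.dumps(record))
```

Because holidays and scheduled events are known ahead of time, their indicator values would also need to be supplied for the forecast weeks when requesting predictions.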

Visualization of modeling results for each Crime Category forecasted