Project Dionysus is an educational time series analysis project focused on modeling and forecasting wine production data from multiple producers.
The project demonstrates how to:
- Load and preprocess real-world time series datasets with inconsistent date formats
- Structure reusable, scalable analysis code via object-oriented design
- Generate summary statistics and forecasts using SARIMAX modeling
Three sample datasets are provided:
Almeirim.csv
Benavente.csv
Cartaxo.csv
Each contains monthly production estimates (in thousands of liters) over a shared time interval. The datasets reflect three distinct formatting styles, representative of future scalability challenges.
Wine
Class: Encapsulates data loading, validation, reporting, and forecasting logicreport()
Method: Outputs filtered summary statistics over a user-defined date rangeforecast()
Method: Uses SARIMAX to predict future output and visualize trends- Automation: Automatically processes all CSV files in the /data/ directory
Install dependencies:
pip install -r requirements.txt