This repository presents a rigorous univariate time-series forecasting study of the Consumer Price Index (CPI) using an ARIMA(5,1,0) specification.
The objective is to:
- Implement statistically disciplined out-of-sample validation
- Benchmark against a naive persistence model
- Generate a forward 12-month CPI projection
- Demonstrate leakage-safe forecasting methodology
This project emphasizes methodological correctness rather than purely predictive performance.
- Frequency: Monthly
- Total observations: 190 months (~15.8 years)
- Target variable: CPI index level
The dataset was chronologically indexed and strictly partitioned prior to model estimation.
To remove long-run trend and induce stationarity, first differencing was applied:
An ARIMA(5,1,0) model was selected:
- ( d = 1 ) → First differencing
- ( p = 5 ) → Five autoregressive lags
- ( q = 0 ) → No moving-average component
The differenced model is defined as:
where:
- ( c ) = constant
- ( \phi_i ) = autoregressive coefficients
- ( \varepsilon_t ) = white noise error term
To simulate realistic forecasting conditions:
- Training set: First 178 months
- Test set: Final 12 months (strictly held out)
The model was fit exclusively on training data and used to forecast the unseen 12-month horizon.
No future information was used during estimation.
A naive persistence forecast was implemented as a baseline:
This establishes a minimal predictive benchmark.
Performance was assessed using standard out-of-sample error measures.
| Metric | ARIMA | Naive |
|---|---|---|
| RMSE | 0.8921 | 5.3743 |
| MAE | 0.6927 | 4.7434 |
| MAPE | 0.21% | — |
The ARIMA model substantially outperforms the naive persistence benchmark, demonstrating strong short-term predictive accuracy under proper validation.
validation_plot.png— 12-month holdout comparison (ARIMA vs Naive)future_forecast.png— Forward 12-month CPI projectionvalidation_metrics.md— Numerical performance summary
The forward 12-month projection suggests continued CPI momentum consistent with recent inflation dynamics.
Forecast uncertainty increases with horizon length due to recursive error propagation.
- Linear structure assumption (may underperform during structural macroeconomic shocks)
- No explicit seasonal modeling (SARIMA extension could be explored)
- Optimized for short-term forecasting performance
Meherab Hossain Shafin
Independent time-series forecasting study demonstrating statistically disciplined model validation and benchmark comparison for applied macroeconomic forecasting.
Install dependencies:
pip install pandas numpy matplotlib statsmodels scikit-learn
---