Forecasting Performance Comparison: LSTM Model vs. Dynamic and Static Forecasting of ARIMA and Random Walk Models-Evidence from S&P500

Gul, Levent; YÜKSEL HALİLOĞLU, EBRU; Sevgi, N.; Dogan, Nukhet; Berument, N.

doi:10.35378/gujs.1709484

Forecasting Performance Comparison: LSTM Model vs. Dynamic and Static Forecasting of ARIMA and Random Walk Models-Evidence from S&P500

Gul L., YÜKSEL HALİLOĞLU E., Sevgi N. H., Dogan N., Berument N. H.

Gazi University Journal of Science, cilt.39, sa.1, ss.499-522, 2026 (ESCI, Scopus, TRDizin)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 39 Sayı: 1
Basım Tarihi: 2026
Doi Numarası: 10.35378/gujs.1709484
Dergi Adı: Gazi University Journal of Science
Derginin Tarandığı İndeksler: Emerging Sources Citation Index (ESCI), Scopus, TR DİZİN (ULAKBİM)
Sayfa Sayıları: ss.499-522
Anahtar Kelimeler: Autoregressive Integrated Moving Average, Cross-validation, Long Short-Term Memory, Machine Learning, Random Walk
Gazi Üniversitesi Adresli: Evet

Özet

This paper re-evaluates the forecasting performance of Long Short-Term Memory (LSTM) models against traditional Autoregressive Integrated Moving Average (ARIMA) and Random Walk (RW) models using S&P 500 index data. The LSTM models in the literature, which utilize observed lagged values for their forecasting (static forecasting), are compared to traditional models that use recursive, self-generated predictions for forecasting (dynamic forecasting). To provide comparable statistics, we gather relevant statistics for the static forecasting of ARIMA and RW models. We repeat the exercise for three different cross-validation schemes. The empirical evidence presented here suggests that the static forecasting power of static ARIMA and RW models consistently matches or outperforms that of LSTMs. Moreover, we find that while traditional models remain robust across varying sample sizes, the performance of LSTMs decreases when the training datasets are reduced. The empirical evidence presented here suggests that the reported superiority of LSTM in financial time series forecasting might be due to the forecasting method that is employed rather than a genuine predictive advantage.