File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.2196/67156
- Scopus: eid_2-s2.0-105004988488
- PMID: 40354111
- Find via

Supplementary
- Citations:
- Appears in Collections:
Article: Population-Wide Depression Incidence Forecasting Comparing Autoregressive Integrated Moving Average and Vector Autoregressive Integrated Moving Average to Temporal Fusion Transformers: Longitudinal Observational Study
| Title | Population-Wide Depression Incidence Forecasting Comparing Autoregressive Integrated Moving Average and Vector Autoregressive Integrated Moving Average to Temporal Fusion Transformers: Longitudinal Observational Study |
|---|---|
| Authors | |
| Keywords | ARIMA deep learning depression incidence forecasting electronic health records machine learning medical informatics population-wide depression incidence structural break scenarios temporal fusion transformers vector-ARIMA |
| Issue Date | 12-May-2025 |
| Publisher | JMIR Publications |
| Citation | Journal of Medical Internet Research, 2025, v. 27, n. 1 How to Cite? |
| Abstract | Background: Accurate prediction of population-wide depression incidence is vital for effective public mental health management. However, this incidence is often influenced by socioeconomic factors, such as abrupt events or changes, including pandemics, economic crises, and social unrest, creating complex structural break scenarios in the time-series data. These structural breaks can affect the performance of forecasting methods in various ways. Therefore, understanding and comparing different models across these scenarios is essential. Objective: This study aimed to develop depression incidence forecasting models and compare the performance of autoregressive integrated moving average (ARIMA) and vector-ARIMA (VARIMA) and temporal fusion transformers (TFT) under different structural break scenarios. Methods: We developed population-wide depression incidence forecasting models and compared the performance of ARIMA and VARIMA-based methods to TFT-based methods. Using monthly depression incidence from 2002 to 2022 in Hong Kong, we applied sliding windows to segment the whole time series into 72 ten-year subsamples. The forecasting models were trained, validated, and tested on each subsample. Within each 10-year subset, the first 7 years were used for training, with the eighth year for setting hold-out validation, and the ninth and tenth years for testing. The accuracy of the testing set within each 10-year subsample was measured by symmetric mean absolute percentage error (SMAPE). Results: We found that in subsamples without significant slope or trend change (structural break), multivariate TFT significantly outperformed univariate TFT, vector-ARIMA (VARIMA), and ARIMA, with an average SMAPE of 11.6% compared to 13.2% (P = .01) for univariate TFT, 16.4% (P = .002) for VARIMA, and 14.8% (P = .003) for ARIMA. Adjusting for the unemployment rate improved TFT performance more effectively than VARIMA. When fluctuating outbreaks happened, TFT was more robust to sharp interruptions, whereas VARIMA and ARIMA performed better when incidence surged and remained high. Conclusions: This study provides a comparative evaluation of TFT and ARIMA and VARIMA models for forecasting depression incidence under various structural break scenarios, offering insights into predicting disease burden during both stable and unstable periods. The findings support a decision-making framework for model selection based on the nature of disruptions and data characteristics. For public health policymaking, the results suggest that TFT may be a more suitable tool for disease burden forecasting during periods of stable burden level or when sudden temporary interruption, such as pandemics or socioeconomic variation, impacts disease occurrence. |
| Persistent Identifier | http://hdl.handle.net/10722/364076 |
| ISSN | 2023 SCImago Journal Rankings: 2.020 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Yang, Deliang | - |
| dc.contributor.author | Tang, Yiyi | - |
| dc.contributor.author | Chan, Vivien Kin Yi | - |
| dc.contributor.author | Fang, Qiwen | - |
| dc.contributor.author | Chan, Sandra Sau Man | - |
| dc.contributor.author | Luo, Hao | - |
| dc.contributor.author | Wong, Ian Chi Kei | - |
| dc.contributor.author | Ou, Huang Tz | - |
| dc.contributor.author | Chan, Esther Wai Yin | - |
| dc.contributor.author | Bishai, David Makram | - |
| dc.contributor.author | Chen, Yingyao | - |
| dc.contributor.author | Knapp, Martin | - |
| dc.contributor.author | Jit, Mark | - |
| dc.contributor.author | Craig, Dawn | - |
| dc.contributor.author | Li, Xue | - |
| dc.date.accessioned | 2025-10-21T00:35:29Z | - |
| dc.date.available | 2025-10-21T00:35:29Z | - |
| dc.date.issued | 2025-05-12 | - |
| dc.identifier.citation | Journal of Medical Internet Research, 2025, v. 27, n. 1 | - |
| dc.identifier.issn | 1439-4456 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/364076 | - |
| dc.description.abstract | Background: Accurate prediction of population-wide depression incidence is vital for effective public mental health management. However, this incidence is often influenced by socioeconomic factors, such as abrupt events or changes, including pandemics, economic crises, and social unrest, creating complex structural break scenarios in the time-series data. These structural breaks can affect the performance of forecasting methods in various ways. Therefore, understanding and comparing different models across these scenarios is essential. Objective: This study aimed to develop depression incidence forecasting models and compare the performance of autoregressive integrated moving average (ARIMA) and vector-ARIMA (VARIMA) and temporal fusion transformers (TFT) under different structural break scenarios. Methods: We developed population-wide depression incidence forecasting models and compared the performance of ARIMA and VARIMA-based methods to TFT-based methods. Using monthly depression incidence from 2002 to 2022 in Hong Kong, we applied sliding windows to segment the whole time series into 72 ten-year subsamples. The forecasting models were trained, validated, and tested on each subsample. Within each 10-year subset, the first 7 years were used for training, with the eighth year for setting hold-out validation, and the ninth and tenth years for testing. The accuracy of the testing set within each 10-year subsample was measured by symmetric mean absolute percentage error (SMAPE). Results: We found that in subsamples without significant slope or trend change (structural break), multivariate TFT significantly outperformed univariate TFT, vector-ARIMA (VARIMA), and ARIMA, with an average SMAPE of 11.6% compared to 13.2% (P = .01) for univariate TFT, 16.4% (P = .002) for VARIMA, and 14.8% (P = .003) for ARIMA. Adjusting for the unemployment rate improved TFT performance more effectively than VARIMA. When fluctuating outbreaks happened, TFT was more robust to sharp interruptions, whereas VARIMA and ARIMA performed better when incidence surged and remained high. Conclusions: This study provides a comparative evaluation of TFT and ARIMA and VARIMA models for forecasting depression incidence under various structural break scenarios, offering insights into predicting disease burden during both stable and unstable periods. The findings support a decision-making framework for model selection based on the nature of disruptions and data characteristics. For public health policymaking, the results suggest that TFT may be a more suitable tool for disease burden forecasting during periods of stable burden level or when sudden temporary interruption, such as pandemics or socioeconomic variation, impacts disease occurrence. | - |
| dc.language | eng | - |
| dc.publisher | JMIR Publications | - |
| dc.relation.ispartof | Journal of Medical Internet Research | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject | ARIMA | - |
| dc.subject | deep learning | - |
| dc.subject | depression incidence forecasting | - |
| dc.subject | electronic health records | - |
| dc.subject | machine learning | - |
| dc.subject | medical informatics | - |
| dc.subject | population-wide depression incidence | - |
| dc.subject | structural break scenarios | - |
| dc.subject | temporal fusion transformers | - |
| dc.subject | vector-ARIMA | - |
| dc.title | Population-Wide Depression Incidence Forecasting Comparing Autoregressive Integrated Moving Average and Vector Autoregressive Integrated Moving Average to Temporal Fusion Transformers: Longitudinal Observational Study | - |
| dc.type | Article | - |
| dc.description.nature | published_or_final_version | - |
| dc.identifier.doi | 10.2196/67156 | - |
| dc.identifier.pmid | 40354111 | - |
| dc.identifier.scopus | eid_2-s2.0-105004988488 | - |
| dc.identifier.volume | 27 | - |
| dc.identifier.issue | 1 | - |
| dc.identifier.eissn | 1438-8871 | - |
| dc.identifier.issnl | 1438-8871 | - |
