File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1093/gerona/glaf218
- Scopus: eid_2-s2.0-105021283573
- Find via

Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Machine learning prediction of survival in centenarians after age 100: a retrospective, population-based cohort study
| Title | Machine learning prediction of survival in centenarians after age 100: a retrospective, population-based cohort study |
|---|---|
| Authors | |
| Keywords | Centenarians Machine learning Mortality Oldest-old Prediction model |
| Issue Date | 9-Oct-2025 |
| Publisher | Oxford University Press |
| Citation | The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 2025, v. 80, n. 12 How to Cite? |
| Abstract | Background Whether survival at extreme ages can be accurately predicted remains unclear. This study explored the feasibility of using machine learning (ML) and electronic health records (EHRs) to predict mortality in centenarians and identify key survival determinants. Methods We analyzed 9718 centenarians (83% women) from the population-based EHR database in Hong Kong (2004-2018). Data were randomly split into 70% training and 30% testing cohorts. Using 82 predictors, including demographics, diagnoses, prescriptions, and laboratory results, we trained stepwise logistic regression and four ML algorithms to predict 1-year, 2-year, and 5-year all-cause mortality after age 100. Model performance was evaluated using discrimination (area under the receiver operating characteristic curve [AUROC]) and calibration metrics. In an independent cohort of 174 606 oldest-old adults aged 85-105 years, we further compared AUROCs of models incorporating the identified predictors versus comorbidity and frailty scores across different age groups. Results Among the ML models, eXtreme Gradient Boosting algorithm provided the best performance, with AUROCs of 0.707 (95% CI = 0.685-0.730) for 1-year mortality and 0.704 (0.686-0.723) for 2-year mortality in the testing cohort. However, all models showed poor calibration for 5-year mortality. Top three predictors of mortality included lower albumin levels, more frequent hospitalizations, and higher urea levels. Models including these predictors consistently outperformed comorbidity and frailty for mortality prediction among oldest-old adults. Conclusions Utilizing ML models and routinely collected EHRs can predict short-term survival in centenarians with moderate accuracy. Further research is needed to determine whether mortality predictors differ across age in the oldest-old population. |
| Persistent Identifier | http://hdl.handle.net/10722/368336 |
| ISSN | 2023 Impact Factor: 4.3 2023 SCImago Journal Rankings: 1.285 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Mak, Jonathan K. L. | - |
| dc.contributor.author | Yue, Noel C. | - |
| dc.contributor.author | Li, Gloria Hoi-Yee | - |
| dc.contributor.author | Yuen, Jacqueline K. | - |
| dc.contributor.author | Auyeung, Tung Wai | - |
| dc.contributor.author | Tan, Kathryn Choon Beng | - |
| dc.contributor.author | Cheung, Ching-Lung | - |
| dc.date.accessioned | 2025-12-24T00:37:42Z | - |
| dc.date.available | 2025-12-24T00:37:42Z | - |
| dc.date.issued | 2025-10-09 | - |
| dc.identifier.citation | The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 2025, v. 80, n. 12 | - |
| dc.identifier.issn | 1079-5006 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/368336 | - |
| dc.description.abstract | <p>Background</p><p>Whether survival at extreme ages can be accurately predicted remains unclear. This study explored the feasibility of using machine learning (ML) and electronic health records (EHRs) to predict mortality in centenarians and identify key survival determinants.</p><p>Methods</p><p>We analyzed 9718 centenarians (83% women) from the population-based EHR database in Hong Kong (2004-2018). Data were randomly split into 70% training and 30% testing cohorts. Using 82 predictors, including demographics, diagnoses, prescriptions, and laboratory results, we trained stepwise logistic regression and four ML algorithms to predict 1-year, 2-year, and 5-year all-cause mortality after age 100. Model performance was evaluated using discrimination (area under the receiver operating characteristic curve [AUROC]) and calibration metrics. In an independent cohort of 174 606 oldest-old adults aged 85-105 years, we further compared AUROCs of models incorporating the identified predictors versus comorbidity and frailty scores across different age groups.</p><p>Results</p><p>Among the ML models, eXtreme Gradient Boosting algorithm provided the best performance, with AUROCs of 0.707 (95% CI = 0.685-0.730) for 1-year mortality and 0.704 (0.686-0.723) for 2-year mortality in the testing cohort. However, all models showed poor calibration for 5-year mortality. Top three predictors of mortality included lower albumin levels, more frequent hospitalizations, and higher urea levels. Models including these predictors consistently outperformed comorbidity and frailty for mortality prediction among oldest-old adults.</p><p>Conclusions</p><p>Utilizing ML models and routinely collected EHRs can predict short-term survival in centenarians with moderate accuracy. Further research is needed to determine whether mortality predictors differ across age in the oldest-old population.</p> | - |
| dc.language | eng | - |
| dc.publisher | Oxford University Press | - |
| dc.relation.ispartof | The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject | Centenarians | - |
| dc.subject | Machine learning | - |
| dc.subject | Mortality | - |
| dc.subject | Oldest-old | - |
| dc.subject | Prediction model | - |
| dc.title | Machine learning prediction of survival in centenarians after age 100: a retrospective, population-based cohort study | - |
| dc.type | Article | - |
| dc.description.nature | published_or_final_version | - |
| dc.identifier.doi | 10.1093/gerona/glaf218 | - |
| dc.identifier.scopus | eid_2-s2.0-105021283573 | - |
| dc.identifier.volume | 80 | - |
| dc.identifier.issue | 12 | - |
| dc.identifier.eissn | 1758-535X | - |
| dc.identifier.issnl | 1079-5006 | - |
