Article: Machine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong
| Title | Machine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong |
|---|---|
| Authors | Tan, Jing Tong; Zhang, Ruiqi; Chan, KH; Qin, Jian; Hung, Ivan FN; Cheung, KS |
| Issue Date | 23-Jun-2025 |
| Publisher | Hong Kong Academy of Medicine Press |
| Citation | Hong Kong medical journal, 2025, v. 31 |
| Abstract | Introduction: We aimed to develop a machine learning (ML) model to predict the risk of coronavirus disease 2019 (COVID-19) among three-dose BNT162b2 vaccine recipients in Hong Kong. Methods: A total of 304 individuals who had received three doses of BNT162b2 were recruited from three vaccination centres in Hong Kong between May and August 2021. The dataset was randomly divided into training (n=184) and testing (n=120) sets in a 6:4 ratio. Demographics, co-morbidities and medications, blood tests (complete blood count, liver and renal function tests, glycated haemoglobin level, lipid profile, and presence of hepatitis B surface antigen), and controlled attenuation parameter (CAP) were used to develop six ML models (logistic regression, linear discriminant analysis, random forest, naïve Bayes, neural network [NN], and extreme gradient boosting) to predict COVID-19 risk. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results: Among the study population (median age: 50.9 years [interquartile range=43.6-57.8]; men: 30.9% [n=94]), 27 participants (8.9%) developed COVID-19 within 6 months. Fifteen clinical variables were used to train the models. The NN model achieved the best performance, with an AUC of 0.74 (95% confidence interval [95% CI]=0.60-0.88). Using the optimal cut-off value based on the maximised Youden index, sensitivity, specificity, PPV, and NPV were 90% (95% CI=55%-100%), 58% (95% CI=48%-68%), 16% (95% CI=8%-29%), and 98% (95% CI=92%-100%), respectively. The top predictors in the NN model included age, prediabetes/diabetes, CAP, alanine aminotransferase level, and aspartate aminotransferase level. Conclusion: An NN model integrating 15 clinical variables effectively identified individuals at low risk of COVID-19 following three doses of BNT162b2. |
| Persistent Identifier | http://hdl.handle.net/10722/358839 |
| ISSN | 1024-2708 (2023 Impact Factor: 3.1; 2023 SCImago Journal Rankings: 0.261) |

| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Tan, Jing Tong | - |
| dc.contributor.author | Zhang, Ruiqi | - |
| dc.contributor.author | Chan, KH | - |
| dc.contributor.author | Qin, Jian | - |
| dc.contributor.author | Hung, Ivan FN | - |
| dc.contributor.author | Cheung, KS | - |
| dc.date.accessioned | 2025-08-13T07:48:21Z | - |
| dc.date.available | 2025-08-13T07:48:21Z | - |
| dc.date.issued | 2025-06-23 | - |
| dc.identifier.citation | Hong Kong medical journal, 2025, v. 31 | - |
| dc.identifier.issn | 1024-2708 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/358839 | - |
| dc.description.abstract | Introduction: We aimed to develop a machine learning (ML) model to predict the risk of coronavirus disease 2019 (COVID-19) among three-dose BNT162b2 vaccine recipients in Hong Kong. Methods: A total of 304 individuals who had received three doses of BNT162b2 were recruited from three vaccination centres in Hong Kong between May and August 2021. The dataset was randomly divided into training (n=184) and testing (n=120) sets in a 6:4 ratio. Demographics, co-morbidities and medications, blood tests (complete blood count, liver and renal function tests, glycated haemoglobin level, lipid profile, and presence of hepatitis B surface antigen), and controlled attenuation parameter (CAP) were used to develop six ML models (logistic regression, linear discriminant analysis, random forest, naïve Bayes, neural network [NN], and extreme gradient boosting) to predict COVID-19 risk. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results: Among the study population (median age: 50.9 years [interquartile range=43.6-57.8]; men: 30.9% [n=94]), 27 participants (8.9%) developed COVID-19 within 6 months. Fifteen clinical variables were used to train the models. The NN model achieved the best performance, with an AUC of 0.74 (95% confidence interval [95% CI]=0.60-0.88). Using the optimal cut-off value based on the maximised Youden index, sensitivity, specificity, PPV, and NPV were 90% (95% CI=55%-100%), 58% (95% CI=48%-68%), 16% (95% CI=8%-29%), and 98% (95% CI=92%-100%), respectively. The top predictors in the NN model included age, prediabetes/diabetes, CAP, alanine aminotransferase level, and aspartate aminotransferase level. Conclusion: An NN model integrating 15 clinical variables effectively identified individuals at low risk of COVID-19 following three doses of BNT162b2. | - |
| dc.language | eng | - |
| dc.publisher | Hong Kong Academy of Medicine Press | - |
| dc.relation.ispartof | Hong Kong medical journal | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.title | Machine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong | - |
| dc.type | Article | - |
| dc.description.nature | published_or_final_version | - |
| dc.identifier.doi | 10.12809/hkmj2411879 | - |
| dc.identifier.volume | 31 | - |
| dc.identifier.eissn | 2226-8707 | - |
| dc.identifier.issnl | 1024-2708 | - |
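
The abstract reports sensitivity, specificity, PPV, and NPV at a cut-off chosen by maximising the Youden index (J = sensitivity + specificity - 1). The following is a minimal sketch of how such a cut-off and the four metrics could be computed from predicted risk scores. It is not the authors' code: the function names (`confusion_counts`, `diagnostic_metrics`, `youden_cutoff`) are hypothetical, and the labels and scores are toy values invented purely for illustration, not study data.

```python
def confusion_counts(y_true, scores, threshold):
    """Count TP, FP, TN, FN when a score >= threshold is called positive."""
    tp = fp = tn = fn = 0
    for label, score in zip(y_true, scores):
        predicted_positive = score >= threshold
        if predicted_positive and label == 1:
            tp += 1
        elif predicted_positive and label == 0:
            fp += 1
        elif not predicted_positive and label == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def diagnostic_metrics(tp, fp, tn, fn):
    """Return sensitivity, specificity, PPV, and NPV from confusion counts."""
    def ratio(numerator, denominator):
        # Undefined ratios (empty denominator) are reported as NaN.
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


def youden_cutoff(y_true, scores):
    """Scan every observed score as a candidate cut-off; keep the one with the largest J."""
    best_threshold, best_j = None, -1.0
    for threshold in sorted(set(scores)):
        m = diagnostic_metrics(*confusion_counts(y_true, scores, threshold))
        j = m["sensitivity"] + m["specificity"] - 1
        if j > best_j:
            best_threshold, best_j = threshold, j
    return best_threshold, best_j


# Toy example (invented values, NOT data from the study): 1 = developed COVID-19.
y_true = [0, 0, 0, 0, 0, 0, 0, 1, 0, 1]
scores = [0.05, 0.10, 0.15, 0.20, 0.30, 0.35, 0.40, 0.45, 0.60, 0.80]

cutoff, j = youden_cutoff(y_true, scores)
metrics = diagnostic_metrics(*confusion_counts(y_true, scores, cutoff))
print(cutoff, j, metrics)
```

When the outcome is uncommon, as in the abstract (8.9% of participants), a cut-off chosen this way can give a high NPV alongside a low PPV, which is the pattern the reported figures show.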

