File Download
  Links for fulltext
     (May Require Subscription)

Article: Machine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong

TitleMachine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong
Authors
Issue Date23-Jun-2025
PublisherHong Kong Academy of Medicine Press
Citation
Hong Kong medical journal, 2025, v. 31 How to Cite?
Abstract
Abstract
Introduction: We aimed to develop a machine learning (ML) model to predict the risk of coronavirus disease 2019 (COVID-19) among three-dose BNT162b2 vaccine recipients in Hong Kong.

Methods: A total of 304 individuals who had received three doses of BNT162b2 were recruited from three vaccination centres in Hong Kong between May and August 2021. The dataset was randomly divided into training (n=184) and testing (n=120) sets in a 6:4 ratio. Demographics, co-morbidities and medications, blood tests (complete blood count, liver and renal function tests, glycated haemoglobin level, lipid profile, and presence of hepatitis B surface antigen), and controlled attenuation parameter (CAP) were used to develop six ML models (logistic regression, linear discriminant analysis, random forest, naïve Bayes, neural network [NN], and extreme gradient boosting models) to predict COVID-19 risk. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and positive predictive value (PPV) and negative predictive value (NPV).

Results: Among the study population (median age: 50.9 years [interquartile range=43.6-57.8]; men: 30.9% [n=94]), 27 participants (8.9%) developed COVID-19 within 6 months. Fifteen clinical variables were used to train the models. The NN model achieved the best performance, with an AUC of 0.74 (95% confidence interval [95% CI]=0.60-0.88). Using the optimal cut-off value based on the maximised Youden index, sensitivity, specificity, PPV, and NPV were 90% (95% CI=55%-100%), 58% (95% CI=48%-68%), 16% (95% CI=8%-29%), and 98% (95% CI=92%-100%), respectively. The top predictors in the NN model include age, prediabetes/diabetes, CAP, alanine aminotransferase level, and aspartate aminotransferase level.

Conclusion: An NN model integrating 15 clinical variables effectively identified individuals at low risk of COVID-19 following three doses of BNT162b2.

Persistent Identifierhttp://hdl.handle.net/10722/358839
ISSN
2023 Impact Factor: 3.1
2023 SCImago Journal Rankings: 0.261

 

DC FieldValueLanguage
dc.contributor.authorTan, Jing Tong-
dc.contributor.authorZhang, Ruiqi-
dc.contributor.authorChan, KH-
dc.contributor.authorQin, Jian-
dc.contributor.authorHung, Ivan FN-
dc.contributor.authorCheung, KS-
dc.date.accessioned2025-08-13T07:48:21Z-
dc.date.available2025-08-13T07:48:21Z-
dc.date.issued2025-06-23-
dc.identifier.citationHong Kong medical journal, 2025, v. 31-
dc.identifier.issn1024-2708-
dc.identifier.urihttp://hdl.handle.net/10722/358839-
dc.description.abstract<div>Abstract</div><div><strong>Introduction:</strong> We aimed to develop a machine learning (ML) model to predict the risk of coronavirus disease 2019 (COVID-19) among three-dose BNT162b2 vaccine recipients in Hong Kong.</div><div><br></div><div><strong>Methods:</strong> A total of 304 individuals who had received three doses of BNT162b2 were recruited from three vaccination centres in Hong Kong between May and August 2021. The dataset was randomly divided into training (n=184) and testing (n=120) sets in a 6:4 ratio. Demographics, co-morbidities and medications, blood tests (complete blood count, liver and renal function tests, glycated haemoglobin level, lipid profile, and presence of hepatitis B surface antigen), and controlled attenuation parameter (CAP) were used to develop six ML models (logistic regression, linear discriminant analysis, random forest, naïve Bayes, neural network [NN], and extreme gradient boosting models) to predict COVID-19 risk. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and positive predictive value (PPV) and negative predictive value (NPV).</div><div><br></div><div><strong>Results:</strong> Among the study population (median age: 50.9 years [interquartile range=43.6-57.8]; men: 30.9% [n=94]), 27 participants (8.9%) developed COVID-19 within 6 months. Fifteen clinical variables were used to train the models. The NN model achieved the best performance, with an AUC of 0.74 (95% confidence interval [95% CI]=0.60-0.88). Using the optimal cut-off value based on the maximised Youden index, sensitivity, specificity, PPV, and NPV were 90% (95% CI=55%-100%), 58% (95% CI=48%-68%), 16% (95% CI=8%-29%), and 98% (95% CI=92%-100%), respectively. The top predictors in the NN model include age, prediabetes/diabetes, CAP, alanine aminotransferase level, and aspartate aminotransferase level.</div><div><br></div><div><strong>Conclusion:</strong> An NN model integrating 15 clinical variables effectively identified individuals at low risk of COVID-19 following three doses of BNT162b2.</div>-
dc.languageeng-
dc.publisherHong Kong Academy of Medicine Press-
dc.relation.ispartofHong Kong medical journal-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleMachine learning model for prediction of coronavirus disease 2019 within 6 months after three doses of BNT162b2 in Hong Kong-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.12809/hkmj2411879-
dc.identifier.volume31-
dc.identifier.eissn2226-8707-
dc.identifier.issnl1024-2708-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats