File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Robust identification key predictors of short- and long-term weight status in children and adolescents by machine learning

TitleRobust identification key predictors of short- and long-term weight status in children and adolescents by machine learning
Authors
Keywordschild
feature selection
feature stability
machine learning
obesity
Issue Date24-Sep-2024
PublisherFrontiers Media
Citation
Frontiers in Public Health, 2024, v. 12 How to Cite?
Abstract

Background: Early identification of high-risk individuals for weight problems in children and adolescents is crucial for implementing timely preventive measures. While machine learning (ML) techniques have shown promise in addressing this complex challenge with high-dimensional data, feature selection is vital for identifying the key predictors that can facilitate effective and targeted interventions. This study aims to utilize feature selection process to identify a robust and minimal set of predictors that can aid in the early prediction of short- and long-term weight problems in children and adolescents. Methods: We utilized demographic, physical, and psychological wellbeing predictors to model weight status (normal, underweight, overweight, and obese) for 1-, 3-, and 5-year periods. To select the most influential features, we employed four feature selection methods: (1) Chi-Square test; (2) Information Gain; (3) Random Forest; (4) eXtreme Gradient Boosting (XGBoost) with six ML approaches. The stability of the feature selection methods was assessed by Jaccard's index, Spearman's rank correlation and Pearson's correlation. Model evaluation was performed by various accuracy metrics. Results: With 3,862,820 million student-visits were included in this population-based study, the mean age of 11.6 (SD = 3.64) for the training set and 10.8 years (SD = 3.50) for the temporal test set. From the initial set of 38 predictors, we identified 6, 9, and 13 features for 1-, 3-, and 5-year predictions, respectively, by the best performed feature selection method of Chi-Square test in XGBoost models. These feature sets demonstrated excellent stability and achieved prediction accuracies of 0.82, 0.73, and 0.70; macro-AUCs of 0.94, 0.86, and 0.83; micro-AUCs of 0.96, 0.93, and 0.92 for different prediction windows, respectively. Weight, height, sex, total score of self-esteem, and age were consistently the most influential predictors across all prediction windows. Additionally, several psychological and social wellbeing predictors showed relatively high importance in long-term weight status prediction. Conclusions: We demonstrate the potential of ML in identifying key predictors of weight status in children and adolescents. While traditional anthropometric measures remain important, psychological and social wellbeing factors also emerge as crucial predictors, potentially informing targeted interventions to address childhood and adolescence weight problems.


Persistent Identifierhttp://hdl.handle.net/10722/350643
ISSN
2023 Impact Factor: 3.0
2023 SCImago Journal Rankings: 0.895

 

DC FieldValueLanguage
dc.contributor.authorLiu, Hengyan-
dc.contributor.authorLeng, Yang-
dc.contributor.authorWu, Yik Chung-
dc.contributor.authorChau, Pui Hing-
dc.contributor.authorChung, Thomas Wai Hung-
dc.contributor.authorFong, Daniel Yee Tak-
dc.date.accessioned2024-10-31T00:30:35Z-
dc.date.available2024-10-31T00:30:35Z-
dc.date.issued2024-09-24-
dc.identifier.citationFrontiers in Public Health, 2024, v. 12-
dc.identifier.issn2296-2565-
dc.identifier.urihttp://hdl.handle.net/10722/350643-
dc.description.abstract<p>Background: Early identification of high-risk individuals for weight problems in children and adolescents is crucial for implementing timely preventive measures. While machine learning (ML) techniques have shown promise in addressing this complex challenge with high-dimensional data, feature selection is vital for identifying the key predictors that can facilitate effective and targeted interventions. This study aims to utilize feature selection process to identify a robust and minimal set of predictors that can aid in the early prediction of short- and long-term weight problems in children and adolescents. Methods: We utilized demographic, physical, and psychological wellbeing predictors to model weight status (normal, underweight, overweight, and obese) for 1-, 3-, and 5-year periods. To select the most influential features, we employed four feature selection methods: (1) Chi-Square test; (2) Information Gain; (3) Random Forest; (4) eXtreme Gradient Boosting (XGBoost) with six ML approaches. The stability of the feature selection methods was assessed by Jaccard's index, Spearman's rank correlation and Pearson's correlation. Model evaluation was performed by various accuracy metrics. Results: With 3,862,820 million student-visits were included in this population-based study, the mean age of 11.6 (SD = 3.64) for the training set and 10.8 years (SD = 3.50) for the temporal test set. From the initial set of 38 predictors, we identified 6, 9, and 13 features for 1-, 3-, and 5-year predictions, respectively, by the best performed feature selection method of Chi-Square test in XGBoost models. These feature sets demonstrated excellent stability and achieved prediction accuracies of 0.82, 0.73, and 0.70; macro-AUCs of 0.94, 0.86, and 0.83; micro-AUCs of 0.96, 0.93, and 0.92 for different prediction windows, respectively. Weight, height, sex, total score of self-esteem, and age were consistently the most influential predictors across all prediction windows. Additionally, several psychological and social wellbeing predictors showed relatively high importance in long-term weight status prediction. Conclusions: We demonstrate the potential of ML in identifying key predictors of weight status in children and adolescents. While traditional anthropometric measures remain important, psychological and social wellbeing factors also emerge as crucial predictors, potentially informing targeted interventions to address childhood and adolescence weight problems.</p>-
dc.languageeng-
dc.publisherFrontiers Media-
dc.relation.ispartofFrontiers in Public Health-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectchild-
dc.subjectfeature selection-
dc.subjectfeature stability-
dc.subjectmachine learning-
dc.subjectobesity-
dc.titleRobust identification key predictors of short- and long-term weight status in children and adolescents by machine learning-
dc.typeArticle-
dc.identifier.doi10.3389/fpubh.2024.1414046-
dc.identifier.pmid39381765-
dc.identifier.scopuseid_2-s2.0-85206065767-
dc.identifier.volume12-
dc.identifier.eissn2296-2565-
dc.identifier.issnl2296-2565-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats