File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.legalmed.2020.101744
- Scopus: eid_2-s2.0-85087687763
- PMID: 32659707
- WOS: WOS:000579855100009
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation
Title | Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation |
---|---|
Authors | |
Keywords | DNA methylation Age prediction Forward selection LASSO Multiple linear regression Machine learning |
Issue Date | 2020 |
Publisher | Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/legalmed |
Citation | Legal Medicine, 2020, v. 47, article no. 101744 How to Cite? |
Abstract | In forensic investigation, retrieving biological information from DNA evidence is a promising field of interest. One of the applications is on the estimation of the age of the donor based on DNA methylation. A large number of studies focused on age prediction using the 450 K Human Methylation Beadchip. Various marker selection methods and prediction models have been considered. However, there is a lack of research evaluating different high-dimensional variable selection methods of CpG sites with various models for age prediction. The aim of this study is to evaluate four variable selection methods (forward selection, LASSO, elastic net and SCAD) combined with a classical statistical model and sophisticated machine learning models based on the mean absolute deviation (MAD) and the root-mean-square error (RMSE). We used publicly available 450 K data set containing 991 whole blood samples (age 19–101 years). We found that the multiple linear regression model with 16 markers selected from the forward selection method performed very well in age prediction (MAD = 3.76 years and RMSE = 5.01 years). On the other hand, the highly advanced ultrahigh dimensional variable selection methods and sophisticated machine learning algorithms appeared unnecessary for age prediction based on DNA methylation. |
Persistent Identifier | http://hdl.handle.net/10722/304015 |
ISSN | 2023 Impact Factor: 1.3 2023 SCImago Journal Rankings: 0.491 |
ISI Accession Number ID |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lau, PY | - |
dc.contributor.author | Fung, WK | - |
dc.date.accessioned | 2021-09-23T08:54:03Z | - |
dc.date.available | 2021-09-23T08:54:03Z | - |
dc.date.issued | 2020 | - |
dc.identifier.citation | Legal Medicine, 2020, v. 47, article no. 101744 | - |
dc.identifier.issn | 1344-6223 | - |
dc.identifier.uri | http://hdl.handle.net/10722/304015 | - |
dc.description.abstract | In forensic investigation, retrieving biological information from DNA evidence is a promising field of interest. One of the applications is on the estimation of the age of the donor based on DNA methylation. A large number of studies focused on age prediction using the 450 K Human Methylation Beadchip. Various marker selection methods and prediction models have been considered. However, there is a lack of research evaluating different high-dimensional variable selection methods of CpG sites with various models for age prediction. The aim of this study is to evaluate four variable selection methods (forward selection, LASSO, elastic net and SCAD) combined with a classical statistical model and sophisticated machine learning models based on the mean absolute deviation (MAD) and the root-mean-square error (RMSE). We used publicly available 450 K data set containing 991 whole blood samples (age 19–101 years). We found that the multiple linear regression model with 16 markers selected from the forward selection method performed very well in age prediction (MAD = 3.76 years and RMSE = 5.01 years). On the other hand, the highly advanced ultrahigh dimensional variable selection methods and sophisticated machine learning algorithms appeared unnecessary for age prediction based on DNA methylation. | - |
dc.language | eng | - |
dc.publisher | Elsevier BV. The Journal's web site is located at http://www.elsevier.com/locate/legalmed | - |
dc.relation.ispartof | Legal Medicine | - |
dc.subject | DNA methylation | - |
dc.subject | Age prediction | - |
dc.subject | Forward selection | - |
dc.subject | LASSO | - |
dc.subject | Multiple linear regression | - |
dc.subject | Machine learning | - |
dc.title | Evaluation of marker selection methods and statistical models for chronological age prediction based on DNA methylation | - |
dc.type | Article | - |
dc.identifier.email | Fung, WK: wingfung@hkucc.hku.hk | - |
dc.identifier.authority | Fung, WK=rp00696 | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1016/j.legalmed.2020.101744 | - |
dc.identifier.pmid | 32659707 | - |
dc.identifier.scopus | eid_2-s2.0-85087687763 | - |
dc.identifier.hkuros | 325605 | - |
dc.identifier.volume | 47 | - |
dc.identifier.spage | article no. 101744 | - |
dc.identifier.epage | article no. 101744 | - |
dc.identifier.isi | WOS:000579855100009 | - |
dc.publisher.place | Netherlands | - |