File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Chronological age estimation from human microbiomes with transformer-based Robust Principal Component Analysis

TitleChronological age estimation from human microbiomes with transformer-based Robust Principal Component Analysis
Authors
Issue Date1-Dec-2025
PublisherNature Research
Citation
Communications Biology, 2025, v. 8, n. 1 How to Cite?
AbstractDeep learning for microbiome analysis has shown potential for understanding microbial communities and human phenotypes. Here, we propose an approach, Transformer-based Robust Principal Component Analysis(TRPCA), which leverages the strengths of transformer architectures and interpretability of Robust Principal Component Analysis. To investigate benefits of TRPCA over conventional machine learning models, we benchmarked performance on age prediction from three body sites(skin, oral, gut), with 16S rRNA gene amplicon(16S) and whole-genome sequencing(WGS) data. We demonstrated prediction of age from longitudinal samples and combined classification and regression tasks via multi-task learning(MTL). TRPCA improves age prediction accuracy from human microbiome samples, achieving the largest reduction in Mean Absolute Error for WGS skin (MAE: 8.03, 28% reduction) and 16S skin (MAE: 5.09, 14% reduction) samples, compared to conventional approaches. Additionally, TRPCA’s MTL approach achieves an accuracy of 89% for birth country prediction across 5 countries, while improving age prediction from WGS stool samples. Notably, TRPCA uncovers a link between subject and error prediction through residual analysis for paired samples across sequencing method (16S/WGS) and body site(oral/gut). These findings highlight TRPCA’s utility in improving age prediction while maintaining feature-level interpretability, and elucidating connections between individuals and microbiomes.
Persistent Identifierhttp://hdl.handle.net/10722/366013

 

DC FieldValueLanguage
dc.contributor.authorMyers, Tyler-
dc.contributor.authorSong, Se Jin-
dc.contributor.authorChen, Yang-
dc.contributor.authorDe Pessemier, Britta-
dc.contributor.authorKhatib, Lora-
dc.contributor.authorMcDonald, Daniel-
dc.contributor.authorHuang, Shi-
dc.contributor.authorGallo, Richard-
dc.contributor.authorCallewaert, Chris-
dc.contributor.authorHavulinna, Aki S.-
dc.contributor.authorLahti, Leo-
dc.contributor.authorRoeselers, Guus-
dc.contributor.authorLaiola, Manolo-
dc.contributor.authorShetty, Sudarshan A.-
dc.contributor.authorKelley, Scott T.-
dc.contributor.authorKnight, Rob-
dc.contributor.authorBartko, Andrew-
dc.date.accessioned2025-11-14T02:40:57Z-
dc.date.available2025-11-14T02:40:57Z-
dc.date.issued2025-12-01-
dc.identifier.citationCommunications Biology, 2025, v. 8, n. 1-
dc.identifier.urihttp://hdl.handle.net/10722/366013-
dc.description.abstractDeep learning for microbiome analysis has shown potential for understanding microbial communities and human phenotypes. Here, we propose an approach, Transformer-based Robust Principal Component Analysis(TRPCA), which leverages the strengths of transformer architectures and interpretability of Robust Principal Component Analysis. To investigate benefits of TRPCA over conventional machine learning models, we benchmarked performance on age prediction from three body sites(skin, oral, gut), with 16S rRNA gene amplicon(16S) and whole-genome sequencing(WGS) data. We demonstrated prediction of age from longitudinal samples and combined classification and regression tasks via multi-task learning(MTL). TRPCA improves age prediction accuracy from human microbiome samples, achieving the largest reduction in Mean Absolute Error for WGS skin (MAE: 8.03, 28% reduction) and 16S skin (MAE: 5.09, 14% reduction) samples, compared to conventional approaches. Additionally, TRPCA’s MTL approach achieves an accuracy of 89% for birth country prediction across 5 countries, while improving age prediction from WGS stool samples. Notably, TRPCA uncovers a link between subject and error prediction through residual analysis for paired samples across sequencing method (16S/WGS) and body site(oral/gut). These findings highlight TRPCA’s utility in improving age prediction while maintaining feature-level interpretability, and elucidating connections between individuals and microbiomes.-
dc.languageeng-
dc.publisherNature Research-
dc.relation.ispartofCommunications Biology-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.titleChronological age estimation from human microbiomes with transformer-based Robust Principal Component Analysis-
dc.typeArticle-
dc.identifier.doi10.1038/s42003-025-08590-y-
dc.identifier.pmid40770074-
dc.identifier.scopuseid_2-s2.0-105013075463-
dc.identifier.volume8-
dc.identifier.issue1-
dc.identifier.eissn2399-3642-
dc.identifier.issnl2399-3642-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats