Predicting Age from Microbiome Data: Benchmarking Multi-Source Machine Learning Methods

dc.contributor.authorIshraq, Shadman
dc.contributor.departmentfi=Tietotekniikan laitos|en=Department of Computing|
dc.contributor.facultyfi=Teknillinen tiedekunta|en=Faculty of Technology|
dc.contributor.studysubjectfi=Tietotekniikka|en=Information and Communication Technology|
dc.date.accessioned2025-02-03T22:03:54Z
dc.date.available2025-02-03T22:03:54Z
dc.date.issued2024-12-30
dc.description.abstractThe microbiome holds significant potential as a predictor of biological processes, including age, due to its dynamic interaction with human health. This study addressed the challenge of predicting age using microbiome data by benchmarking tree-based machine learning models such as Random Forest (RF), Gradient Boosting Machine (GBM), and Extreme Gradient Boosting (XGBoost), in addition to the IntegratedLearner method. In this study, the LifeLines DEEP dataset was utilized, incorporating relative abundance, marker abundance, and pathway abundance data to predict age. Both single-omic and multi-omics models were developed, focusing on evaluating the impact of data integration on predictive performance. The results demonstrated that multi-omics models outperformed single-omic models, with GBM trained on multi-omics data sets and the stacked model used by the IntegratedLearner method achieved the highest predictive accuracy. Functional data sets, particularly pathway abundance, exhibited stronger correlations with age compared to taxonomic dataset, underscoring their significance for age prediction. Despite challenges posed by sparse, zero-inflated data and limited microbial diversity, the findings suggest that multi-omics integration enhances model performance and provides valuable insights into age-related biological processes.
dc.format.extent56
dc.identifier.olddbid196860
dc.identifier.oldhandle10024/179902
dc.identifier.urihttps://www.utupub.fi/handle/11111/19639
dc.identifier.urnURN:NBN:fi-fe202502039230
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/179902
dc.subjectmicrobiome, gut microbiota, multi-omics, single-omic, IntegratedLearner, GBM, RF, XGBoost
dc.titlePredicting Age from Microbiome Data: Benchmarking Multi-Source Machine Learning Methods
dc.type.ontasotfi=Diplomityö|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Ishraq_Shadman_Thesis.pdf
Size:
1.7 MB
Format:
Adobe Portable Document Format