Phylogeny-Aware Analysis of Metagenome Community Ecology Based on Matched Reference Genomes while Bypassing Taxonomy

dc.contributor.authorZhu Qiyun
dc.contributor.authorHuang Shi
dc.contributor.authorGonzalez Antonio
dc.contributor.authorMcGrath Imran
dc.contributor.authorMcDonald Daniel
dc.contributor.authorHaiminen Niina
dc.contributor.authorArmstrong George
dc.contributor.authorVázquez-Baeza Yoshiki
dc.contributor.authorYu Julian
dc.contributor.authorKuczynski Justin
dc.contributor.authorSepich-Poore Gregory D.
dc.contributor.authorSwafford Austin D.
dc.contributor.authorDas Promi
dc.contributor.authorShaffer Justin P.
dc.contributor.authorLejzerowicz Franck
dc.contributor.authorBelda-Ferre Pedro
dc.contributor.authorHavulinna Aki S.
dc.contributor.authorMéric Guillaume
dc.contributor.authorNiiranen Teemu
dc.contributor.authorLahti Leo
dc.contributor.authorSalomaa Veikko
dc.contributor.authorKim Ho-Cheol
dc.contributor.authorJain Mohit
dc.contributor.authorInouye Michael
dc.contributor.authorGilbert Jack A.
dc.contributor.authorKnight Rob
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organizationfi=sisätautioppi|en=Internal Medicine|
dc.contributor.organizationfi=tyks, vsshp|en=tyks, varha|
dc.contributor.organizationfi=väestötutkimuskeskus|en=Centre for Population Health Research (POP Centre)|
dc.contributor.organization-code1.2.246.10.2458963.20.40502528769
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.contributor.organization-code2607008
dc.converis.publication-id175219183
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/175219183
dc.date.accessioned2022-10-28T12:27:28Z
dc.date.available2022-10-28T12:27:28Z
dc.description.abstract<p>We introduce the operational genomic unit (OGU) method, a metagenome analysis strategy that directly exploits sequence alignment hits to individual reference genomes as the minimum unit for assessing the diversity of microbial communities and their relevance to environmental factors. This approach is independent of taxonomic classification, granting the possibility of maximal resolution of community composition, and organizes features into an accurate hierarchy using a phylogenomic tree. The outputs are suitable for contemporary analytical protocols for community ecology, differential abundance, and supervised learning while supporting phylogenetic methods, such as UniFrac and phylofactorization, that are seldom applied to shotgun metagenomics despite being prevalent in 16S rRNA gene amplicon studies. As demonstrated in two real-world case studies, the OGU method produces biologically meaningful patterns from microbiome data sets. Such patterns further remain detectable at very low metagenomic sequencing depths. Compared with taxonomic unit-based analyses implemented in currently adopted metagenomics tools, and the analysis of 16S rRNA gene amplicon sequence variants, this method shows superiority in informing biologically relevant insights, including stronger correlation with body environment and host sex on the Human Microbiome Project data set and more accurate prediction of human age by the gut microbiomes of Finnish individuals included in the FINRISK 2002 cohort. We provide Woltka, a bioinformatics tool to implement this method, with full integration with the QIIME 2 package and the Qiita web platform, to facilitate adoption of the OGU method in future metagenomics studies.<br></p><p>IMPORTANCE Shotgun metagenomics is a powerful, yet computationally challenging, technique compared to 16S rRNA gene amplicon sequencing for decoding the composition and structure of microbial communities. Current analyses of metagenomic data are primarily based on taxonomic classification, which is limited in feature resolution. To solve these challenges, we introduce operational genomic units (OGUs), which are the individual reference genomes derived from sequence alignment results, without further assigning them taxonomy. The OGU method advances current read-based metagenomics in two dimensions: (i) providing maximal resolution of community composition and (ii) permitting use of phylogeny-aware tools. Our analysis of real-world data sets shows that it is advantageous over currently adopted metagenomic analysis methods and the finest-grained 16S rRNA analysis methods in predicting biological traits. We thus propose the adoption of OGUs as an effective practice in metagenomic studies.</p>
dc.identifier.eissn2379-5077
dc.identifier.jour-issn2379-5077
dc.identifier.olddbid176536
dc.identifier.oldhandle10024/159630
dc.identifier.urihttps://www.utupub.fi/handle/11111/32014
dc.identifier.urlhttps://journals.asm.org/doi/10.1128/msystems.00167-22
dc.identifier.urnURN:NBN:fi-fe2022081154042
dc.language.isoen
dc.okm.affiliatedauthorNiiranen, Teemu
dc.okm.affiliatedauthorLahti, Leo
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline3111 Biolääketieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherAMER SOC MICROBIOLOGY
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.articlenumbere00167-22
dc.relation.doi10.1128/msystems.00167-22
dc.relation.ispartofjournalMSystems
dc.relation.issue2
dc.relation.volume7
dc.source.identifierhttps://www.utupub.fi/handle/10024/159630
dc.titlePhylogeny-Aware Analysis of Metagenome Community Ecology Based on Matched Reference Genomes while Bypassing Taxonomy
dc.year.issued2022

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
NiiranenEtAl2022PhylogenyAwareAnalysis.pdf
Size:
4.35 MB
Format:
Adobe Portable Document Format