An Unsupervised Algorithm for Host Identification in Flaviviruses

dc.contributor.authorTruong Phuoc Nguyen
dc.contributor.authorGarcia-Vallvé Santiago
dc.contributor.authorPuigbò Pere
dc.contributor.organizationfi=ekologia ja evoluutiobiologia|en=Ecology and Evolutionary Biology |
dc.contributor.organizationfi=fysiologia ja genetiikka|en=Physiology and Genetics|
dc.contributor.organization-code2606402
dc.contributor.organization-code2606404
dc.converis.publication-id59731875
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/59731875
dc.date.accessioned2022-10-28T14:18:11Z
dc.date.available2022-10-28T14:18:11Z
dc.description.abstractEarly characterization of emerging viruses is essential to control their spread, such as the Zika Virus outbreak in 2014. Among other non-viral factors, host information is essential for the surveillance and control of virus spread. Flaviviruses (genus Flavivirus), akin to other viruses, are modulated by high mutation rates and selective forces to adapt their codon usage to that of their hosts. However, a major challenge is the identification of potential hosts for novel viruses. Usually, potential hosts of emerging zoonotic viruses are identified after several confirmed cases. This is inefficient for deterring future outbreaks. In this paper, we introduce an algorithm to identify the host range of a virus from its raw genome sequences. The proposed strategy relies on comparing codon usage frequencies across viruses and hosts, by means of a normalized Codon Adaptation Index (CAI). We have tested our algorithm on 94 flaviviruses and 16 potential hosts. This novel method is able to distinguish between arthropod and vertebrate hosts for several flaviviruses with high values of accuracy (virus group 91.9% and host type 86.1%) and specificity (virus group 94.9% and host type 79.6%), in comparison to empirical observations. Overall, this algorithm may be useful as a complementary tool to current phylogenetic methods in monitoring current and future viral outbreaks by understanding host-virus relationships.
dc.identifier.jour-issn2075-1729
dc.identifier.olddbid187473
dc.identifier.oldhandle10024/170567
dc.identifier.urihttps://www.utupub.fi/handle/11111/43043
dc.identifier.urnURN:NBN:fi-fe2021093049013
dc.language.isoen
dc.okm.affiliatedauthorTruong, Phuoc
dc.okm.affiliatedauthorPuigbo, Pedro
dc.okm.discipline1184 Genetics, developmental biology, physiologyen_GB
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.discipline3141 Health care scienceen_GB
dc.okm.discipline1184 Genetiikka, kehitysbiologia, fysiologiafi_FI
dc.okm.discipline3111 Biolääketieteetfi_FI
dc.okm.discipline3141 Terveystiedefi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherMDPI
dc.publisher.countrySwitzerlanden_GB
dc.publisher.countrySveitsifi_FI
dc.publisher.country-codeCH
dc.relation.articlenumberARTN 442
dc.relation.doi10.3390/life11050442
dc.relation.ispartofjournalLife
dc.relation.issue5
dc.relation.volume11
dc.source.identifierhttps://www.utupub.fi/handle/10024/170567
dc.titleAn Unsupervised Algorithm for Host Identification in Flaviviruses
dc.year.issued2021

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
life-11-00442-v2.pdf
Size:
981.53 KB
Format:
Adobe Portable Document Format
Description:
Publisher's PDF