Deep learning tools are top performers in long non-coding RNA prediction

Ammunét Tea; Wang Ning; Khan Sofia; Elo Laura L

Deep learning tools are top performers in long non-coding RNA prediction

dc.contributor.author	Ammunét Tea
dc.contributor.author	Wang Ning
dc.contributor.author	Khan Sofia
dc.contributor.author	Elo Laura L
dc.contributor.organization	fi=biolääketieteen laitos\|en=Institute of Biomedicine\|
dc.contributor.organization-code	2609201
dc.converis.publication-id	175411816
dc.converis.url	https://research.utu.fi/converis/portal/Publication/175411816
dc.date.accessioned	2025-08-28T00:32:28Z
dc.date.available	2025-08-28T00:32:28Z
dc.description.abstract	The increasing amount of transcriptomic data has brought to light vast numbers of potential novel RNA transcripts. Accurately distinguishing novel long non-coding RNAs (lncRNAs) from protein-coding messenger RNAs (mRNAs) has challenged bioinformatic tool developers. Most recently, tools implementing deep learning architectures have been developed for this task, with the potential of discovering sequence features and their interactions still not surfaced in current knowledge. We compared the performance of deep learning tools with other predictive tools that are currently used in lncRNA coding potential prediction. A total of 15 tools representing the variety of available methods were investigated. In addition to known annotated transcripts, we also evaluated the use of the tools in actual studies with real-life data. The robustness and scalability of the tools' performance was tested with varying sized test sets and test sets with different proportions of lncRNAs and mRNAs. In addition, the ease-of-use for each tested tool was scored. Deep learning tools were top performers in most metrics and labelled transcripts similarly with each other in the real-life dataset. However, the proportion of lncRNAs and mRNAs in the test sets affected the performance of all tools. Computational resources were utilized differently between the top-ranking tools, thus the nature of the study may affect the decision of choosing one well-performing tool over another. Nonetheless, the results suggest favouring the novel deep learning tools over other tools currently in broad use.
dc.format.pagerange	241
dc.identifier.eissn	2041-2657
dc.identifier.jour-issn	2041-2649
dc.identifier.olddbid	205901
dc.identifier.oldhandle	10024/188928
dc.identifier.uri	https://www.utupub.fi/handle/11111/36259
dc.identifier.url	https://academic.oup.com/bfg/article/21/3/230/6523275
dc.identifier.urn	URN:NBN:fi-fe2022081153829
dc.language.iso	en
dc.okm.affiliatedauthor	Ammunet, Tea
dc.okm.affiliatedauthor	Wang, Ning
dc.okm.affiliatedauthor	Khan, Sofia
dc.okm.affiliatedauthor	Elo, Laura
dc.okm.affiliatedauthor	Dataimport, Biolääketieteen laitoksen yhteiset
dc.okm.discipline	318 Medical biotechnology	en_GB
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A2 Scientific Article
dc.publisher	OXFORD UNIV PRESS
dc.publisher.country	United Kingdom	en_GB
dc.publisher.country	Britannia	fi_FI
dc.publisher.country-code	GB
dc.relation.doi	10.1093/bfgp/elab045
dc.relation.ispartofjournal	Briefings in Functional Genomics
dc.relation.issue	3
dc.relation.volume	21
dc.source.identifier	https://www.utupub.fi/handle/10024/188928
dc.title	Deep learning tools are top performers in long non-coding RNA prediction
dc.year.issued	2022

Tiedostot

Näytetään 1 - 1 / 1

Name:: elab045.pdf
Size:: 898.88 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet