Introducing deep learning -based methods into the variant calling analysis pipeline

dc.contributor.authorLysenkov, Vladislav
dc.contributor.departmentfi=Tulevaisuuden teknologioiden laitos|en=Department of Future Technologies|
dc.contributor.facultyfi=Luonnontieteiden ja tekniikan tiedekunta|en=Faculty of Science and Engineering|
dc.contributor.studysubjectfi=Bioinformatics|en=Bioinformatics|
dc.date.accessioned2019-06-17T21:00:53Z
dc.date.available2019-06-17T21:00:53Z
dc.date.issued2019-05-24
dc.description.abstractBiological interpretation of the genetic variation enhances our understanding of normal and pathological phenotypes, and may lead to the development of new therapeutics. However, it is heavily dependent on the genomic data analysis, which might be inaccurate due to the various sequencing errors and inconsistencies caused by these errors. Modern analysis pipelines already utilize heuristic and statistical techniques, but the rate of falsely identified mutations remains high and variable, particular sequencing technology, settings and variant type. Recently, several tools based on deep neural networks have been published. The neural networks are supposed to find motifs in the data that were not previously seen. The performance of these novel tools is assessed in terms of precision and recall, as well as computational efficiency. Following the established best practices in both variant detection and benchmarking, the discussed tools demonstrate accuracy metrics and computational efficiency that spur further discussion.
dc.format.extent89
dc.identifier.olddbid164797
dc.identifier.oldhandle10024/147956
dc.identifier.urihttps://www.utupub.fi/handle/11111/12195
dc.identifier.urnURN:NBN:fi-fe2019061720740
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/147956
dc.subjectnext generation sequencing, variant calling, machine learning, deep learning, benchmark, accuracy, precision, recall
dc.titleIntroducing deep learning -based methods into the variant calling analysis pipeline
dc.type.ontasotfi=Pro gradu -tutkielma|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Lysenkov_Vladislav_Thesis.pdf
Size:
3.83 MB
Format:
Adobe Portable Document Format