Nextpie: a web-based reporting tool and database for reproducible nextflow pipelines

dc.contributor.authorGhimire, Bishwa
dc.contributor.authorBooth, Nicholas
dc.contributor.authorLönnberg, Tapio
dc.contributor.authorAittokallio, Tero
dc.contributor.organizationfi=InFLAMES Lippulaiva|en=InFLAMES Flagship|
dc.contributor.organizationfi=MediCity|en=MediCity|
dc.contributor.organizationfi=Turun biotiedekeskus|en=Turku Bioscience Centre|
dc.contributor.organization-code1.2.246.10.2458963.20.18586209670
dc.contributor.organization-code1.2.246.10.2458963.20.68445910604
dc.contributor.organization-code1.2.246.10.2458963.20.83772236069
dc.converis.publication-id505185804
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/505185804
dc.date.accessioned2026-01-21T12:30:39Z
dc.date.available2026-01-21T12:30:39Z
dc.description.abstract<p><b>Motivation</b></p><p>High-throughput genomic data analysis consists of the inexorably intertwined inputs and outputs of a vast array of bioinformatic analysis tools. To guarantee streamlined and reproducible analyses, the often complex data analysis pipelines need to be run using workflow management tools. Nextflow is one popular tool commonly used to automate such pipelines. Nextflow records key pipeline data, such as the submission time, start time, completion time, CPU usage, memory usage, and disk usage for each task run. These data are stored in log files, often scattered across a file system. Therefore, aggregating information about resource usage critical for the optimization of Nextflow pipelines and improving reproducibility, as well as parsing and managing such log data, can quickly become cumbersome.</p><p><b>Results</b></p><p>Here, we present a web-based tool, Nextpie, which provides both a database and a reporting tool for Nextflow pipelines. Nextpie stores comprehensive resource usage information in a relational database, thus facilitating and accelerating the performance of a variety of data analyses and interactive visualizations, providing an easily comprehensible overview of a pipeline’s resource usage.</p><p><b>Availability and implementation</b></p><p>The Nextpie source code, user documentation, an SQLite database with test data, and a Nextflow example pipeline are available at GitHub (<a href="https://github.com/bishwaG/Nextpie">https://github.com/bishwaG/Nextpie</a>).</p>
dc.identifier.eissn2635-0041
dc.identifier.olddbid212590
dc.identifier.oldhandle10024/195608
dc.identifier.urihttps://www.utupub.fi/handle/11111/52759
dc.identifier.urlhttps://doi.org/10.1093/bioadv/vbaf252
dc.identifier.urnURN:NBN:fi-fe202601215924
dc.language.isoen
dc.okm.affiliatedauthorGhimire, Bishwa
dc.okm.affiliatedauthorBooth, Nicholas
dc.okm.affiliatedauthorLönnberg, Tapio
dc.okm.affiliatedauthorAittokallio, Tero
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline217 Medical engineeringen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline217 Lääketieteen tekniikkafi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherOxford University Press (OUP)
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.articlenumbervbaf252
dc.relation.doi10.1093/bioadv/vbaf252
dc.relation.ispartofjournalBioinformatics Advances
dc.relation.issue1
dc.relation.volume5
dc.source.identifierhttps://www.utupub.fi/handle/10024/195608
dc.titleNextpie: a web-based reporting tool and database for reproducible nextflow pipelines
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
vbaf252.pdf
Size:
738.33 KB
Format:
Adobe Portable Document Format