TFBSFootprinter: a multiomics tool for prediction of transcription factor binding sites in vertebrate species

dc.contributor.authorBarker, Harlan R.
dc.contributor.authorParkkila, Seppo
dc.contributor.authorTolvanen, Martti E.E.
dc.contributor.organizationfi=terveysteknologia|en=Health Technology|
dc.contributor.organization-code1.2.246.10.2458963.20.28696315432
dc.converis.publication-id499343308
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/499343308
dc.date.accessioned2025-08-27T23:32:48Z
dc.date.available2025-08-27T23:32:48Z
dc.description.abstract<p><b>Background:</b> Transcription factor (TF) proteins play a critical role in the regulation of eukaryotic gene expression via sequence-specific binding to genomic locations known as transcription factor binding sites (TFBSs). Accurate prediction of TFBSs is essential for understanding gene regulation, disease mechanisms, and drug discovery. These studies are therefore relevant not only in humans but also in model organisms and domesticated and wild animals. However, current tools for the automatic analysis of TFBSs in gene promoter regions are limited in their usability across multiple species. To our knowledge, no tools currently exist that allow for automatic analysis of TFBSs in gene promoter regions for many species. <br></p><p><b>Methodology and Findings:</b> The TFBSFootprinter tool combines multiomic transcription-relevant data for more accurate prediction of functional TFBSs in 317 vertebrate species. In humans, this includes vertebrate sequence conservation (GERP), proximity to transcription start sites (FANTOM5), correlation of expression between target genes and TFs predicted to bind promoters (FANTOM5), overlap with ChIP-Seq TF metaclusters (GTRD), overlap with ATAC-Seq peaks (ENCODE), eQTLs (GTEx), and the observed/expected CpG ratio (Ensembl). In non-human vertebrates, this includes GERP, proximity to transcription start sites, and CpG ratio. <br></p><p>TFBSFootprinter analyses are based on the Ensembl transcript ID for simplicity of use and require minimal setup steps. Benchmarking of the TFBSFootprinter on a manually curated and experimentally verified dataset of TFBSs produced superior results when using all multiomic data (average area under the receiver operating characteristic curve, 0.881), compared with DeepBind (0.798), DeepSEA (0.682), FIMO (0.817) and traditional PWM (0.854). The results were further improved by selecting the best overall combination of multiomic data (0.910). Additionally, we determined combinations of multiomic data that provide the best model of binding for each TF. TFBSFootprinter is available as Conda and Python packages.</p>
dc.format.pagerange204
dc.format.pagerange223
dc.identifier.eissn2154-1272
dc.identifier.jour-issn2154-1264
dc.identifier.olddbid204161
dc.identifier.oldhandle10024/187188
dc.identifier.urihttps://www.utupub.fi/handle/11111/52322
dc.identifier.urlhttps://doi.org/10.1080/21541264.2025.2521764
dc.identifier.urnURN:NBN:fi-fe2025082790348
dc.language.isoen
dc.okm.affiliatedauthorTolvanen, Martti
dc.okm.discipline1182 Biochemistry, cell and molecular biologyen_GB
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.discipline1182 Biokemia, solu- ja molekyylibiologiafi_FI
dc.okm.discipline3111 Biolääketieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherInforma UK Limited
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.publisher.placePHILADELPHIA
dc.relation.doi10.1080/21541264.2025.2521764
dc.relation.ispartofjournalTranscription
dc.relation.issue2-3
dc.relation.volume16
dc.source.identifierhttps://www.utupub.fi/handle/10024/187188
dc.titleTFBSFootprinter: a multiomics tool for prediction of transcription factor binding sites in vertebrate species
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
TFBSFootprinter a multiomics tool for prediction of transcription factor binding sites in vertebrate species.pdf
Size:
2.26 MB
Format:
Adobe Portable Document Format