qSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets

dc.contributor.authorHäkkinen A
dc.contributor.authorKoiranen J
dc.contributor.authorCasado J
dc.contributor.authorKaipio K
dc.contributor.authorLehtonen O
dc.contributor.authorPetrucci E
dc.contributor.authorHynninen J
dc.contributor.authorHietanen S
dc.contributor.authorCarpén O
dc.contributor.authorPasquini L
dc.contributor.authorBiffoni M
dc.contributor.authorLehtonen R
dc.contributor.authorHautaniemi S
dc.contributor.organizationfi=biolääketieteen laitos|en=Institute of Biomedicine|
dc.contributor.organizationfi=synnytys- ja naistentautioppi|en=Obstetrics and Gynaecology|
dc.contributor.organizationfi=tyks, vsshp|en=tyks, varha|
dc.contributor.organization-code1.2.246.10.2458963.20.74725736230
dc.contributor.organization-code1.2.246.10.2458963.20.77952289591
dc.contributor.organization-code2607100
dc.converis.publication-id50793942
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/50793942
dc.date.accessioned2022-10-27T12:09:35Z
dc.date.available2022-10-27T12:09:35Z
dc.description.abstract<p><strong>Motivation: </strong>Non-parametric dimensionality reduction techniques, such as t-distributed stochastic neighbor embedding (t-SNE), are the most frequently used methods in the exploratory analysis of single-cell datasets. Current implementations scale poorly to massive datasets and often require downsampling or interpolative approximations, which can leave less-frequent populations undiscovered and much information unexploited.</p><p><strong>Results: </strong>We implemented a fast t-SNE package, qSNE, which uses a quasi-Newton optimizer, allowing quadratic convergence rate and automatic perplexity (level of detail) optimizer. Our results show that these improvements make qSNE significantly faster than regular t-SNE packages and enables full analysis of large datasets, such as mass cytometry data, without downsampling.</p>
dc.format.pagerange5086
dc.format.pagerange5092
dc.identifier.eissn1367-4811
dc.identifier.jour-issn1367-4803
dc.identifier.olddbid173591
dc.identifier.oldhandle10024/156685
dc.identifier.urihttps://www.utupub.fi/handle/11111/32720
dc.identifier.urnURN:NBN:fi-fe2021042822327
dc.language.isoen
dc.okm.affiliatedauthorKaipio, Katja
dc.okm.affiliatedauthorHynninen, Johanna
dc.okm.affiliatedauthorHietanen, Sakari
dc.okm.affiliatedauthorCarpen, Olli
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline318 Medical biotechnologyen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline318 Lääketieteen bioteknologiafi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.doi10.1093/bioinformatics/btaa637
dc.relation.ispartofjournalBioinformatics
dc.relation.issue20
dc.relation.volume36
dc.source.identifierhttps://www.utupub.fi/handle/10024/156685
dc.titleqSNE: quadratic rate t-SNE optimizer with automatic parameter tuning for large datasets
dc.year.issued2020

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
btaa637.pdf
Size:
4.01 MB
Format:
Adobe Portable Document Format
Description:
Publisher's version