Coralysis enables sensitive identification of imbalanced cell types and states in single-cell data via multi-level integration

dc.contributor.authorSousa, António G. G.
dc.contributor.authorSmolander, Johannes
dc.contributor.authorJunttila, Sini
dc.contributor.authorElo, Laura L.
dc.contributor.organizationfi=InFLAMES Lippulaiva|en=InFLAMES Flagship|
dc.contributor.organizationfi=Turun biotiedekeskus|en=Turku Bioscience Centre|
dc.contributor.organization-code1.2.246.10.2458963.20.18586209670
dc.contributor.organization-code1.2.246.10.2458963.20.68445910604
dc.contributor.organization-code2609201
dc.converis.publication-id505369787
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/505369787
dc.date.accessioned2026-01-21T12:17:22Z
dc.date.available2026-01-21T12:17:22Z
dc.description.abstract<p>Complex single-cell analyses now routinely integrate multiple datasets, followed by cell-type annotation and differential expression analysis. Current state-of-the-art integration methods often struggle with imbalanced cell types across datasets particularly when highly similar but distinct cell types are not present in all datasets. Inaccurate integration leads to incorrect annotations, affecting downstream analyses such as differential expression. To streamline single-cell data analysis, we introduce Coralysis, an all-in-one package featuring a sensitive integration algorithm, reference-mapping for accurate automatic annotation, and fine-grained cell-state identification. We demonstrate that Coralysis shows consistently high performance across diverse integration tasks, outperforming state-of-the-art methods particularly in challenging settings when similar cell types are imbalanced or missing. It accurately predicts cell-type identities across various annotation scenarios. A key strength of Coralysis is its ability to provide cell-specific probability scores, enabling the identification of transient and stable cell-states, along with their differential expression patterns. Importantly, Coralysis performs robustly on different types of single-cell data from transcriptomics to proteomics. Overall, Coralysis includes all the main steps of single-cell data analysis; it preserves subtle biological variation by improving the integration and annotation of imbalanced cell types, and identifies fine-grained cell-states—enabling a faithful analysis of the cellular landscape in complex single-cell experiments.<br></p>
dc.identifier.eissn1362-4962
dc.identifier.jour-issn0305-1048
dc.identifier.olddbid212306
dc.identifier.oldhandle10024/195324
dc.identifier.urihttps://www.utupub.fi/handle/11111/48087
dc.identifier.urlhttps://doi.org/10.1093/nar/gkaf1128
dc.identifier.urnURN:NBN:fi-fe202601216790
dc.language.isoen
dc.okm.affiliatedauthorGoncalves de Sousa, Antonio
dc.okm.affiliatedauthorSmolander, Johannes
dc.okm.affiliatedauthorJunttila, Sini
dc.okm.affiliatedauthorElo, Laura
dc.okm.discipline112 Statistics and probabilityen_GB
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline1182 Biochemistry, cell and molecular biologyen_GB
dc.okm.discipline112 Tilastotiedefi_FI
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline1182 Biokemia, solu- ja molekyylibiologiafi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherOxford University Press (OUP)
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.articlenumbergkaf1128
dc.relation.doi10.1093/nar/gkaf1128
dc.relation.ispartofjournalNucleic Acids Research
dc.relation.issue21
dc.relation.volume53
dc.source.identifierhttps://www.utupub.fi/handle/10024/195324
dc.titleCoralysis enables sensitive identification of imbalanced cell types and states in single-cell data via multi-level integration
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
gkaf1128.pdf
Size:
22.5 MB
Format:
Adobe Portable Document Format