dMMR prediction from colorectal cancer histopathology: Leveraging non-tumor and low-magnification regions

dc.contributor.authorPetäinen, Liisa
dc.contributor.authorVäyrynen, Juha P.
dc.contributor.authorBöhm, Jan
dc.contributor.authorRuusuvuori, Pekka
dc.contributor.authorAhtiainen, Maarit
dc.contributor.authorElomaa, Hanna
dc.contributor.authorKarjalainen, Henna
dc.contributor.authorKastinen, Meeri
dc.contributor.authorTapiainen, Vilja V.
dc.contributor.authorÄijälä, Ville K.
dc.contributor.authorSirniö, Päivi
dc.contributor.authorTuomisto, Anne
dc.contributor.authorMäkinen, Markus J.
dc.contributor.authorMecklin, Jukka-Pekka
dc.contributor.authorPölönen, Ilkka
dc.contributor.authorÄyrämö, Sami
dc.contributor.organizationfi=biolääketieteen laitos|en=Institute of Biomedicine|
dc.contributor.organizationfi=InFLAMES Lippulaiva|en=InFLAMES Flagship|
dc.contributor.organization-code1.2.246.10.2458963.20.77952289591
dc.contributor.organization-code1.2.246.10.2458963.20.68445910604
dc.converis.publication-id522873195
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/522873195
dc.date.accessioned2026-04-24T16:34:15Z
dc.description.abstract<h3>Background and Objective</h3><p>Colorectal cancer is the second leading cause of cancer-related mortality worldwide, posing a substantial burden on healthcare systems. Identifying DNA mismatch repair deficiency (dMMR) is critical for guiding treatment, yet conventional methods rely on labor-intensive DNA analysis. While deep-learning approaches have shown promise for predicting dMMR from histopathological images, most studies focus exclusively on tumor regions and single-scale representations. This study systematically evaluates the predictive value of tumor and non-tumor regions across multiple magnifications for dMMR prediction from whole-slide images (WSIs).</p><h3>Methods</h3><p>A total of 24 different modeling approaches were evaluated, varying by tissue origin (tumor vs. non-tumor), magnification level (5x and 20x), and tile embedding strategy, including digital pathology foundation models. Tile embeddings were further trained with 1228 WSIs using multiple-instance learning (MIL) based approach. The best-performing configurations were selected for external evaluation. External testing was carried out on two independent cohorts consisting of 1010 and 457 WSIs, respectively.</p><h3>Results</h3><p>Non-tumorous regions demonstrated measurable predictive value, although performance remained lower than that obtained from tumor regions (F1 = 0.896, precision = 0.888, sensitivity = 0.594, specificity = 0.982). Among the nine models selected during internal validation, the top three models—one multi-scale approach and two models trained on 20x tumor regions—achieved F1 scores of 0.870–0.889 with precision of 0.885–0.920, sensitivity of 0.852, and specificity of 0.889–0.926. On external validation, the top three models, all based on foundation-model tile embeddings, achieved F1 scores of 0.916–0.919 on the first cohort and 0.928–0.934 on the second cohort. Across cohorts, specificity remained consistently high (0.964–0.992), while sensitivity ranged from 0.500 to 0.682.</p><h3>Conclusion</h3><p>This study demonstrates that dMMR status in colorectal cancer can be effectively predicted from histopathological WSIs using MIL-based models, with moderate generalizability across independent cohorts. In addition to confirming the predictive value of tumor regions, the results reveal that non-tumorous tissue also contains detectable predictive signals, suggesting that microenvironmental features may contribute to dMMR-associated histological patterns. Furthermore, the use of foundation model–derived embeddings improved generalizability across datasets. Future work should explore integrating non-tumor tissue features and clinical data to further improve predictive performance.</p>
dc.identifier.eissn1872-7565
dc.identifier.jour-issn0169-2607
dc.identifier.urihttps://www.utupub.fi/handle/11111/58755
dc.identifier.urlhttps://doi.org/10.1016/j.cmpb.2026.109317
dc.identifier.urnURN:NBN:fi-fe2026042332854
dc.language.isoen
dc.okm.affiliatedauthorRuusuvuori, Pekka
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.discipline3111 Biolääketieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherElsevier
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.articlenumber109317
dc.relation.doi10.1016/j.cmpb.2026.109317
dc.relation.ispartofjournalComputer Methods and Programs in Biomedicine
dc.relation.volume280
dc.titledMMR prediction from colorectal cancer histopathology: Leveraging non-tumor and low-magnification regions
dc.year.issued2026

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
1-s2.0-S0169260726000854-main.pdf
Size:
3.52 MB
Format:
Adobe Portable Document Format