Improving performance in colorectal cancer histology decomposition using deep and ensemble machine learning

dc.contributor.authorPrezja, Fabi
dc.contributor.authorAnnala, Leevi
dc.contributor.authorKiiskinen, Sampsa
dc.contributor.authorLahtinen, Suvi
dc.contributor.authorOjala, Timo
dc.contributor.authorRuusuvuori, Pekka
dc.contributor.authorKuopio, Teijo
dc.contributor.organizationfi=biolääketieteen laitos|en=Institute of Biomedicine|
dc.contributor.organizationfi=tyks, vsshp|en=tyks, varha|
dc.contributor.organization-code1.2.246.10.2458963.20.77952289591
dc.converis.publication-id458235771
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/458235771
dc.date.accessioned2025-08-27T21:50:17Z
dc.date.available2025-08-27T21:50:17Z
dc.description.abstract<p>In routine colorectal cancer management, histologic samples stained with hematoxylin and eosin are commonly used. Nonetheless, their potential for defining objective biomarkers for patient stratification and treatment selection is still being explored. The current gold standard relies on expensive and time-consuming genetic tests. However, recent research highlights the potential of convolutional neural networks (CNNs) to facilitate the extraction of clinically relevant biomarkers from these readily available images. These CNN-based biomarkers can predict patient outcomes comparably to golden standards, with the added advantages of speed, automation, and minimal cost. The predictive potential of CNN-based biomarkers fundamentally relies on the ability of CNNs to accurately classify diverse tissue types from whole slide microscope images. Consequently, enhancing the accuracy of tissue class decomposition is critical to amplifying the prognostic potential of imaging-based biomarkers. This study introduces a hybrid deep transfer learning and ensemble machine learning model that improves upon previous approaches, including a transformer and neural architecture search baseline for this task. We employed a pairing of the EfficientNetV2 architecture with a random forest classification head. Our model achieved 96.74% accuracy (95% CI: 96.3%-97.1%) on the external test set and 99.89% on the internal test set. Recognizing the potential of these models in the task, we have made them publicly available.<br></p>
dc.identifier.eissn2405-8440
dc.identifier.jour-issn2405-8440
dc.identifier.olddbid201236
dc.identifier.oldhandle10024/184263
dc.identifier.urihttps://www.utupub.fi/handle/11111/47872
dc.identifier.urlhttps://doi.org/10.1016/j.heliyon.2024.e37561
dc.identifier.urnURN:NBN:fi-fe2025082785293
dc.language.isoen
dc.okm.affiliatedauthorRuusuvuori, Pekka
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline3122 Cancersen_GB
dc.okm.discipline3122 Syöpätauditfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.articlenumberARTN e37561
dc.relation.doi10.1016/j.heliyon.2024.e37561
dc.relation.ispartofjournalHeliyon
dc.relation.issue18
dc.relation.volume10
dc.source.identifierhttps://www.utupub.fi/handle/10024/184263
dc.titleImproving performance in colorectal cancer histology decomposition using deep and ensemble machine learning
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
1-s2.0-S240584402413592X-main.pdf
Size:
4.18 MB
Format:
Adobe Portable Document Format