Empirical investigation of multi-source cross-validation in clinical ECG classification

dc.contributor.authorLeinonen, Tuija
dc.contributor.authorWong, David
dc.contributor.authorVasankari, Antti
dc.contributor.authorWahab, Ali
dc.contributor.authorNadarajah, Ramesh
dc.contributor.authorKaisti, Matti
dc.contributor.authorAirola, Antti
dc.contributor.organizationfi=terveysteknologia|en=Health Technology|
dc.contributor.organization-code1.2.246.10.2458963.20.28696315432
dc.converis.publication-id458889243
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/458889243
dc.date.accessioned2025-08-27T21:40:47Z
dc.date.available2025-08-27T21:40:47Z
dc.description.abstractTraditionally, machine learning-based clinical prediction models have been trained and evaluated on patient data from a single source, such as a hospital. Cross-validation methods can be used to estimate the accuracy of such models on new patients originating from the same source, by repeated random splitting of the data. However, such estimates tend to be highly overoptimistic when compared to accuracy obtained from deploying models to sources not represented in the dataset, such as a new hospital. The increasing availability of multi-source medical datasets provides new opportunities for obtaining more comprehensive and realistic evaluations of expected accuracy through source-level cross-validation designs. In this study, we present a systematic empirical evaluation of standard K-fold cross-validation and leave-source-out cross-validation methods in a multi-source setting. We consider the task of electrocardiogram based cardiovascular disease classification, combining and harmonizing the openly available PhysioNet/CinC Challenge 2021 and the Shandong Provincial Hospital datasets for our study. Our results show that K-fold cross-validation, both on single-source and multi-source data, systemically overestimates prediction performance when the end goal is to generalize to new sources. Leave-source-out cross-validation provides more reliable performance estimates, having close to zero bias though larger variability. The evaluation highlights the dangers of obtaining misleading cross-validation results on medical data and demonstrates how these issues can be mitigated when having access to multi-source data.
dc.identifier.eissn1879-0534
dc.identifier.jour-issn0010-4825
dc.identifier.olddbid200869
dc.identifier.oldhandle10024/183896
dc.identifier.urihttps://www.utupub.fi/handle/11111/47251
dc.identifier.urlhttps://doi.org/10.1016/j.compbiomed.2024.109271
dc.identifier.urnURN:NBN:fi-fe2025082785161
dc.language.isoen
dc.okm.affiliatedauthorLeinonen, Tuija
dc.okm.affiliatedauthorVasankari, Antti
dc.okm.affiliatedauthorKaisti, Matti
dc.okm.affiliatedauthorAirola, Antti
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline217 Medical engineeringen_GB
dc.okm.discipline3121 Internal medicineen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline217 Lääketieteen tekniikkafi_FI
dc.okm.discipline3121 Sisätauditfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherElsevier
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.articlenumber109271
dc.relation.doi10.1016/j.compbiomed.2024.109271
dc.relation.ispartofjournalComputers in Biology and Medicine
dc.relation.volume183
dc.source.identifierhttps://www.utupub.fi/handle/10024/183896
dc.titleEmpirical investigation of multi-source cross-validation in clinical ECG classification
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
1-s2.0-S0010482524013568-main.pdf
Size:
2.14 MB
Format:
Adobe Portable Document Format