Domain randomization using synthetic electrocardiograms for training neural networks

dc.contributor.authorKaisti Matti
dc.contributor.authorLaitala Juho
dc.contributor.authorWong David
dc.contributor.authorAirola Antti
dc.contributor.organizationfi=terveysteknologia|en=Health Technology|
dc.contributor.organization-code1.2.246.10.2458963.20.28696315432
dc.contributor.organization-code2610303
dc.converis.publication-id180291633
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/180291633
dc.date.accessioned2025-08-27T20:45:40Z
dc.date.available2025-08-27T20:45:40Z
dc.description.abstract<p>We present a method for training neural networks with synthetic electrocardiograms that mimic signals produced by a wearable single lead electrocardiogram monitor. We use domain randomization where the synthetic signal properties such as the waveform shape, RR-intervals and noise are varied for every training example. Models trained with synthetic data are compared to their counterparts trained with real data. Detection of r-waves in electrocardiograms recorded during different physical activities and in atrial fibrillation is used to assess the performance. By allowing the randomization of the synthetic signals to increase beyond what is typically observed in the real-world data the performance is on par or superseding the performance of networks trained with real data. Experiments show robust model performance using different seeds and on different unseen test sets that were fully separated from the training phase. The ability of the model to generalize well to hidden test sets without any specific tuning provides a simple and explainable alternative to more complex adversarial domain adaptation methods for model generalization. This method opens up the possibility of extending the use of synthetic data towards domain insensitive cardiac disease classification when disease specific a priori information is used in the electrocardiogram generation. Additionally, the method provides training with free-to-collect data with accurate labels, control of the data distribution eliminating class imbalances that are typically observed in health-related data, and the generated data is inherently private.</p>
dc.identifier.eissn1873-2860
dc.identifier.jour-issn0933-3657
dc.identifier.olddbid200182
dc.identifier.oldhandle10024/183209
dc.identifier.urihttps://www.utupub.fi/handle/11111/45793
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S0933365723000970?via%3Dihub
dc.identifier.urnURN:NBN:fi-fe2025082788996
dc.language.isoen
dc.okm.affiliatedauthorKaisti, Matti
dc.okm.affiliatedauthorLaitala, Juho
dc.okm.affiliatedauthorAirola, Antti
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherELSEVIER
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.articlenumber102583
dc.relation.doi10.1016/j.artmed.2023.102583
dc.relation.ispartofjournalArtificial Intelligence in Medicine
dc.relation.volume143
dc.source.identifierhttps://www.utupub.fi/handle/10024/183209
dc.titleDomain randomization using synthetic electrocardiograms for training neural networks
dc.year.issued2023

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
KaistiEtAl2023DomainRandomization.pdf
Size:
860.62 KB
Format:
Adobe Portable Document Format