Finnish perspective on using synthetic health data to protect privacy: the PRIVASA project

dc.contributor.authorPitkämäki, Tinja
dc.contributor.authorPahikkala, Tapio
dc.contributor.authorMontoya Perez, Ileana
dc.contributor.authorMovahedi, Parisa
dc.contributor.authorNieminen, Valtteri
dc.contributor.authorSoutherington, Tom
dc.contributor.authorVaiste, Juho
dc.contributor.authorJafaritadi, Mojtaba
dc.contributor.authorKhan, Muhammad Irfan
dc.contributor.authorKontio, Elina
dc.contributor.authorRanttila, Pertti
dc.contributor.authorPajula, Juha
dc.contributor.authorPölönen, Harri
dc.contributor.authorDegerli, Aysen
dc.contributor.authorPlomp, Johan
dc.contributor.authorAirola, Antti
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organizationfi=johtaminen ja organisointi|en=Management and Organisation|
dc.contributor.organizationfi=oikeustiede|en=Laws|
dc.contributor.organizationfi=terveysteknologia|en=Health Technology|
dc.contributor.organizationfi=tutkimuspalvelut|en=Research Services|
dc.contributor.organization-code1.2.246.10.2458963.20.28696315432
dc.contributor.organization-code1.2.246.10.2458963.20.53046050752
dc.contributor.organization-code1.2.246.10.2458963.20.55151349721
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.contributor.organization-code1.2.246.10.2458963.20.95121698369
dc.converis.publication-id459120766
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/459120766
dc.date.accessioned2025-08-28T00:54:23Z
dc.date.available2025-08-28T00:54:23Z
dc.description.abstract<p>The use of synthetic data could facilitate data-driven innovation across industries and applications. Synthetic data can be generated using a range of methods, from statistical modeling to machine learning and generative AI, resulting in datasets of different formats and utility. In the health sector, the use of synthetic data is often motivated by privacy concerns. As generative AI is becoming an everyday tool, there is a need for practice-oriented insights into the prospects and limitations of synthetic data, especially in the privacy sensitive domains. We present an interdisciplinary outlook on the topic, focusing on, but not limited to, the Finnish regulatory context. First, we emphasize the need for working definitions to avoid misplaced assumptions. Second, we consider use cases for synthetic data, viewing it as a helpful tool for experimentation, decision-making, and building data literacy. Yet the complementary uses of synthetic datasets should not diminish the continued efforts to collect and share high-quality real-world data. Third, we discuss how privacy-preserving synthetic datasets fall into the existing data protection frameworks. Neither the process of synthetic data generation nor synthetic datasets are automatically exempt from the regulatory obligations concerning personal data. Finally, we explore the future research directions for generating synthetic data and conclude by discussing potential future developments at the societal level.</p>
dc.format.pagerange138
dc.format.pagerange163
dc.identifier.eissn2771-392X
dc.identifier.olddbid206652
dc.identifier.oldhandle10024/189679
dc.identifier.urihttps://www.utupub.fi/handle/11111/48079
dc.identifier.urlhttps://www.aimspress.com/article/doi/10.3934/aci.2024009
dc.identifier.urnURN:NBN:fi-fe2025082791328
dc.language.isoen
dc.okm.affiliatedauthorPitkämäki, Tinja
dc.okm.affiliatedauthorPahikkala, Tapio
dc.okm.affiliatedauthorMontoya Perez, Ileana
dc.okm.affiliatedauthorMovahedi, Parisa
dc.okm.affiliatedauthorNieminen, Valtteri
dc.okm.affiliatedauthorSoutherington, Tom
dc.okm.affiliatedauthorVaiste, Juho
dc.okm.affiliatedauthorAirola, Antti
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherAmerican Institute of Mathematical Sciences (AIMS)
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.doi10.3934/aci.2024009
dc.relation.ispartofjournalApplied Computing and Intelligence
dc.relation.issue2
dc.relation.volume4
dc.source.identifierhttps://www.utupub.fi/handle/10024/189679
dc.titleFinnish perspective on using synthetic health data to protect privacy: the PRIVASA project
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
10.3934_aci.2024009.pdf
Size:
4.28 MB
Format:
Adobe Portable Document Format