Anomymization in the context of generative AI : Aligning computer science and legal standards

dc.contributor.authorRomano, Melanie
dc.contributor.departmentfi=Tietotekniikan laitos|en=Department of Computing|
dc.contributor.facultyfi=Teknillinen tiedekunta|en=Faculty of Technology|
dc.contributor.studysubjectfi=Tieto- ja viestintätekniikka|en=Information and Communication Technology|
dc.date.accessioned2025-09-01T21:05:03Z
dc.date.available2025-09-01T21:05:03Z
dc.date.issued2025-07-31
dc.description.abstractThis thesis investigates anonymization in the age of generative artificial intelligence (AI), with a focus on aligning technical approaches from computer science with legal standards, particularly European data protection law. Adopting a multidisciplinary framework, the study explores the evolving notion of personal data, the theoretical and practical mechanisms of anonymization, and the challenges posed by generative AI systems at various levels (including training data, models, inputs, and outputs). Special attention is paid to the potential of synthetic data generation as a privacy-preserving technique and to differential privacy as a semantic privacy model. The work critically examines whether outputs produced by generative AI can themselves constitute personal data, and to what extent anonymization methods remain effective against modern reidentification attacks. Legal uncertainties surrounding the definition and sufficiency of anonymization, especially in light of the GDPR, the AI Act, and opinions by regulatory bodies like the EDPB, are highlighted. Ultimately, this study contributes to bridging the gap between legal doctrine and technical realities, offering insights into the strengths, limitations, and necessary evolution of anonymization practices in data-intensive AI systems.
dc.format.extent125
dc.identifier.olddbid211067
dc.identifier.oldhandle10024/194094
dc.identifier.urihttps://www.utupub.fi/handle/11111/16915
dc.identifier.urnURN:NBN:fi-fe2025090193741
dc.language.isoeng
dc.rightsfi=Julkaisu on tekijänoikeussäännösten alainen. Teosta voi lukea ja tulostaa henkilökohtaista käyttöä varten. Käyttö kaupallisiin tarkoituksiin on kielletty.|en=This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.|
dc.rights.accessrightsavoin
dc.source.identifierhttps://www.utupub.fi/handle/10024/194094
dc.subjectanonymization, privacy, regulation, GDPR, synthetic-data, generative AI, generative models, EDBP, AI Act, Differential Privacy
dc.titleAnomymization in the context of generative AI : Aligning computer science and legal standards
dc.type.ontasotfi=Diplomityö|en=Master's thesis|

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Synthetic_data___CS_and_Laws.pdf
Size:
1.26 MB
Format:
Adobe Portable Document Format