Can text and data mining exceptions and synthetic data training mitigate copyright-related concerns in generative AI?

Manteghi, Maryna

Can text and data mining exceptions and synthetic data training mitigate copyright-related concerns in generative AI?

dc.contributor.author	Manteghi, Maryna
dc.contributor.organization	fi=oikeustiede\|en=Laws\|
dc.contributor.organization-code	1.2.246.10.2458963.20.53046050752
dc.converis.publication-id	457845589
dc.converis.url	https://research.utu.fi/converis/portal/Publication/457845589
dc.date.accessioned	2025-08-27T22:10:15Z
dc.date.available	2025-08-27T22:10:15Z
dc.description.abstract	<p>Rapidly emerging generative artificial intelligence (GenAI) models stand at the epicentre of current public discourse. They demonstrate impressive abilities to generate various types of data promptly and cost-effectively. However, AI developers need to train their systems on massive volumes of data which is usually copyrighted. Therefore, the growth of copyright-related concerns in the field of GenAI comes as no surprise. The study introduces two solutions which could mitigate the tension between copyright holders and AI developers, one legal (text and data mining (TDM) exceptions of the CDSM Directive) and one technical (synthetic data), highlighting the promises and challenges of both. First, the article will discuss the capability of TDM exceptions to facilitate the fundamental right to information and the freedom of research in the context of AI development. Next, the paper will analyse how providers of GenAI models can leverage synthetic data to comply with copyright law while training their systems and what risks might be associated with this approach. The findings of this study will indicate what issues, in both legal and technical spheres, should be addressed to ensure a balance of powers in the digital environment and effective functionality of the EU AI sector.</p>
dc.identifier.eissn	1757-997X
dc.identifier.jour-issn	1757-9961
dc.identifier.olddbid	201746
dc.identifier.oldhandle	10024/184773
dc.identifier.uri	https://www.utupub.fi/handle/11111/49242
dc.identifier.url	https://www.tandfonline.com/doi/full/10.1080/17579961.2024.2392928
dc.identifier.urn	URN:NBN:fi-fe2025082785494
dc.language.iso	en
dc.okm.affiliatedauthor	Manteghi, Maryna
dc.okm.discipline	513 Law	en_GB
dc.okm.discipline	513 Oikeustiede	fi_FI
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	Taylor and Francis Ltd.
dc.publisher.country	United Kingdom	en_GB
dc.publisher.country	Britannia	fi_FI
dc.publisher.country-code	GB
dc.relation.doi	10.1080/17579961.2024.2392928
dc.relation.ispartofjournal	Law, innovation and technology
dc.source.identifier	https://www.utupub.fi/handle/10024/184773
dc.title	Can text and data mining exceptions and synthetic data training mitigate copyright-related concerns in generative AI?
dc.year.issued	2024

Tiedostot

Näytetään 1 - 1 / 1

Name:: Can text and data mining exceptions and synthetic data training mitigate copyright-related concerns in generative AI .pdf
Size:: 937.49 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet