LLM-Assisted Qualitative Data Analysis: Security and Privacy Concerns in Gamified Workforce Studies

dc.contributor.authorAdeseye, Aisvarya
dc.contributor.authorIsoaho, Jouni
dc.contributor.authorMohammad, Tahir
dc.contributor.organizationfi=kyberturvallisuusteknologia|en=Cyber Security Engineering|
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.28753843706
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.converis.publication-id492311870
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/492311870
dc.date.accessioned2025-08-27T21:37:19Z
dc.date.available2025-08-27T21:37:19Z
dc.description.abstractLarge language models (LLMs) have transformed textual or qualitative data processing and analysis by automating and enhancing interpretive accuracy, particularly in complex areas like cybersecurity, ethics, and compliance. This study examines the effective-ness of local LLMs in analyzing qualitative research using the data gathered from the case study on "perspectives on security and privacy issues associated with the introduction of gamified workforce studies". The research presented in this paper utilized 23 interview transcripts to evaluate three popular LLMs, namely LLaMA, Gemma, and Phi, running on a local infrastructure. We observed that LLaMA focuses on practical data security, Gemma on regulatory compliance, and Phi on ethical transparency and trust-building. By combining these models, researchers can gain a more comprehensive understanding of the complex implications of gamification in workforce studies. Local LLMs provide the added benefit of enhanced data privacy and security by processing sensitive data entirely within a controlled environment. This study explores the system and user prompts that can improve the interpretive accuracy of various qualitative research approaches, such as thematic analysis, frequency analysis, impact level analysis, sensitivity analysis, and disclosure analysis, demonstrating the potential of local LLMs for qualitative analysis for sensitive data. This study recommends the usage of LLMs for the initial stage of the qualitative analysis process to enhance the efficiency and effectiveness of subsequent completely manual or software-assisted manual analysis.
dc.format.pagerange60
dc.format.pagerange67
dc.identifier.jour-issn1877-0509
dc.identifier.olddbid200752
dc.identifier.oldhandle10024/183779
dc.identifier.urihttps://www.utupub.fi/handle/11111/47127
dc.identifier.urlhttps://doi.org/10.1016/j.procs.2025.03.011
dc.identifier.urnURN:NBN:fi-fe2025082785111
dc.language.isoen
dc.okm.affiliatedauthorIsoaho, Jouni
dc.okm.affiliatedauthorAdeseye, Aisvarya
dc.okm.affiliatedauthorMohammad, Tahir
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline213 Electronic, automation and communications engineering, electronicsen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline213 Sähkö-, automaatio- ja tietoliikennetekniikka, elektroniikkafi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.conferenceInternational Conference on Ambient Systems, Networks and Technologies
dc.relation.doi10.1016/j.procs.2025.03.011
dc.relation.ispartofjournalProcedia Computer Science
dc.relation.volume257
dc.source.identifierhttps://www.utupub.fi/handle/10024/183779
dc.titleLLM-Assisted Qualitative Data Analysis: Security and Privacy Concerns in Gamified Workforce Studies
dc.title.bookThe 16th International Conference on Ambient Systems, Networks and Technologies Networks (ANT)/ the 8th International Conference on Emerging Data and Industry 4.0 (EDI40)
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
1-s2.0-S1877050925007471-main.pdf
Size:
596.55 KB
Format:
Adobe Portable Document Format