Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4
| dc.contributor.author | Tarkka, Otto | |
| dc.contributor.author | Koljonen, Jaakko | |
| dc.contributor.author | Korhonen, Markus | |
| dc.contributor.author | Laine, Juuso | |
| dc.contributor.author | Martiskainen, Kristian | |
| dc.contributor.author | Elo, Kimmo | |
| dc.contributor.author | Laippala, Veronika | |
| dc.contributor.organization | fi=digitaalinen kielentutkimus, espanja, italia, kiina, ranska, saksa|en=Digital Language Studies, Chinese, French, German, Italian, Spanish| | |
| dc.contributor.organization | fi=eduskuntatutkimuksen keskus|en=Centre for Parliamentary Studies| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.36764574459 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.38771386471 | |
| dc.converis.publication-id | 457172276 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/457172276 | |
| dc.date.accessioned | 2025-08-28T02:46:49Z | |
| dc.date.available | 2025-08-28T02:46:49Z | |
| dc.description.abstract | <p>Annotating datasets can often be prohibitively expensive and laborious. Emotion annotation specifically has been shown to be a difficult task in which even trained annotators rarely reach high agreement. With the introduction of ChatGPT, GPT-4 and other Large Language Models (LLMs), however, a new line of research has emerged that explores the possibilities of automated data annotation. In this paper, we apply GPT-4 to the task of annotating a dataset, which is subsequently used to train a BERT model for emotion analysis of Finnish parliamentary speeches. In our experiment, GPT-4 performs on par with trained annotators and the annotations it produces can be used to train a classifier that reaches micro F1 of 0.690. We compare this model to two other models that are trained on machine translated datasets and find that the model trained on GPT-4 annotated data outperforms them. Our paper offers new insight into the possibilities that LLMs have to offer for the analysis of parliamentary corpora.</p> | |
| dc.format.pagerange | 70 | |
| dc.format.pagerange | 76 | |
| dc.identifier.eisbn | 978-2-493814-24-1 | |
| dc.identifier.jour-issn | 2522-2686 | |
| dc.identifier.olddbid | 209684 | |
| dc.identifier.oldhandle | 10024/192711 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/49286 | |
| dc.identifier.url | https://aclanthology.org/2024.parlaclarin-1.11.pdf | |
| dc.identifier.urn | URN:NBN:fi-fe2025082792453 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Tarkka, Otto | |
| dc.okm.affiliatedauthor | Koljonen, Jaakko | |
| dc.okm.affiliatedauthor | Korhonen, Markus | |
| dc.okm.affiliatedauthor | Laine, Juuso | |
| dc.okm.affiliatedauthor | Martiskainen, Kristian | |
| dc.okm.affiliatedauthor | Elo, Kimmo | |
| dc.okm.affiliatedauthor | Laippala, Veronika | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 6121 Languages | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.discipline | 6121 Kielitieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A4 Conference Article | |
| dc.publisher.country | France | en_GB |
| dc.publisher.country | Ranska | fi_FI |
| dc.publisher.country-code | FR | |
| dc.relation.conference | ParlaCLARIN Workshop | |
| dc.relation.ispartofjournal | LREC Proceedings | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/192711 | |
| dc.title | Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4 | |
| dc.title.book | Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) : ParlaCLARIN IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora | |
| dc.year.issued | 2024 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- 2024.parlaclarin-1.11_CC-BY-NC.pdf
- Size:
- 238.64 KB
- Format:
- Adobe Portable Document Format