Generative AI in assessing written responses of geography exams: challenges and potential

Jauhiainen, Jussi S.; Gagagorry Guerra, Agustín; Nylén, Tua; Mäki, Sanna

Generative AI in assessing written responses of geography exams: challenges and potential

dc.contributor.author	Jauhiainen, Jussi S.
dc.contributor.author	Gagagorry Guerra, Agustín
dc.contributor.author	Nylén, Tua
dc.contributor.author	Mäki, Sanna
dc.contributor.organization	fi=maantiede\|en=Geography \|
dc.contributor.organization-code	1.2.246.10.2458963.20.17647764921
dc.converis.publication-id	505817244
dc.converis.url	https://research.utu.fi/converis/portal/Publication/505817244
dc.date.accessioned	2026-01-21T12:10:54Z
dc.date.available	2026-01-21T12:10:54Z
dc.description.abstract	<p>This article examines the application of Large Language Models (LLM) – GPT-4, Claude, Cohere, and Llama – to assess students’ open-ended responses in Geography exams. The models’ assessment scores were compared to assessment and scores by the original multi-stage human assessment as well as two additional human expert scoring. The case study considers the high-stakes national matriculation exam in Finland. The exam results play a crucial role in determining individuals’ eligibility for higher education, including a study right in Geography at the university. We selected 18 essays that had originally been given 5 (basic), 10 (good) and 15 (excellent) points on a scale from 0 to 15 points. Findings show variability between LLMs and notable differences between LLM and human evaluations. The language of responses and grading instruction influenced LLM performance. These results highlight the potential and complexities of integrating generative AI today in learning assessments to score open-ended responses. Precise control of prompts and LLM settings proved crucial for the LLM to align with original assessment scores more closely.</p>
dc.identifier.eissn	1466-1845
dc.identifier.jour-issn	0309-8265
dc.identifier.olddbid	212199
dc.identifier.oldhandle	10024/195217
dc.identifier.uri	https://www.utupub.fi/handle/11111/41488
dc.identifier.url	https://doi.org/10.1080/03098265.2025.2593484
dc.identifier.urn	URN:NBN:fi-fe202601215614
dc.language.iso	en
dc.okm.affiliatedauthor	Jauhiainen, Jussi
dc.okm.affiliatedauthor	Garagorry Guerra, Agustín
dc.okm.affiliatedauthor	Nylén, Tua
dc.okm.affiliatedauthor	Mäki, Sanna
dc.okm.discipline	519 Social and economic geography	en_GB
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	Informa UK Limited
dc.publisher.country	United Kingdom	en_GB
dc.publisher.country	Britannia	fi_FI
dc.publisher.country-code	GB
dc.relation.doi	10.1080/03098265.2025.2593484
dc.relation.ispartofjournal	Journal of Geography in Higher Education
dc.source.identifier	https://www.utupub.fi/handle/10024/195217
dc.title	Generative AI in assessing written responses of geography exams: challenges and potential
dc.year.issued	2025

Tiedostot

Näytetään 1 - 1 / 1

Name:: Generative AI in assessing written responses of geography exams challenges and potential.pdf
Size:: 1023.67 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet