Generative AI in assessing written responses of geography exams: challenges and potential

dc.contributor.authorJauhiainen, Jussi S.
dc.contributor.authorGagagorry Guerra, Agustín
dc.contributor.authorNylén, Tua
dc.contributor.authorMäki, Sanna
dc.contributor.organizationfi=maantiede|en=Geography |
dc.contributor.organization-code1.2.246.10.2458963.20.17647764921
dc.converis.publication-id505817244
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/505817244
dc.date.accessioned2026-01-21T12:10:54Z
dc.date.available2026-01-21T12:10:54Z
dc.description.abstract<p>This article examines the application of Large Language Models (LLM) – GPT-4, Claude, Cohere, and Llama – to assess students’ open-ended responses in Geography exams. The models’ assessment scores were compared to assessment and scores by the original multi-stage human assessment as well as two additional human expert scoring. The case study considers the high-stakes national matriculation exam in Finland. The exam results play a crucial role in determining individuals’ eligibility for higher education, including a study right in Geography at the university. We selected 18 essays that had originally been given 5 (basic), 10 (good) and 15 (excellent) points on a scale from 0 to 15 points. Findings show variability between LLMs and notable differences between LLM and human evaluations. The language of responses and grading instruction influenced LLM performance. These results highlight the potential and complexities of integrating generative AI today in learning assessments to score open-ended responses. Precise control of prompts and LLM settings proved crucial for the LLM to align with original assessment scores more closely.</p>
dc.identifier.eissn1466-1845
dc.identifier.jour-issn0309-8265
dc.identifier.olddbid212199
dc.identifier.oldhandle10024/195217
dc.identifier.urihttps://www.utupub.fi/handle/11111/41488
dc.identifier.urlhttps://doi.org/10.1080/03098265.2025.2593484
dc.identifier.urnURN:NBN:fi-fe202601215614
dc.language.isoen
dc.okm.affiliatedauthorJauhiainen, Jussi
dc.okm.affiliatedauthorGaragorry Guerra, Agustín
dc.okm.affiliatedauthorNylén, Tua
dc.okm.affiliatedauthorMäki, Sanna
dc.okm.discipline516 Educational sciencesen_GB
dc.okm.discipline519 Social and economic geographyen_GB
dc.okm.discipline516 Kasvatustieteetfi_FI
dc.okm.discipline519 Yhteiskuntamaantiede, talousmaantiedefi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherInforma UK Limited
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.doi10.1080/03098265.2025.2593484
dc.relation.ispartofjournalJournal of Geography in Higher Education
dc.source.identifierhttps://www.utupub.fi/handle/10024/195217
dc.titleGenerative AI in assessing written responses of geography exams: challenges and potential
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Generative AI in assessing written responses of geography exams challenges and potential.pdf
Size:
1023.67 KB
Format:
Adobe Portable Document Format