Knowledge Tracing Models in Educational Data Mining: Historical Evolution, Categorization, and Empirical Evaluation

dc.contributor.authorDas Adhikary, Prince
dc.contributor.authorMetsämuuronen, Jari
dc.contributor.authorLaakso, Mikko-Jussi
dc.contributor.authorHeikkonen, Jukka
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organizationfi=oppimisanalytiikan tutkimusinstituutti|en=Turku Research Institute for Learning Analytics|
dc.contributor.organization-code1.2.246.10.2458963.20.73636593326
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.converis.publication-id523106374
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/523106374
dc.date.accessioned2026-04-30T15:29:48Z
dc.description.abstract<p>This article analyses computational models of Knowledge Tracing (KT), which address the complex sequence-modelling task of predicting dynamic, unobservable latent user states from historical interaction logs. First, we propose a comprehensive taxonomy identifying nine distinct and interconnected KT model families: psychometric; Bayesian; machine learning; deep learning; graph-based; temporal/sequential; multi-task; contrastive/self-supervised; and domain-adaptive. Secondly, we trace the historical evolution of KT architectures, from the foundational psychometric methods of the 1950s to the modern integration of attention mechanisms and graph neural networks. Thirdly, we systematically evaluate nine lightweight representative computational models—one from each category—across two large-scale datasets: ASSISTments 09-10 and DigiArvi 2025. We measure predictive calibration using accuracy, F1 score, ROC-AUC, average precision, and log loss under a strict computational time budget. Finally, our rigorous empirical analysis demonstrates that multi-task and temporal/sequential architectures yield the highest performance. Specifically, Fine-Grained Knowledge Tracing (FKT) achieved the best results on the DigiArvi dataset (accuracy: 0.77; F1 score: 0.85), while Temporal Item Response Theory (TIRT) performed best on the ASSISTments dataset (accuracy: 0.70; F1 score: 0.75). Traditional baselines, such as Logistic Regression (LR), remain highly competitive. Consequently, we advocate a shift towards ‘Green AI’ and standardized benchmarking to address the field’s fragmented evaluation standards, as we identify diminishing returns from increasing model complexity. Future research must leverage generative Artificial Intelligence (AI) and causal inference to move beyond simple prediction toward agentic AI systems capable of active pedagogical intervention.<br></p>
dc.format.pagerange49606
dc.format.pagerange49582
dc.identifier.eissn2169-3536
dc.identifier.urihttps://www.utupub.fi/handle/11111/60233
dc.identifier.urlhttps://ieeexplore.ieee.org/document/11457580
dc.identifier.urnURN:NBN:fi-fe2026043036772
dc.language.isoen
dc.okm.affiliatedauthorDas Adhikary, Prince
dc.okm.affiliatedauthorMetsämuuronen, Jari
dc.okm.affiliatedauthorLaakso, Mikko-Jussi
dc.okm.affiliatedauthorHeikkonen, Jukka
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherIEEE
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.doi10.1109/ACCESS.2026.3678846
dc.relation.ispartofjournalIEEE Access
dc.relation.volume14
dc.titleKnowledge Tracing Models in Educational Data Mining: Historical Evolution, Categorization, and Empirical Evaluation
dc.year.issued2026

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
Knowledge_Tracing_Models_in_Educational_Data_Mining_Historical_Evolution_Categorization_and_Empirical_Evaluation.pdf
Size:
3.5 MB
Format:
Adobe Portable Document Format