How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size

dc.contributor.authorMetsämuuronen, Jari
dc.contributor.authorNiemensivu, Timi
dc.contributor.organizationfi=oppimisanalytiikan tutkimusinstituutti|en=Turku Research Institute for Learning Analytics|
dc.contributor.organization-code1.2.246.10.2458963.20.73636593326
dc.converis.publication-id498940181
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/498940181
dc.date.accessioned2025-08-27T22:32:49Z
dc.date.available2025-08-27T22:32:49Z
dc.description.abstractCommunicating the factual meaning of a particular reliability estimate is sometimes difficult. What does a specific reliability estimate of 0.80 or 0.95 mean in common language? Deflation-corrected estimates of reliability (DCER) using Somers' D or Goodman-Kruskal G as the item-score correlations are transformed into forms where specific estimates from the family of common language effect sizes are visible. This makes it possible to communicate reliability estimates using a common language and to evaluate the magnitude of a particular reliability estimate in the same way and with the same metric as we do with effect size estimates. Using a DCER, we can say that with k = 40 items, if the reliability is 0.95, in 80 out of 100 random pairs of test takers from different subpopulations on all items combined, those with a higher item response will also score higher on the test. In this case, using the thresholds familiar from effect sizes, we can say that the reliability is "very high." The transformation of the reliability estimate into a common language effect size depends on the size of the item-score association estimates and the number of items, so no closed-form equations for the transformations are given. However, relevant thresholds are provided for practical use.
dc.identifier.eissn1552-3497
dc.identifier.jour-issn0146-6216
dc.identifier.olddbid202356
dc.identifier.oldhandle10024/185383
dc.identifier.urihttps://www.utupub.fi/handle/11111/46813
dc.identifier.urlhttps://doi.org/10.1177/01466216251350159
dc.identifier.urnURN:NBN:fi-fe2025082789766
dc.language.isoen
dc.okm.affiliatedauthorMetsämuuronen, Jari
dc.okm.affiliatedauthorNiemensivu, Timi
dc.okm.discipline111 Mathematicsen_GB
dc.okm.discipline112 Statistics and probabilityen_GB
dc.okm.discipline515 Psychologyen_GB
dc.okm.discipline516 Educational sciencesen_GB
dc.okm.discipline111 Matematiikkafi_FI
dc.okm.discipline112 Tilastotiedefi_FI
dc.okm.discipline515 Psykologiafi_FI
dc.okm.discipline516 Kasvatustieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherSAGE Publications
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.publisher.placeTHOUSAND OAKS
dc.relation.doi10.1177/01466216251350159
dc.relation.ispartofjournalApplied Psychological Measurement
dc.source.identifierhttps://www.utupub.fi/handle/10024/185383
dc.titleHow to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
metsamuuronen-niemensivu-2025-how-to-make-sense-of-reliability-common-language-interpretation-of-reliability-and-the.pdf
Size:
803.36 KB
Format:
Adobe Portable Document Format