How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size

Metsämuuronen, Jari; Niemensivu, Timi

How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size

dc.contributor.author	Metsämuuronen, Jari
dc.contributor.author	Niemensivu, Timi
dc.contributor.organization	fi=oppimisanalytiikan tutkimusinstituutti\|en=Turku Research Institute for Learning Analytics\|
dc.contributor.organization-code	1.2.246.10.2458963.20.73636593326
dc.converis.publication-id	498940181
dc.converis.url	https://research.utu.fi/converis/portal/Publication/498940181
dc.date.accessioned	2025-08-27T22:32:49Z
dc.date.available	2025-08-27T22:32:49Z
dc.description.abstract	Communicating the factual meaning of a particular reliability estimate is sometimes difficult. What does a specific reliability estimate of 0.80 or 0.95 mean in common language? Deflation-corrected estimates of reliability (DCER) using Somers' D or Goodman-Kruskal G as the item-score correlations are transformed into forms where specific estimates from the family of common language effect sizes are visible. This makes it possible to communicate reliability estimates using a common language and to evaluate the magnitude of a particular reliability estimate in the same way and with the same metric as we do with effect size estimates. Using a DCER, we can say that with k = 40 items, if the reliability is 0.95, in 80 out of 100 random pairs of test takers from different subpopulations on all items combined, those with a higher item response will also score higher on the test. In this case, using the thresholds familiar from effect sizes, we can say that the reliability is "very high." The transformation of the reliability estimate into a common language effect size depends on the size of the item-score association estimates and the number of items, so no closed-form equations for the transformations are given. However, relevant thresholds are provided for practical use.
dc.identifier.eissn	1552-3497
dc.identifier.jour-issn	0146-6216
dc.identifier.olddbid	202356
dc.identifier.oldhandle	10024/185383
dc.identifier.uri	https://www.utupub.fi/handle/11111/46813
dc.identifier.url	https://doi.org/10.1177/01466216251350159
dc.identifier.urn	URN:NBN:fi-fe2025082789766
dc.language.iso	en
dc.okm.affiliatedauthor	Metsämuuronen, Jari
dc.okm.affiliatedauthor	Niemensivu, Timi
dc.okm.discipline	112 Statistics and probability	en_GB
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	SAGE Publications
dc.publisher.country	United States	en_GB
dc.publisher.country	Yhdysvallat (USA)	fi_FI
dc.publisher.country-code	US
dc.publisher.place	THOUSAND OAKS
dc.relation.doi	10.1177/01466216251350159
dc.relation.ispartofjournal	Applied Psychological Measurement
dc.source.identifier	https://www.utupub.fi/handle/10024/185383
dc.title	How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size
dc.year.issued	2025

Tiedostot

Näytetään 1 - 1 / 1

Name:: metsamuuronen-niemensivu-2025-how-to-make-sense-of-reliability-common-language-interpretation-of-reliability-and-the.pdf
Size:: 803.36 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet