Bias in O-Information Estimation

Gehlen, Johanna; Li, Jie; Hourican, Cillian; Tassi, Stavroula; Mishra, Pashupati P.; Lehtimäki, Terho; Kähönen, Mika; Raitakari, Olli; Bosch, Jos A.; Quax, Rick

Bias in O-Information Estimation

dc.contributor.author	Gehlen, Johanna
dc.contributor.author	Li, Jie
dc.contributor.author	Hourican, Cillian
dc.contributor.author	Tassi, Stavroula
dc.contributor.author	Mishra, Pashupati P.
dc.contributor.author	Lehtimäki, Terho
dc.contributor.author	Kähönen, Mika
dc.contributor.author	Raitakari, Olli
dc.contributor.author	Bosch, Jos A.
dc.contributor.author	Quax, Rick
dc.contributor.organization	fi=sydäntutkimuskeskus\|en=Cardiovascular Medicine (CAPC)\|
dc.contributor.organization-code	1.2.246.10.2458963.20.35734063924
dc.converis.publication-id	458970411
dc.converis.url	https://research.utu.fi/converis/portal/Publication/458970411
dc.date.accessioned	2025-08-27T23:13:52Z
dc.date.available	2025-08-27T23:13:52Z
dc.description.abstract	Higher-order relationships are a central concept in the science of complex systems. A popular method of attempting to estimate the higher-order relationships of synergy and redundancy from data is through the O-information. It is an information-theoretic measure composed of Shannon entropy terms that quantifies the balance between redundancy and synergy in a system. However, bias is not yet taken into account in the estimation of the O-information of discrete variables. In this paper, we explain where this bias comes from and explore it for fully synergistic, fully redundant, and fully independent simulated systems of n=3 variables. Specifically, we explore how the sample size and number of bins affect the bias in the O-information estimation. The main finding is that the O-information of independent systems is severely biased towards synergy if the sample size is smaller than the number of jointly possible observations. This could mean that triplets identified as highly synergistic may in fact be close to independent. A bias approximation based on the Miller-Maddow method is derived for the O-information. We find that for systems of n=3 variables the bias approximation can partially correct for the bias. However, simulations of fully independent systems are still required as null models to provide a benchmark of the bias of the O-information.
dc.identifier.eissn	1099-4300
dc.identifier.olddbid	203645
dc.identifier.oldhandle	10024/186672
dc.identifier.uri	https://www.utupub.fi/handle/11111/43112
dc.identifier.url	https://doi.org/10.3390/e26100837
dc.identifier.urn	URN:NBN:fi-fe2025082790178
dc.language.iso	en
dc.okm.affiliatedauthor	Raitakari, Olli
dc.okm.affiliatedauthor	Dataimport, tyks, vsshp
dc.okm.discipline	111 Mathematics	en_GB
dc.okm.internationalcopublication	international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	MDPI
dc.publisher.country	Switzerland	en_GB
dc.publisher.country	Sveitsi	fi_FI
dc.publisher.country-code	CH
dc.relation.articlenumber	837
dc.relation.doi	10.3390/e26100837
dc.relation.ispartofjournal	Entropy
dc.relation.issue	10
dc.relation.volume	26
dc.source.identifier	https://www.utupub.fi/handle/10024/186672
dc.title	Bias in O-Information Estimation
dc.year.issued	2024

Tiedostot

Näytetään 1 - 1 / 1

Name:: entropy-26-00837-v2.pdf
Size:: 1011.98 KB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet