Bias in O-Information Estimation

dc.contributor.authorGehlen, Johanna
dc.contributor.authorLi, Jie
dc.contributor.authorHourican, Cillian
dc.contributor.authorTassi, Stavroula
dc.contributor.authorMishra, Pashupati P.
dc.contributor.authorLehtimäki, Terho
dc.contributor.authorKähönen, Mika
dc.contributor.authorRaitakari, Olli
dc.contributor.authorBosch, Jos A.
dc.contributor.authorQuax, Rick
dc.contributor.organizationfi=sydäntutkimuskeskus|en=Cardiovascular Medicine (CAPC)|
dc.contributor.organization-code1.2.246.10.2458963.20.35734063924
dc.converis.publication-id458970411
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/458970411
dc.date.accessioned2025-08-27T23:13:52Z
dc.date.available2025-08-27T23:13:52Z
dc.description.abstractHigher-order relationships are a central concept in the science of complex systems. A popular method of attempting to estimate the higher-order relationships of synergy and redundancy from data is through the O-information. It is an information-theoretic measure composed of Shannon entropy terms that quantifies the balance between redundancy and synergy in a system. However, bias is not yet taken into account in the estimation of the O-information of discrete variables. In this paper, we explain where this bias comes from and explore it for fully synergistic, fully redundant, and fully independent simulated systems of n=3 variables. Specifically, we explore how the sample size and number of bins affect the bias in the O-information estimation. The main finding is that the O-information of independent systems is severely biased towards synergy if the sample size is smaller than the number of jointly possible observations. This could mean that triplets identified as highly synergistic may in fact be close to independent. A bias approximation based on the Miller-Maddow method is derived for the O-information. We find that for systems of n=3 variables the bias approximation can partially correct for the bias. However, simulations of fully independent systems are still required as null models to provide a benchmark of the bias of the O-information.
dc.identifier.eissn1099-4300
dc.identifier.olddbid203645
dc.identifier.oldhandle10024/186672
dc.identifier.urihttps://www.utupub.fi/handle/11111/43112
dc.identifier.urlhttps://doi.org/10.3390/e26100837
dc.identifier.urnURN:NBN:fi-fe2025082790178
dc.language.isoen
dc.okm.affiliatedauthorRaitakari, Olli
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline111 Mathematicsen_GB
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherMDPI
dc.publisher.countrySwitzerlanden_GB
dc.publisher.countrySveitsifi_FI
dc.publisher.country-codeCH
dc.relation.articlenumber837
dc.relation.doi10.3390/e26100837
dc.relation.ispartofjournalEntropy
dc.relation.issue10
dc.relation.volume26
dc.source.identifierhttps://www.utupub.fi/handle/10024/186672
dc.titleBias in O-Information Estimation
dc.year.issued2024

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
entropy-26-00837-v2.pdf
Size:
1011.98 KB
Format:
Adobe Portable Document Format