Robust Modelling of Ordinal Survey Data Using Probabilistic Programming

dc.contributor.authorLahtinen, Aleksi
dc.contributor.authorEdwards, James Rhys
dc.contributor.authorCalmbach, Marc
dc.contributor.authorTautscher, Isabella
dc.contributor.authorLahti, Leo
dc.contributor.organizationfi=data-analytiikka|en=Data-analytiikka|
dc.contributor.organization-code1.2.246.10.2458963.20.68940835793
dc.converis.publication-id505865620
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/505865620
dc.date.accessioned2026-01-21T14:59:06Z
dc.date.available2026-01-21T14:59:06Z
dc.description.abstract<p>Surveys play a central role in much of the research conducted in the humanities and social sciences. A common data type encountered in surveys is the ordinal variable, which differs from nominal categorical variables. Several regression methods are available for analysing ordinal data, with the cumulative logistic model being one of the most widely used. However, ordinal survey data often present challenges, particularly in studies with small sample sizes, where some response categories and levels of explanatory variables can have low response rates. In such cases, classical statistical methods can produce unreliable or incomplete estimates. Here, we investigate the use of probabilistic programming, grounded in Bayesian analysis, as a more robust alternative for estimating category probabilities of ordinal variables and other model parameters. These models are better equipped to handle uncertainty and provide more reliable estimates, even in the presence of sparse data. We validate the approach with simulated data where the ground truth is known, and demonstrate the advantages of this approach by comparing it to its classical frequentist counterpart in the context of cultural participation and access survey.<br></p>
dc.format.pagerange608
dc.format.pagerange625
dc.identifier.olddbid213951
dc.identifier.oldhandle10024/196969
dc.identifier.urihttps://www.utupub.fi/handle/11111/56222
dc.identifier.urlhttps://doi.org/10.63744/eCwMjQ976nWf
dc.identifier.urnURN:NBN:fi-fe202601216316
dc.language.isoen
dc.okm.affiliatedauthorLahtinen, Aleksi
dc.okm.affiliatedauthorLahti, Leo
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeD3 Conference Article
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.conferenceComputational Humanities Research
dc.relation.doi10.63744/eCwMjQ976nWf
dc.relation.ispartofjournalAnthology of Computers and the Humanities
dc.relation.volume3
dc.source.identifierhttps://www.utupub.fi/handle/10024/196969
dc.titleRobust Modelling of Ordinal Survey Data Using Probabilistic Programming
dc.title.bookComputational Humanities Research 2025 : The proceedings of the Computational Humanities Research conference, held at the Luxembourg Centre for Contemporary and Digital History (C2DH) at the University of Luxembourg (December 9-12, 2025)
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
10.63744@eCwMjQ976nWf.pdf
Size:
1.96 MB
Format:
Adobe Portable Document Format