Robust data-driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia

dc.contributor.authorGyllenberg D
dc.contributor.authorMcKeague IW
dc.contributor.authorSourander A
dc.contributor.authorBrown AS
dc.contributor.organizationfi=INVEST tutkimuskeskus ja lippulaiva|en=INVEST Research Flagship Centre|
dc.contributor.organizationfi=lastenpsykiatrian tutkimuskeskus|en=Research Centre for Child Psychiatry|
dc.contributor.organizationfi=tyks, vsshp|en=tyks, varha|
dc.contributor.organization-code1.2.246.10.2458963.20.83706093164
dc.contributor.organization-code2603023
dc.converis.publication-id48528693
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/48528693
dc.date.accessioned2022-10-28T14:38:50Z
dc.date.available2022-10-28T14:38:50Z
dc.description.abstractObjectives Few interactions between risk factors for schizophrenia have been replicated, but fitting all such interactions is difficult due to high-dimensionality. Our aims are to examine significant main and interaction effects for schizophrenia and the performance of our approach using simulated data.Methods We apply the machine learning technique elastic net to a high-dimensional logistic regression model to produce a sparse set of predictors, and then assess the significance of odds ratios (OR) with Bonferroni-corrected p-values and confidence intervals (CI). We introduce a simulation model that resembles a Finnish nested case-control study of schizophrenia which uses national registers to identify cases (n = 1,468) and controls (n = 2,975). The predictors include nine sociodemographic factors and all interactions (31 predictors).Results In the simulation, interactions with OR = 3 and prevalence = 4% were identified with <5% false positive rate and >= 80% power. None of the studied interactions were significantly associated with schizophrenia, but main effects of parental psychosis (OR = 5.2, CI 2.9-9.7; p < .001), urbanicity (1.3, 1.1-1.7; p = .001), and paternal age >= 35 (1.3, 1.004-1.6; p = .04) were significant.Conclusions We have provided an analytic pipeline for data-driven identification of main and interaction effects in case-control data. We identified highly replicated main effects for schizophrenia, but no interactions.
dc.format.pagerange1
dc.format.pagerange11
dc.identifier.jour-issn1049-8931
dc.identifier.olddbid189459
dc.identifier.oldhandle10024/172553
dc.identifier.urihttps://www.utupub.fi/handle/11111/44574
dc.identifier.urlhttps://doi.org/10.1002/mpr.1834
dc.identifier.urnURN:NBN:fi-fe2021042827409
dc.language.isoen
dc.okm.affiliatedauthorGyllenberg, David
dc.okm.affiliatedauthorSourander, Andre
dc.okm.affiliatedauthorDataimport, tyks, vsshp
dc.okm.discipline3124 Neurology and psychiatryen_GB
dc.okm.discipline3124 Neurologia ja psykiatriafi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherWILEY
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.articlenumberARTN e1834
dc.relation.doi10.1002/mpr.1834
dc.relation.ispartofjournalInternational Journal of Methods in Psychiatric Research
dc.relation.issue4
dc.relation.volume29
dc.source.identifierhttps://www.utupub.fi/handle/10024/172553
dc.titleRobust data-driven identification of risk factors and their interactions: A simulation and a study of parental and demographic risk factors for schizophrenia
dc.year.issued2020

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
mpr.1834.pdf
Size:
1.53 MB
Format:
Adobe Portable Document Format
Description:
Publisher´s PDF