OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer

dc.contributor.authorHalkola Anni S.
dc.contributor.authorJoki Kaisa
dc.contributor.authorMirtti Tuomas
dc.contributor.authorMäkelä Marko M.
dc.contributor.authorAittokallio Tero
dc.contributor.authorLaajala Teemu D.
dc.contributor.organizationfi=sovellettu matematiikka|en=Applied mathematics|
dc.contributor.organization-code1.2.246.10.2458963.20.48078768388
dc.converis.publication-id179320823
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/179320823
dc.date.accessioned2025-08-27T22:31:19Z
dc.date.available2025-08-27T22:31:19Z
dc.description.abstract<p>In many real-world applications, such as those based on electronic health records, prognostic prediction of patient survival is based on heterogeneous sets of clinical laboratory measurements. To address the trade-off between the predictive accuracy of a prognostic model and the costs related to its clinical implementation, we propose an optimized <em>L</em><sub>0</sub>-pseudonorm approach to learn sparse solutions in multivariable regression. The model sparsity is maintained by restricting the number of nonzero coefficients in the model with a cardinality constraint, which makes the optimization problem NP-hard. In addition, we generalize the cardinality constraint for grouped feature selection, which makes it possible to identify key sets of predictors that may be measured together in a kit in clinical practice. We demonstrate the operation of our cardinality constraint-based feature subset selection method, named OSCAR, in the context of prognostic prediction of prostate cancer patients, where it enables<br>one to determine the key explanatory predictors at different levels of model sparsity. We further explore how the model sparsity affects the model accuracy and implementation cost. Lastly, we demonstrate generalization of the presented methodology to high-dimensional transcriptomics data.<br></p>
dc.identifier.eissn1553-734X
dc.identifier.jour-issn1553-7358
dc.identifier.olddbid202312
dc.identifier.oldhandle10024/185339
dc.identifier.urihttps://www.utupub.fi/handle/11111/46472
dc.identifier.urlhttps://doi.org/10.1371/journal.pcbi.1010333
dc.identifier.urnURN:NBN:fi-fe2023042638806
dc.language.isoen
dc.okm.affiliatedauthorHalkola, Anni
dc.okm.affiliatedauthorJoki, Kaisa
dc.okm.affiliatedauthorMäkelä, Marko
dc.okm.affiliatedauthorAittokallio, Tero
dc.okm.affiliatedauthorLaajala, Daniel
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherPUBLIC LIBRARY SCIENCE
dc.publisher.countryUnited Statesen_GB
dc.publisher.countryYhdysvallat (USA)fi_FI
dc.publisher.country-codeUS
dc.relation.articlenumbere1010333
dc.relation.doi10.1371/journal.pcbi.1010333
dc.relation.ispartofjournalPLoS Computational Biology
dc.relation.issue3
dc.relation.volume19
dc.source.identifierhttps://www.utupub.fi/handle/10024/185339
dc.titleOSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer
dc.year.issued2023

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
journal.pcbi.1010333.pdf
Size:
2.4 MB
Format:
Adobe Portable Document Format