Stable Iterative Variable Selection

dc.contributor.authorMahmoudian Mehrad
dc.contributor.authorVenäläinen Mikko S
dc.contributor.authorKlén Riku
dc.contributor.authorElo Laura L
dc.contributor.organizationfi=Turun biotiedekeskus|en=Turku Bioscience Centre|
dc.contributor.organizationfi=biolääketieteen laitos|en=Institute of Biomedicine|
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.18586209670
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.converis.publication-id66616134
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/66616134
dc.date.accessioned2025-08-28T03:40:49Z
dc.date.available2025-08-28T03:40:49Z
dc.description.abstract<p>Motivation: The emergence of datasets with tens of thousands of features, such as high-throughput omics biomedical data, highlights the importance of reducing the feature space into a distilled subset that can truly capture the signal for research and industry by aiding in finding more effective biomarkers for the question in hand. A good feature set also facilitates building robust predictive models with improved interpretability and convergence of the applied method due to the smaller feature space. <br></p><p>Results: Here, we present a robust feature selection method named Stable Iterative Variable Selection (SIVS) and assess its performance over both omics and clinical data types. As a performance assessment metric, we compared the number and goodness of the selected feature using SIVS to those selected by Least Absolute Shrinkage and Selection Operator regression. The results suggested that the feature space selected by SIVS was, on average, 41% smaller, without having a negative effect on the model performance. A similar result was observed for comparison with Boruta and caret RFE. <br></p><p>Availability and implementation: The method is implemented as an R package under GNU General Public License v3.0 and is accessible via Comprehensive R Archive Network (CRAN) via https://cran.r-project.org/package¼sivs. <br></p><p>Contact: laura.elo@utu.fi <br></p><p>Supplementary information: Supplementary data are available at Bioinformatics online.<br></p>
dc.identifier.eissn1367-4811
dc.identifier.jour-issn1367-4803
dc.identifier.olddbid210987
dc.identifier.oldhandle10024/194014
dc.identifier.urihttps://www.utupub.fi/handle/11111/56833
dc.identifier.urnURN:NBN:fi-fe2021100750325
dc.language.isoen
dc.okm.affiliatedauthorMahmoudian, Mehrad
dc.okm.affiliatedauthorVenäläinen, Mikko
dc.okm.affiliatedauthorKlén, Riku
dc.okm.affiliatedauthorElo, Laura
dc.okm.affiliatedauthorDataimport, Biolääketieteen laitoksen yhteiset
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline3111 Biomedicineen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline3111 Biolääketieteetfi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherOxford University Press
dc.publisher.countryUnited Kingdomen_GB
dc.publisher.countryBritanniafi_FI
dc.publisher.country-codeGB
dc.relation.doi10.1093/bioinformatics/btab501
dc.relation.ispartofjournalBioinformatics
dc.source.identifierhttps://www.utupub.fi/handle/10024/194014
dc.titleStable Iterative Variable Selection
dc.year.issued2021

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
btab501.pdf
Size:
1.18 MB
Format:
Adobe Portable Document Format
Description:
Publisher's PDF