Stable Iterative Variable Selection
| dc.contributor.author | Mahmoudian Mehrad | |
| dc.contributor.author | Venäläinen Mikko S | |
| dc.contributor.author | Klén Riku | |
| dc.contributor.author | Elo Laura L | |
| dc.contributor.organization | fi=Turun biotiedekeskus|en=Turku Bioscience Centre| | |
| dc.contributor.organization | fi=biolääketieteen laitos|en=Institute of Biomedicine| | |
| dc.contributor.organization | fi=tietotekniikan laitos|en=Department of Computing| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.18586209670 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.85312822902 | |
| dc.converis.publication-id | 66616134 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/66616134 | |
| dc.date.accessioned | 2025-08-28T03:40:49Z | |
| dc.date.available | 2025-08-28T03:40:49Z | |
| dc.description.abstract | <p>Motivation: The emergence of datasets with tens of thousands of features, such as high-throughput omics biomedical data, highlights the importance of reducing the feature space into a distilled subset that can truly capture the signal for research and industry by aiding in finding more effective biomarkers for the question in hand. A good feature set also facilitates building robust predictive models with improved interpretability and convergence of the applied method due to the smaller feature space. <br></p><p>Results: Here, we present a robust feature selection method named Stable Iterative Variable Selection (SIVS) and assess its performance over both omics and clinical data types. As a performance assessment metric, we compared the number and goodness of the selected feature using SIVS to those selected by Least Absolute Shrinkage and Selection Operator regression. The results suggested that the feature space selected by SIVS was, on average, 41% smaller, without having a negative effect on the model performance. A similar result was observed for comparison with Boruta and caret RFE. <br></p><p>Availability and implementation: The method is implemented as an R package under GNU General Public License v3.0 and is accessible via Comprehensive R Archive Network (CRAN) via https://cran.r-project.org/package¼sivs. <br></p><p>Contact: laura.elo@utu.fi <br></p><p>Supplementary information: Supplementary data are available at Bioinformatics online.<br></p> | |
| dc.identifier.eissn | 1367-4811 | |
| dc.identifier.jour-issn | 1367-4803 | |
| dc.identifier.olddbid | 210987 | |
| dc.identifier.oldhandle | 10024/194014 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/56833 | |
| dc.identifier.urn | URN:NBN:fi-fe2021100750325 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Mahmoudian, Mehrad | |
| dc.okm.affiliatedauthor | Venäläinen, Mikko | |
| dc.okm.affiliatedauthor | Klén, Riku | |
| dc.okm.affiliatedauthor | Elo, Laura | |
| dc.okm.affiliatedauthor | Dataimport, Biolääketieteen laitoksen yhteiset | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 3111 Biomedicine | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.discipline | 3111 Biolääketieteet | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A1 ScientificArticle | |
| dc.publisher | Oxford University Press | |
| dc.publisher.country | United Kingdom | en_GB |
| dc.publisher.country | Britannia | fi_FI |
| dc.publisher.country-code | GB | |
| dc.relation.doi | 10.1093/bioinformatics/btab501 | |
| dc.relation.ispartofjournal | Bioinformatics | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/194014 | |
| dc.title | Stable Iterative Variable Selection | |
| dc.year.issued | 2021 |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- btab501.pdf
- Size:
- 1.18 MB
- Format:
- Adobe Portable Document Format
- Description:
- Publisher's PDF