EpiSmokEr2: a robust epigenetic classifier for smoking status inference using Illumina EPIC methylation data
| dc.contributor.author | Zhu, Tianyu | |
| dc.contributor.author | Faragó, Teodóra | |
| dc.contributor.author | Bollepalli, Sailalitha | |
| dc.contributor.author | Heikkinen, Aino | |
| dc.contributor.author | Hukkanen, Mikaela | |
| dc.contributor.author | Raitakari, Olli | |
| dc.contributor.author | Lehtimäki, Terho | |
| dc.contributor.author | Korhonen, Tellervo | |
| dc.contributor.author | Kaprio, Jaakko | |
| dc.contributor.author | Fang, Fang | |
| dc.contributor.author | Lawrence, Kaitlyn G. | |
| dc.contributor.author | Sandler, Dale P. | |
| dc.contributor.author | Roberts Spildrejorde, Mari | |
| dc.contributor.author | Gervin, Kristina | |
| dc.contributor.author | Pan, Yanyu | |
| dc.contributor.author | Costeira, Ricardo | |
| dc.contributor.author | Bell, Jordana T. | |
| dc.contributor.author | Ollikainen, Miina | |
| dc.contributor.organization | fi=tyks, vsshp|en=tyks, varha| | |
| dc.contributor.organization | fi=väestötutkimuskeskus|en=Centre for Population Health Research (POP Centre)| | |
| dc.contributor.organization | fi=InFLAMES Lippulaiva|en=InFLAMES Flagship| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.42471027641 | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.68445910604 | |
| dc.converis.publication-id | 515686691 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/515686691 | |
| dc.date.accessioned | 2026-04-24T17:32:52Z | |
| dc.description.abstract | Aim: Tobacco smoking induces persistent DNA methylation (DNAm) changes in blood that can serve as long-term biomarkers for smoking exposure. We aimed to develop and validate a DNAm classifier of smoking status using Illumina EPIC array data. Methods: We built Epigenetic Smoking status Estimator2 (EpiSmokEr2), a Least Absolute Shrinkage and Selection Operator (LASSO) regression-based DNAm classifier using 511 CpGs from Illumina Infinium MethylationEPIC array (EPIC) data. The model was trained on 1343 samples from the Young Finns Study cohort and validated across six independent datasets from four cohorts and two array platforms (EPIC and EPICv2). Results: EpiSmokEr2 achieved an average sensitivity of 0.87 and specificity of 0.86 in distinguishing current from never smokers. Predicted smoking status correlated strongly with established DNAm smoking scores and GrimAge, indicating its ability to capture biologically relevant smoking effects. Simulation analysis showed EpiSmokEr2 was robust for up to 10% missing CpGs. Conclusion: EpiSmokEr2 provides a reliable DNAm-based estimator of smoking status. It is available as an open-source R package on GitHub, facilitating broad use in epidemiological and clinical research. | |
| dc.format.pagerange | 215 | |
| dc.format.pagerange | 205 | |
| dc.identifier.eissn | 1750-192X | |
| dc.identifier.jour-issn | 1750-1911 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/58983 | |
| dc.identifier.url | https://doi.org/10.1080/17501911.2026.2630841 | |
| dc.identifier.urn | URN:NBN:fi-fe2026042332981 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Raitakari, Olli | |
| dc.okm.affiliatedauthor | Dataimport, tyks, vsshp | |
| dc.okm.discipline | 1184 Genetics, developmental biology, physiology | en_GB |
| dc.okm.discipline | 1184 Genetiikka, kehitysbiologia, fysiologia | fi_FI |
| dc.okm.discipline | 3142 Public health care science, environmental and occupational health | en_GB |
| dc.okm.discipline | 3142 Kansanterveystiede, ympäristö ja työterveys | fi_FI |
| dc.okm.internationalcopublication | international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A1 ScientificArticle | |
| dc.publisher | Future Medicine Ltd. | |
| dc.publisher.country | United Kingdom | en_GB |
| dc.publisher.country | Britannia | fi_FI |
| dc.publisher.country-code | GB | |
| dc.relation.doi | 10.1080/17501911.2026.2630841 | |
| dc.relation.ispartofjournal | Epigenomics | |
| dc.relation.issue | 2 | |
| dc.relation.volume | 18 | |
| dc.title | EpiSmokEr2: a robust epigenetic classifier for smoking status inference using Illumina EPIC methylation data | |
| dc.year.issued | 2026 |
Files
- Name: EpiSmokEr2 a robust epigenetic classifier for smoking status inference using Illumina EPIC methylation data.pdf
- Size: 5.54 MB
- Format: Adobe Portable Document Format