Comparing Deterministic and Stochastic Reinforcement Learning for Glucose Regulation in Type 1 Diabetes

dc.contributor.authorTimms, David
dc.contributor.authorHettiarachchi, Chirath
dc.contributor.authorSuominen, Hanna
dc.contributor.organizationfi=tietotekniikan laitos|en=Department of Computing|
dc.contributor.organization-code1.2.246.10.2458963.20.85312822902
dc.converis.publication-id499745855
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/499745855
dc.date.accessioned2026-01-21T14:48:39Z
dc.date.available2026-01-21T14:48:39Z
dc.description.abstractType 1 Diabetes (T1D) is a chronic condition affecting millions worldwide, requiring external insulin administration to regulate blood glucose levels and prevent serious complications. Artificial Pancreas Systems (APS) for managing T1D currently rely on manual input, which adds a cognitive burden on people with T1D and their carers. Research into alleviating this burden through Reinforcement Learning (RL) explores enabling the APS to autonomously learn and adapt to the complex dynamics of blood glucose regulation, demonstrating improvements in in-silico evaluations compared to traditional clinical approaches. This evaluation study compared the primary polarities of RL for glucose regulation, namely, stochastic (e.g., Proximal Policy Optimization (PPO) and deterministic (e.g., Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithms in-silico using quantitative and qualitative methods, patient specific clinical metrics, and the adult and adolescent cohorts of the U.S. Food and Drug Administration approved UVA/PADOVA 2008 model. Although the behavior of TD3 was easier to interpret, it did not typically outperform PPO, thereby challenging assessing their safety and suitability. This conclusion highlights the importance of improving RL algorithms in APS applications for both interpretability and predictive performance in future research.
dc.format.pagerange1039
dc.format.pagerange1043
dc.identifier.eisbn978-1-64368-608-0
dc.identifier.issn0926-9630
dc.identifier.jour-issn0926-9630
dc.identifier.olddbid213730
dc.identifier.oldhandle10024/196748
dc.identifier.urihttps://www.utupub.fi/handle/11111/55792
dc.identifier.urlhttps://doi.org/10.3233/shti250997
dc.identifier.urnURN:NBN:fi-fe202601216972
dc.language.isoen
dc.okm.affiliatedauthorSuominen, Hanna
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline217 Medical engineeringen_GB
dc.okm.discipline3121 Internal medicineen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline217 Lääketieteen tekniikkafi_FI
dc.okm.discipline3121 Sisätauditfi_FI
dc.okm.internationalcopublicationinternational co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA4 Conference Article
dc.publisher.countryNetherlandsen_GB
dc.publisher.countryAlankomaatfi_FI
dc.publisher.country-codeNL
dc.relation.conferenceWorld Congress on Medical and Health Informatics
dc.relation.doi10.3233/SHTI250997
dc.relation.ispartofjournalStudies in Health Technology and Informatics
dc.relation.volume329
dc.source.identifierhttps://www.utupub.fi/handle/10024/196748
dc.titleComparing Deterministic and Stochastic Reinforcement Learning for Glucose Regulation in Type 1 Diabetes
dc.title.bookMEDINFO 2025 — Healthcare Smart × Medicine Deep: Proceedings of the 20th World Congress on Medical and Health Informatics
dc.year.issued2025

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
SHTI-329-SHTI250997.pdf
Size:
393.7 KB
Format:
Adobe Portable Document Format