Comparing Deterministic and Stochastic Reinforcement Learning for Glucose Regulation in Type 1 Diabetes

David Timms, Chirath Hettiarachchi*, Hanna Suominen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › peer-review

Abstract

Type 1 Diabetes (T1D) is a chronic condition affecting millions worldwide, requiring external insulin administration to regulate blood glucose levels and prevent serious complications. Artificial Pancreas Systems (APS) for managing T1D currently rely on manual input, which adds a cognitive burden on people with T1D and their carers. Research into alleviating this burden through Reinforcement Learning (RL) explores enabling the APS to autonomously learn and adapt to the complex dynamics of blood glucose regulation, and has demonstrated improvements in in-silico evaluations compared to traditional clinical approaches. This evaluation study compared the two main classes of RL algorithms for glucose regulation, namely stochastic (e.g., Proximal Policy Optimization (PPO)) and deterministic (e.g., Twin Delayed Deep Deterministic Policy Gradient (TD3)) algorithms, in-silico using quantitative and qualitative methods, patient-specific clinical metrics, and the adult and adolescent cohorts of the U.S. Food and Drug Administration approved UVA/PADOVA 2008 model. Although the behavior of TD3 was easier to interpret, it did not typically outperform PPO, which makes it challenging to assess their safety and suitability. This conclusion highlights the importance of improving RL algorithms in APS applications for both interpretability and predictive performance in future research.

Keywords: Artificial Pancreas, Deep Learning, Evaluation Study, Type 1 Diabetes

Original language: English
Title of host publication: MEDINFO 2025 - Healthcare Smart x Medicine Deep
Subtitle of host publication: Proceedings of the 20th World Congress on Medical and Health Informatics
Editors: Mowafa S. Househ, Zain Ul Abideen Tariq, Mahmood Al-Zubaidi, Uzair Shah, Elaine Huesing
Publisher: IOS Press BV
Pages: 1039-1043
Number of pages: 5
ISBN (Electronic): 9781643686080
DOIs
Publication status: Published - 7 Aug 2025
Event: 20th World Congress on Medical and Health Informatics, MEDINFO 2025 - Taipei, Taiwan
Duration: 9 Aug 2025 to 13 Aug 2025

Publication series

Name: Studies in Health Technology and Informatics
Volume: 329
ISSN (Print): 0926-9630
ISSN (Electronic): 1879-8365

Conference

Conference: 20th World Congress on Medical and Health Informatics, MEDINFO 2025
Country/Territory: Taiwan
City: Taipei
Period: 9/08/25 to 13/08/25
