Reinforcement Learning Models and Algorithms for Diabetes Management

被引:11
|
作者
Yau, Kok-Lim Alvin [1 ]
Chong, Yung-Wey [2 ]
Fan, Xiumei [3 ]
Wu, Celimuge [4 ]
Saleem, Yasir [5 ]
Lim, Phei-Ching [6 ,7 ]
机构
[1] Univ Tunku Abdul Rahman UTAR, Lee Kong Chian Fac Engn & Sci LKCFES, Kajang 47500, Selangor, Malaysia
[2] Univ Sains Malaysia USM, Natl Adv IPv6 Ctr, Gelugor 11800, Penang, Malaysia
[3] Xian Univ Technol, Sch Automat & Informat Engn, Xian 710048, Shanxi, Peoples R China
[4] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[5] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
[6] Hosp Pulau Pinang, Dept Pharm, George Town 11090, Penang, Malaysia
[7] Univ Sains Malaysia USM, Sch Pharmaceut Sci, Gelugor 11800, Penang, Malaysia
关键词
Diabetes; Glucose; Blood; Insulin; Reinforcement learning; Data models; Deep learning; Multi-agent systems; Q-learning; Actor-critic reinforcement learning; applied reinforcement learning; deep Q-network; deep reinforcement learning; diabetes; Markov decision process; multi-agent reinforcement learning; reinforcement learning; BLOOD-GLUCOSE VARIABILITY; INSULIN DELIVERY; PREDICTIVE CONTROL; MINIMAL MODEL; SECRETION; ACCURACY; BEHAVIOR; OUTCOMES; SYSTEM; SAFETY;
D O I
10.1109/ACCESS.2023.3259425
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancements in reinforcement learning (RL), new variants of this artificial intelligence approach have been introduced in the literature. This has led to increased interest in using RL to address complex issues in diabetes management. Using RL, a decision maker (or agent) observes decision-making factors (or state) from the dynamic operating environment, selects actions, and subsequently receives delayed rewards. The agent adapts its actions to changes in the operating environment to maximize its cumulative reward and improve system performance. This paper presents how various variants of RL have been used to improve diabetes management, such as a higher time in range during which the blood glucose level is within the normal range and a higher similarity between RL and physician's policies. Key highlights focus on the application of RL in diabetes management, including a taxonomy of the attributes of RL (e.g., roles and advantages), essential elements for training (e.g., data and simulators), representations of diabetes attributes in RL models, and variants of RL algorithms. In addition, this paper discusses open issues and potential future developments in the use of RL in diabetes management.
引用
收藏
页码:28391 / 28415
页数:25
相关论文
共 50 条
  • [31] Reinforcement learning algorithms: A brief survey
    Shakya, Ashish Kumar
    Pillai, Gopinatha
    Chakrabarty, Sohom
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [32] Emphatic Algorithms for Deep Reinforcement Learning
    Jiang, Ray
    Zahavy, Tom
    Xu, Zhongwen
    White, Adam
    Hessel, Matteo
    Blundell, Charles
    van Hasselt, Hado
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [33] Distributed Neural Learning Algorithms for Multiagent Reinforcement Learning
    Dai, Pengcheng
    Liu, Hongzhe
    Yu, Wenwu
    Wang, He
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23) : 21039 - 21060
  • [34] Reinforcement Learning for Topic Models
    Costello, Jeremy
    Reformat, Marek Z.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4332 - 4351
  • [35] Compositional Models for Reinforcement Learning
    Jong, Nicholas K.
    Stone, Peter
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 644 - 659
  • [36] Comparison of Deep Reinforcement Learning Algorithms in Data Center Cooling Management: A Case Study
    Hua, Tianyang
    Wan, Jianxiong
    Jaffry, Shan
    Rasheed, Zeeshan
    Li, Leixiao
    Ma, Zhiqiang
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 392 - 397
  • [37] Integrating Reinforcement Learning with Models of Representation Learning
    Jones, Matt
    Canas, Fabian
    COGNITION IN FLUX, 2010, : 1258 - 1263
  • [38] Deep learning, reinforcement learning, and world models
    Matsuo, Yutaka
    LeCun, Yann
    Sahani, Maneesh
    Precup, Doina
    Silver, David
    Sugiyama, Masashi
    Uchibe, Eiji
    Morimoto, Jun
    NEURAL NETWORKS, 2022, 152 : 267 - 275
  • [39] Shared Trained Models Selection and Management for Transfer Reinforcement Learning in Open IoT
    Noguchi, Hirofumi
    Isoda, Takuma
    Arai, Seisuke
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2170 - 2176
  • [40] Deep Reinforcement Learning Algorithms in Intelligent Infrastructure
    Serrano, Will
    INFRASTRUCTURES, 2019, 4 (03)