Reinforcement Learning Models and Algorithms for Diabetes Management

Cited by: 11
Authors
Yau, Kok-Lim Alvin [1]
Chong, Yung-Wey [2]
Fan, Xiumei [3]
Wu, Celimuge [4]
Saleem, Yasir [5]
Lim, Phei-Ching [6, 7]
Affiliations
[1] Univ Tunku Abdul Rahman UTAR, Lee Kong Chian Fac Engn & Sci LKCFES, Kajang 47500, Selangor, Malaysia
[2] Univ Sains Malaysia USM, Natl Adv IPv6 Ctr, Gelugor 11800, Penang, Malaysia
[3] Xian Univ Technol, Sch Automat & Informat Engn, Xian 710048, Shaanxi, Peoples R China
[4] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[5] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
[6] Hosp Pulau Pinang, Dept Pharm, George Town 11090, Penang, Malaysia
[7] Univ Sains Malaysia USM, Sch Pharmaceut Sci, Gelugor 11800, Penang, Malaysia
Keywords
Diabetes; Glucose; Blood; Insulin; Reinforcement learning; Data models; Deep learning; Multi-agent systems; Q-learning; Actor-critic reinforcement learning; applied reinforcement learning; deep Q-network; deep reinforcement learning; diabetes; Markov decision process; multi-agent reinforcement learning; reinforcement learning; BLOOD-GLUCOSE VARIABILITY; INSULIN DELIVERY; PREDICTIVE CONTROL; MINIMAL MODEL; SECRETION; ACCURACY; BEHAVIOR; OUTCOMES; SYSTEM; SAFETY
DOI
10.1109/ACCESS.2023.3259425
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the advancements in reinforcement learning (RL), new variants of this artificial intelligence approach have been introduced in the literature. This has led to increased interest in using RL to address complex issues in diabetes management. Using RL, a decision maker (or agent) observes decision-making factors (or state) from the dynamic operating environment, selects actions, and subsequently receives delayed rewards. The agent adapts its actions to changes in the operating environment to maximize its cumulative reward and improve system performance. This paper presents how various variants of RL have been used to improve diabetes management, such as a higher time in range during which the blood glucose level is within the normal range and a higher similarity between RL and physician's policies. Key highlights focus on the application of RL in diabetes management, including a taxonomy of the attributes of RL (e.g., roles and advantages), essential elements for training (e.g., data and simulators), representations of diabetes attributes in RL models, and variants of RL algorithms. In addition, this paper discusses open issues and potential future developments in the use of RL in diabetes management.
Pages: 28391-28415
Number of pages: 25