Reinforcement Learning Models and Algorithms for Diabetes Management

被引:11
|
作者
Yau, Kok-Lim Alvin [1 ]
Chong, Yung-Wey [2 ]
Fan, Xiumei [3 ]
Wu, Celimuge [4 ]
Saleem, Yasir [5 ]
Lim, Phei-Ching [6 ,7 ]
机构
[1] Univ Tunku Abdul Rahman UTAR, Lee Kong Chian Fac Engn & Sci LKCFES, Kajang 47500, Selangor, Malaysia
[2] Univ Sains Malaysia USM, Natl Adv IPv6 Ctr, Gelugor 11800, Penang, Malaysia
[3] Xian Univ Technol, Sch Automat & Informat Engn, Xian 710048, Shanxi, Peoples R China
[4] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan
[5] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
[6] Hosp Pulau Pinang, Dept Pharm, George Town 11090, Penang, Malaysia
[7] Univ Sains Malaysia USM, Sch Pharmaceut Sci, Gelugor 11800, Penang, Malaysia
关键词
Diabetes; Glucose; Blood; Insulin; Reinforcement learning; Data models; Deep learning; Multi-agent systems; Q-learning; Actor-critic reinforcement learning; applied reinforcement learning; deep Q-network; deep reinforcement learning; diabetes; Markov decision process; multi-agent reinforcement learning; reinforcement learning; BLOOD-GLUCOSE VARIABILITY; INSULIN DELIVERY; PREDICTIVE CONTROL; MINIMAL MODEL; SECRETION; ACCURACY; BEHAVIOR; OUTCOMES; SYSTEM; SAFETY;
D O I
10.1109/ACCESS.2023.3259425
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancements in reinforcement learning (RL), new variants of this artificial intelligence approach have been introduced in the literature. This has led to increased interest in using RL to address complex issues in diabetes management. Using RL, a decision maker (or agent) observes decision-making factors (or state) from the dynamic operating environment, selects actions, and subsequently receives delayed rewards. The agent adapts its actions to changes in the operating environment to maximize its cumulative reward and improve system performance. This paper presents how various variants of RL have been used to improve diabetes management, such as a higher time in range during which the blood glucose level is within the normal range and a higher similarity between RL and physician's policies. Key highlights focus on the application of RL in diabetes management, including a taxonomy of the attributes of RL (e.g., roles and advantages), essential elements for training (e.g., data and simulators), representations of diabetes attributes in RL models, and variants of RL algorithms. In addition, this paper discusses open issues and potential future developments in the use of RL in diabetes management.
引用
收藏
页码:28391 / 28415
页数:25
相关论文
共 50 条
  • [41] Improved SARSA and DQN algorithms for reinforcement learning
    Yao, Guangyu
    Zhang, Nan
    Duan, Zhenhua
    Tian, Cong
    THEORETICAL COMPUTER SCIENCE, 2025, 1027
  • [42] Formalizing the ant algorithms in terms of reinforcement learning
    Nowé, A
    Verbeeck, K
    ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 1999, 1674 : 616 - 620
  • [43] Integrating reinforcement learning, bidding and genetic algorithms
    Qi, DH
    Sun, R
    IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 53 - 59
  • [44] EPOCH-INCREMENTAL REINFORCEMENT LEARNING ALGORITHMS
    Zajdel, Roman
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 623 - 635
  • [45] Parallelization of Reinforcement Learning Algorithms for Video Games
    Kopel, Marek
    Szczurek, Witold
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 195 - 207
  • [46] Universal Reinforcement Learning Algorithms: Survey and Experiments
    Aslanides, John
    Leike, Jan
    Hutter, Marcus
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1403 - 1410
  • [47] Application of Reinforcement Learning in Dynamic Pricing Algorithms
    Wang Jintian
    Zhou Lei
    2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 419 - 423
  • [48] Offline Evaluation of Online Reinforcement Learning Algorithms
    Mandel, Travis
    Liu, Yun-En
    Brunskill, Emma
    Popovic, Zoran
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
  • [49] Reinforcement Learning Algorithms with Selector, Tuner, or Estimator
    Ala’eddin Masadeh
    Zhengdao Wang
    Ahmed E. Kamal
    Arabian Journal for Science and Engineering, 2024, 49 : 4081 - 4095
  • [50] Reinforcement learning for online control of evolutionary algorithms
    Eiben, A. E.
    Horvath, Mark
    Kowalczyk, Wojtek
    Schut, Martijn C.
    ENGINEERING SELF-ORGANISING SYSTEMS, 2007, 4335 : 151 - +