Reinforcement Learning Models and Algorithms for Diabetes Management

被引：11

作者：

Yau, Kok-Lim Alvin ^{[1
]}

Chong, Yung-Wey ^{[2
]}

Fan, Xiumei ^{[3
]}

Wu, Celimuge ^{[4
]}

Saleem, Yasir ^{[5
]}

Lim, Phei-Ching ^{[6
,7
]}

机构：

[1] Univ Tunku Abdul Rahman UTAR, Lee Kong Chian Fac Engn & Sci LKCFES, Kajang 47500, Selangor, Malaysia

[2] Univ Sains Malaysia USM, Natl Adv IPv6 Ctr, Gelugor 11800, Penang, Malaysia

[3] Xian Univ Technol, Sch Automat & Informat Engn, Xian 710048, Shanxi, Peoples R China

[4] Univ Electrocommun, Grad Sch Informat & Engn, Tokyo 1828585, Japan

[5] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales

[6] Hosp Pulau Pinang, Dept Pharm, George Town 11090, Penang, Malaysia

[7] Univ Sains Malaysia USM, Sch Pharmaceut Sci, Gelugor 11800, Penang, Malaysia

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Diabetes; Glucose; Blood; Insulin; Reinforcement learning; Data models; Deep learning; Multi-agent systems; Q-learning; Actor-critic reinforcement learning; applied reinforcement learning; deep Q-network; deep reinforcement learning; diabetes; Markov decision process; multi-agent reinforcement learning; reinforcement learning; BLOOD-GLUCOSE VARIABILITY; INSULIN DELIVERY; PREDICTIVE CONTROL; MINIMAL MODEL; SECRETION; ACCURACY; BEHAVIOR; OUTCOMES; SYSTEM; SAFETY;

D O I：

10.1109/ACCESS.2023.3259425

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the advancements in reinforcement learning (RL), new variants of this artificial intelligence approach have been introduced in the literature. This has led to increased interest in using RL to address complex issues in diabetes management. Using RL, a decision maker (or agent) observes decision-making factors (or state) from the dynamic operating environment, selects actions, and subsequently receives delayed rewards. The agent adapts its actions to changes in the operating environment to maximize its cumulative reward and improve system performance. This paper presents how various variants of RL have been used to improve diabetes management, such as a higher time in range during which the blood glucose level is within the normal range and a higher similarity between RL and physician's policies. Key highlights focus on the application of RL in diabetes management, including a taxonomy of the attributes of RL (e.g., roles and advantages), essential elements for training (e.g., data and simulators), representations of diabetes attributes in RL models, and variants of RL algorithms. In addition, this paper discusses open issues and potential future developments in the use of RL in diabetes management.

引用

页码：28391 / 28415

页数：25

共 50 条

[41] Improved SARSA and DQN algorithms for reinforcement learning
Yao, Guangyu
Zhang, Nan
Duan, Zhenhua
Tian, Cong
THEORETICAL COMPUTER SCIENCE, 2025, 1027
[42] Formalizing the ant algorithms in terms of reinforcement learning
Nowé, A
Verbeeck, K
ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 1999, 1674 : 616 - 620
[43] Integrating reinforcement learning, bidding and genetic algorithms
Qi, DH
Sun, R
IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 53 - 59
[44] EPOCH-INCREMENTAL REINFORCEMENT LEARNING ALGORITHMS
Zajdel, Roman
INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2013, 23 (03) : 623 - 635
[45] Parallelization of Reinforcement Learning Algorithms for Video Games
Kopel, Marek
Szczurek, Witold
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 195 - 207
[46] Universal Reinforcement Learning Algorithms: Survey and Experiments
Aslanides, John
Leike, Jan
Hutter, Marcus
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1403 - 1410
[47] Application of Reinforcement Learning in Dynamic Pricing Algorithms
Wang Jintian
Zhou Lei
2009 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND LOGISTICS ( ICAL 2009), VOLS 1-3, 2009, : 419 - 423
[48] Offline Evaluation of Online Reinforcement Learning Algorithms
Mandel, Travis
Liu, Yun-En
Brunskill, Emma
Popovic, Zoran
THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1926 - 1933
[49] Reinforcement Learning Algorithms with Selector, Tuner, or Estimator
Ala’eddin Masadeh
Zhengdao Wang
Ahmed E. Kamal
Arabian Journal for Science and Engineering, 2024, 49 : 4081 - 4095
[50] Reinforcement learning for online control of evolutionary algorithms
Eiben, A. E.
Horvath, Mark
Kowalczyk, Wojtek
Schut, Martijn C.
ENGINEERING SELF-ORGANISING SYSTEMS, 2007, 4335 : 151 - +

← 1 2 3 4 5 →