Optimal Drug Dosage Control Strategy of Immune Systems Using Reinforcement Learning

被引:2
|
作者
Chen, Lin [1 ]
Zhang, Yong-Wei [2 ]
Zhang, Shun-Chao [3 ]
机构
[1] Sun Yat Sen Univ, Affiliated Hosp 7, Sci Res Ctr, Shenzhen 518107, Peoples R China
[2] Guangdong Univ Technol, Sch Automation, Guangzhou 510006, Peoples R China
[3] Guangdong Univ Finance, Sch Internet Finance & Informat Engn, Guangzhou 510521, Peoples R China
关键词
Reinforcement learning; immune systems; immunotherapy; drug dosage control; robust control; neural networks; ZERO-SUM GAMES; TRACKING CONTROL; HJB SOLUTION; CANCER; HALLMARKS;
D O I
10.1109/ACCESS.2022.3233567
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a reinforcement learning-based drug dosage control strategy is developed for immune systems with input constraints and dynamic uncertainties to sustain the number of tumor and immune cells in an acceptable level. First of all, the state of the immune system and the desired number of tumor and immune cells are constructed into an augmented state to derive an augmented immune system. By designing a discounted non-quadratic performance index function, the robust tracking control problem of immune systems with uncertainties is transformed into an optimal tracking control problem of nominal immune systems and the drug dosage can be limited within the specified range. Hereafter, a reinforcement learning algorithm and a critic-only structure are adopted to acquire the approximate optimal drug dosage control strategy. Furthermore, theoretical proof reveals that the proposed reinforcement learning-based drug dosage control strategy ensures the number of tumor and immune cells reaches the preset level under limited drug dosages and model uncertainties. Finally, simulation study verifies the availability of the developed drug dosage control strategy in different growth models of tumor cell.
引用
收藏
页码:1269 / 1279
页数:11
相关论文
共 50 条
  • [21] Reinforcement Learning-Based Adaptive Optimal Control for Partially Unknown Systems Using Differentiator
    Guo, Xinxin
    Yan, Weisheng
    Cui, Rongxin
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 1039 - 1044
  • [22] Stochastic LQ optimal control for Markov jumping systems with multiplicative noise using reinforcement learning
    Ye, Linwei
    Zhao, Zhonggai
    Liu, Fei
    SYSTEMS & CONTROL LETTERS, 2024, 186
  • [23] Optimal Control for Multi-agent Systems Using Off-Policy Reinforcement Learning
    Wang, Hao
    Chen, Zhiru
    Wang, Jun
    Lu, Lijun
    Li, Mingzhe
    2022 4TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS, ICCR, 2022, : 135 - 140
  • [24] Research on Optimal Control Strategy of Distributed Photovoltaic Based on Deep Reinforcement Learning
    Dai, Zhiqiang
    Xu, Yunuo
    Hu, Wei
    Wang, Haitao
    Lin, Kai
    Li, Binghui
    Guo, Qiuting
    Pei, Xun
    2023 2ND ASIAN CONFERENCE ON FRONTIERS OF POWER AND ENERGY, ACFPE, 2023, : 458 - 462
  • [25] Constrained adaptive optimal control using a reinforcement learning agent
    Lin, Wei-Song
    Zheng, Chen-Hong
    AUTOMATICA, 2012, 48 (10) : 2614 - 2619
  • [26] Optimal Balancing Control of Bipedal Robots Using Reinforcement Learning
    Peng, Fang
    Ding, Lijia
    Li, Zhijun
    Yang, Chenguang
    Su, Chun-Yi
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2186 - 2191
  • [27] Exploring optimal control of epidemic spread using reinforcement learning
    Abu Quwsar Ohi
    M. F. Mridha
    Muhammad Mostafa Monowar
    Md. Abdul Hamid
    Scientific Reports, 10
  • [28] Exploring optimal control of epidemic spread using reinforcement learning
    Ohi, Abu Quwsar
    Mridha, M. F.
    Monowar, Muhammad Mostafa
    Hamid, Md. Abdul
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [29] Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning
    Chen, Pengzhan
    He, Zhiqiang
    Chen, Chuanxi
    Xu, Jiahong
    ALGORITHMS, 2018, 11 (05):
  • [30] Reinforcement learning-based optimal control of uncertain nonlinear systems
    Garcia, Miguel
    Dong, Wenjie
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (12) : 2839 - 2850