Exploring optimal control of epidemic spread using reinforcement learning

被引:0
|
作者
Abu Quwsar Ohi
M. F. Mridha
Muhammad Mostafa Monowar
Md. Abdul Hamid
机构
[1] Bangladesh University of Business and Technology,Department of Computer Science and Engineering
[2] King Abdulaziz University,Department of Information Technology, Faculty of Computing and Information Technology
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Pandemic defines the global outbreak of a disease having a high transmission rate. The impact of a pandemic situation can be lessened by restricting the movement of the mass. However, one of its concomitant circumstances is an economic crisis. In this article, we demonstrate what actions an agent (trained using reinforcement learning) may take in different possible scenarios of a pandemic depending on the spread of disease and economic factors. To train the agent, we design a virtual pandemic scenario closely related to the present COVID-19 crisis. Then, we apply reinforcement learning, a branch of artificial intelligence, that deals with how an individual (human/machine) should interact on an environment (real/virtual) to achieve the cherished goal. Finally, we demonstrate what optimal actions the agent perform to reduce the spread of disease while considering the economic factors. In our experiment, we let the agent find an optimal solution without providing any prior knowledge. After training, we observed that the agent places a long length lockdown to reduce the first surge of a disease. Furthermore, the agent places a combination of cyclic lockdowns and short length lockdowns to halt the resurgence of the disease. Analyzing the agent’s performed actions, we discover that the agent decides movement restrictions not only based on the number of the infectious population but also considering the reproduction rate of the disease. The estimation and policy of the agent may improve the human-strategy of placing lockdown so that an economic crisis may be avoided while mitigating an infectious disease.
引用
收藏
相关论文
共 50 条
  • [31] Output Feedback Optimal Tracking Control Using Reinforcement Q-Learning
    Rizvi, Syed Ali Asad
    Lin, Zongli
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3423 - 3428
  • [32] Robust Optimal Control of Continuous Time Linear System using Reinforcement Learning
    Sami, Abdul
    Memon, Attaullah Y.
    2018 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2018, : 154 - 159
  • [33] Deep Reinforcement Learning for Time Optimal Velocity Control using Prior Knowledge
    Hartmann, Gabriel
    Shiller, Zvi
    Azaria, Amos
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 186 - 193
  • [34] Robust flow control and optimal sensor placement using deep reinforcement learning
    Paris, Romain
    Beneddine, Samir
    Dandois, Julien
    JOURNAL OF FLUID MECHANICS, 2021, 913
  • [35] Optimal Drug Dosage Control Strategy of Immune Systems Using Reinforcement Learning
    Chen, Lin
    Zhang, Yong-Wei
    Zhang, Shun-Chao
    IEEE ACCESS, 2023, 11 : 1269 - 1279
  • [36] Robust Optimal Well Control using an Adaptive Multigrid Reinforcement Learning Framework
    Atish Dixit
    Ahmed H. Elsheikh
    Mathematical Geosciences, 2023, 55 : 345 - 375
  • [37] Optimal control for An Active Phase Change Material System Using Reinforcement Learning
    Ebrahimpour, Misagh
    Santoro, Bruno
    Yu, Wei
    Young, Brent
    Farid, Mohammed
    2022 IEEE INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP 2022), 2022, : 67 - 72
  • [38] OPTIMAL WIRELESS RATE AND POWER CONTROL IN THE PRESENCE OF JAMMERS USING REINFORCEMENT LEARNING
    Raji, Fadlullah
    Miao, Lei
    arXiv, 2022,
  • [39] Optimal tracking control of mechatronic servo system using integral reinforcement learning
    Chen, Wei
    Hu, Jian
    Xu, Chenchen
    Zhou, Haibo
    Yao, Jianyong
    Nie, Weirong
    INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (12) : 3072 - 3082
  • [40] Precise Mobility Intervention for Epidemic Control Using Unobservable Information via Deep Reinforcement Learning
    Feng, Tao
    Xia, Tong
    Fan, Xiaochen
    Wang, Huandong
    Zong, Zefang
    Li, Yong
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2882 - 2892