Exploring optimal control of epidemic spread using reinforcement learning

被引：0

作者：

Abu Quwsar Ohi

M. F. Mridha

Muhammad Mostafa Monowar

Md. Abdul Hamid

机构：

[1] Bangladesh University of Business and Technology,Department of Computer Science and Engineering

[2] King Abdulaziz University,Department of Information Technology, Faculty of Computing and Information Technology

来源：

Scientific Reports | / 10卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Pandemic defines the global outbreak of a disease having a high transmission rate. The impact of a pandemic situation can be lessened by restricting the movement of the mass. However, one of its concomitant circumstances is an economic crisis. In this article, we demonstrate what actions an agent (trained using reinforcement learning) may take in different possible scenarios of a pandemic depending on the spread of disease and economic factors. To train the agent, we design a virtual pandemic scenario closely related to the present COVID-19 crisis. Then, we apply reinforcement learning, a branch of artificial intelligence, that deals with how an individual (human/machine) should interact on an environment (real/virtual) to achieve the cherished goal. Finally, we demonstrate what optimal actions the agent perform to reduce the spread of disease while considering the economic factors. In our experiment, we let the agent find an optimal solution without providing any prior knowledge. After training, we observed that the agent places a long length lockdown to reduce the first surge of a disease. Furthermore, the agent places a combination of cyclic lockdowns and short length lockdowns to halt the resurgence of the disease. Analyzing the agent’s performed actions, we discover that the agent decides movement restrictions not only based on the number of the infectious population but also considering the reproduction rate of the disease. The estimation and policy of the agent may improve the human-strategy of placing lockdown so that an economic crisis may be avoided while mitigating an infectious disease.

引用

共 50 条

[31] Output Feedback Optimal Tracking Control Using Reinforcement Q-Learning
Rizvi, Syed Ali Asad
Lin, Zongli
2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 3423 - 3428
[32] Robust Optimal Control of Continuous Time Linear System using Reinforcement Learning
Sami, Abdul
Memon, Attaullah Y.
2018 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2018, : 154 - 159
[33] Deep Reinforcement Learning for Time Optimal Velocity Control using Prior Knowledge
Hartmann, Gabriel
Shiller, Zvi
Azaria, Amos
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 186 - 193
[34] Robust flow control and optimal sensor placement using deep reinforcement learning
Paris, Romain
Beneddine, Samir
Dandois, Julien
JOURNAL OF FLUID MECHANICS, 2021, 913
[35] Optimal Drug Dosage Control Strategy of Immune Systems Using Reinforcement Learning
Chen, Lin
Zhang, Yong-Wei
Zhang, Shun-Chao
IEEE ACCESS, 2023, 11 : 1269 - 1279
[36] Robust Optimal Well Control using an Adaptive Multigrid Reinforcement Learning Framework
Atish Dixit
Ahmed H. Elsheikh
Mathematical Geosciences, 2023, 55 : 345 - 375
[37] Optimal control for An Active Phase Change Material System Using Reinforcement Learning
Ebrahimpour, Misagh
Santoro, Bruno
Yu, Wei
Young, Brent
Farid, Mohammed
2022 IEEE INTERNATIONAL SYMPOSIUM ON ADVANCED CONTROL OF INDUSTRIAL PROCESSES (ADCONIP 2022), 2022, : 67 - 72
[38] OPTIMAL WIRELESS RATE AND POWER CONTROL IN THE PRESENCE OF JAMMERS USING REINFORCEMENT LEARNING
Raji, Fadlullah
Miao, Lei
arXiv, 2022,
[39] Optimal tracking control of mechatronic servo system using integral reinforcement learning
Chen, Wei
Hu, Jian
Xu, Chenchen
Zhou, Haibo
Yao, Jianyong
Nie, Weirong
INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (12) : 3072 - 3082
[40] Precise Mobility Intervention for Epidemic Control Using Unobservable Information via Deep Reinforcement Learning
Feng, Tao
Xia, Tong
Fan, Xiaochen
Wang, Huandong
Zong, Zefang
Li, Yong
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2882 - 2892

← 1 2 3 4 5 →