Optimal policy learning for COVID-19 prevention using reinforcement learning

被引:18
|
作者
Uddin, M. Irfan [1 ]
Ali Shah, Syed Atif [2 ,3 ]
Al-Khasawneh, Mahmoud Ahmad [3 ]
Alarood, Ala Abdulsalam [4 ]
Alsolami, Eesa [5 ]
机构
[1] Kohat Univ Sci & Technol, Inst Comp, Kohat 26000, Pakistan
[2] Northern Univ, Fac Engn & Informat Technol, Khyber Pakhtunkhwa, Pakistan
[3] Al Madinah Int Univ, Fac Comp & Informat Technol, Kuala Lumpur, Malaysia
[4] Univ Jeddah, Coll Comp Sci & Engn, Jeddah, Saudi Arabia
[5] Univ Jeddah, Coll Comp Sci & Engn, Dept Cyber Secur, Jeddah, Saudi Arabia
关键词
Reinforcement learning; COVID-19; prevention; policy learning;
D O I
10.1177/0165551520959798
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
COVID-19 has changed the lifestyle of many people due to its rapid human-to-human transmission. The spread started at the end of January 2020, and different countries used different approaches in terms of testing, sanitization, lock down and quarantine centres to control the spread of the virus. People are getting back to working and routine life activities with new normal standards of testing, sanitization, social distancing and lock down. People are regularly tested to identify those who are infected with COVID-19 and isolate them from general public. However, testing all people unnecessarily is an expensive operation in terms of resources usage. There must be an optimal policy to test only those who have higher chances of being COVID-19 positive. Similarly, sanitization is used for individuals and streets to disinfect people and places. However, sanitization is also an expensive operation in terms of resources, and it is not possible to disinfect each and every individual and street. Social separating or lock down or quarantine centres focuses are different methodologies that are utilised to control the human-to-human transmission of the infection and separate the individuals who are contaminated with COVID-19. However, lock down and quarantine centres are expensive operations in terms of resources as it disturbs the affairs of state and the growth of economy. At the same time, it negatively affects the quality of life of a society. It is also not possible to provide resources to all citizens by locking them inside homes or quarantine centres for infinite time. All these parameters are expensive in terms of resources and have an effect on controlling the spread of the virus, quality of life of human, resources and economy. In this article, a novel intelligent method based on reinforcement learning (RL) is built up that quantifies the unique levels of testing, disinfection and lock down alongside its impact on the spread of the infection, personal satisfaction or quality of life, resource use and economy. Different RL algorithms are actualized and agents are prepared with these algorithms to interact with the environment to gain proficiency with the best strategy. The examinations exhibit that deep learning-based algorithms, for example, DQN and DDPG are performing better than customary RL algorithms, for example, Q-Learning and SARSA.
引用
下载
收藏
页码:336 / 348
页数:13
相关论文
共 50 条
  • [1] Optimal Policy Learning for Disease Prevention Using Reinforcement Learning
    Alam Khan, Zahid
    Feng, Zhengyong
    Uddin, M. Irfan
    Mast, Noor
    Ali Shah, Syed Atif
    Imtiaz, Muhammad
    Al-Khasawneh, Mahmoud Ahmad
    Mahmoud, Marwan
    SCIENTIFIC PROGRAMMING, 2020, 2020
  • [2] COVID-19 Vaccine Distribution Policy Design with Reinforcement Learning
    Tan, Pu
    2021 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN IMAGE PROCESSING, ICAIP 2021, 2021, : 103 - 108
  • [3] Modeling and control of COVID-19 disease using deep reinforcement learning method
    Ghazizadeh, Nazanin
    Taghvaei, Sajjad
    Haghpanah, Seyyed Arash
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, : 3653 - 3670
  • [4] The role of policy learning in explaining COVID-19 policy changes
    Wang, Chan
    REVIEW OF POLICY RESEARCH, 2023,
  • [5] Reinforcement learning based framework for COVID-19 resource allocation
    Zong, Kai
    Luo, Cuicui
    COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 167
  • [6] Towards Using Deep Reinforcement Learning for Better COVID-19 Vaccine Distribution Strategies
    TRAD, Fouad
    EL FALOU, Salah
    2022 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MACHINE LEARNING APPLICATIONS (CDMA 2022), 2022, : 7 - 12
  • [7] COVID-19 vaccine incentive scheduling using an optimally controlled reinforcement learning model
    Stuckey, K.
    Newton, P. K.
    PHYSICA D-NONLINEAR PHENOMENA, 2023, 445
  • [8] Optimal control strategy for COVID-19 concerning both life and economy based on deep reinforcement learning
    Deng, Wei
    Qi, Guoyuan
    Yu, Xinchen
    CHINESE PHYSICS B, 2021, 30 (12)
  • [9] Optimal control strategy for COVID-19 concerning both life and economy based on deep reinforcement learning
    邓为
    齐国元
    蔚昕晨
    Chinese Physics B, 2021, (12) : 25 - 37
  • [10] Learning From COVID-19: Prevention Is a Strategic Principle, Not an Option
    Saracci, Rodolfo
    AMERICAN JOURNAL OF PUBLIC HEALTH, 2020, 110 (12) : 1803 - 1804