Deep Reinforcement Learning for Large-Scale Epidemic Control

被引:14
|
作者
Libin, Pieter J. K. [1 ,2 ,3 ]
Moonens, Arno [1 ]
Verstraeten, Timothy [1 ]
Perez-Sanjines, Fabian [1 ]
Hens, Niel [3 ]
Lemey, Philippe [2 ]
Nowe, Ann [1 ]
机构
[1] Vrije Univ Brussel, Brussels, Belgium
[2] Katholieke Univ Leuven, Leuven, Belgium
[3] Hasselt Univ, Hasselt, Belgium
基金
欧盟地平线“2020”;
关键词
Multi-agent systems; Epidemic control; Pandemic influenza; Deep reinforcement learning; PANDEMIC INFLUENZA; POLICIES; ENGLAND;
D O I
10.1007/978-3-030-67670-4_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Epidemics of infectious diseases are an important threat to public health and global economies. Yet, the development of prevention strategies remains a challenging process, as epidemics are non-linear and complex processes. For this reason, we investigate a deep reinforcement learning approach to automatically learn prevention strategies in the context of pandemic influenza. Firstly, we construct a new epidemiological meta-population model, with 379 patches (one for each administrative district in Great Britain), that adequately captures the infection process of pandemic influenza. Our model balances complexity and computational efficiency such that the use of reinforcement learning techniques becomes attainable. Secondly, we set up a ground truth such that we can evaluate the performance of the "Proximal Policy Optimization" algorithm to learn in a single district of this epidemiological model. Finally, we consider a large-scale problem, by conducting an experiment where we aim to learn a joint policy to control the districts in a community of 11 tightly coupled districts, for which no ground truth can be established. This experiment shows that deep reinforcement learning can be used to learn mitigation policies in complex epidemiological models with a large state and action space.
引用
收藏
页码:155 / 170
页数:16
相关论文
共 50 条
  • [21] Sensors Integrated Control of PEMFC Gas Supply System Based on Large-Scale Deep Reinforcement Learning
    Li, Jiawen
    Yu, Tao
    [J]. SENSORS, 2021, 21 (02) : 1 - 19
  • [22] Large-Scale Traffic Grid Signal Control with Regional Reinforcement Learning
    Chu, Tianshu
    Qu, Shuhui
    Wang, Jie
    [J]. 2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 815 - 820
  • [23] Large-scale cost function learning for path planning using deep inverse reinforcement learning
    Wulfmeier, Markus
    Rao, Dushyant
    Wang, Dominic Zeng
    Ondruska, Peter
    Posner, Ingmar
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (10): : 1073 - 1087
  • [24] Large-scale Deep Learning at Baidu
    Yu, Kai
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2211 - 2211
  • [25] A large-scale traffic signal control algorithm based on multi-layer graph deep reinforcement learning
    Wang, Tao
    Zhu, Zhipeng
    Zhang, Jing
    Tian, Junfang
    Zhang, Wenyi
    [J]. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2024, 162
  • [26] A Large-Scale Multi-Agent Deep Reinforcement Learning Method for Cooperative Output Voltage Control of PEMFCs
    Li, Jiawen
    Cui, Haoyang
    Jiang, Wei
    Yu, Hengwen
    [J]. IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2024, 10 (01): : 78 - 94
  • [27] Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Qiao, Zhimin
    Chai, Xinghua
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 174 - 187
  • [28] A LARGE-SCALE PATH PLANNING ALGORITHM FOR UNDERWATER ROBOTS BASED ON DEEP REINFORCEMENT LEARNING
    Wang, Wenhui
    Li, Leqing
    Ye, Fumeng
    Peng, Yumin
    Ma, Yiming
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2024, 39 (03): : 204 - 210
  • [29] AUTONOMOUS NAVIGATION OF UAV IN LARGE-SCALE UNKNOWN COMPLEX ENVIRONMENT WITH DEEP REINFORCEMENT LEARNING
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    Zhang, Xiao
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 858 - 862
  • [30] A DEEP REINFORCEMENT LEARNING APPROACH TO FLOCKING AND NAVIGATION OF UAVS IN LARGE-SCALE COMPLEX ENVIRONMENTS
    Wang, Chao
    Wang, Jian
    Zhang, Xudong
    [J]. 2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 1228 - 1232