共 24 条
Optimal adaptive nonpharmaceutical interventions to mitigate the outbreak of respiratory infections following the COVID-19 pandemic: a deep reinforcement learning study in Hong Kong, China
被引:2
|作者:
Yao, Yao
[1
]
Zhou, Hanchu
[1
]
Cao, Zhidong
[2
]
Zeng, Daniel Dajun
[2
]
Zhang, Qingpeng
[3
,4
,5
]
机构:
[1] City Univ Hong Kong, Sch Data Sci, Hong Kong, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[3] Univ Hong Kong, Musketeers Fdn Inst Data Sci, Hong Kong, Peoples R China
[4] Univ Hong Kong, LKS Fac Med, Dept Pharmacol & Pharm, Hong Kong, Peoples R China
[5] Univ Hong Kong, Musketeers Fdn Inst Data Sci, LKS Fac Med, Dept Pharmacol & Pharm, Hong Kong, Peoples R China
关键词:
Covid-19;
reinforcement learning;
artificial intelligence;
machine learning;
mathematical modelling;
infectious diseases;
D O I:
10.1093/jamia/ocad116
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Background Long-lasting nonpharmaceutical interventions (NPIs) suppressed the infection of COVID-19 but came at a substantial economic cost and the elevated risk of the outbreak of respiratory infectious diseases (RIDs) following the pandemic. Policymakers need data-driven evidence to guide the relaxation with adaptive NPIs that consider the risk of both COVID-19 and other RIDs outbreaks, as well as the available healthcare resources. Methods Combining the COVID-19 data of the sixth wave in Hong Kong between May 31, 2022 and August 28, 2022, 6-year epidemic data of other RIDs (2014-2019), and the healthcare resources data, we constructed compartment models to predict the epidemic curves of RIDs after the COVID-19-targeted NPIs. A deep reinforcement learning (DRL) model was developed to learn the optimal adaptive NPIs strategies to mitigate the outbreak of RIDs after COVID-19-targeted NPIs are lifted with minimal health and economic cost. The performance was validated by simulations of 1000 days starting August 29, 2022. We also extended the model to Beijing context. Findings Without any NPIs, Hong Kong experienced a major COVID-19 resurgence far exceeding the hospital bed capacity. Simulation results showed that the proposed DRL-based adaptive NPIs successfully suppressed the outbreak of COVID-19 and other RIDs to lower than capacity. DRL carefully controlled the epidemic curve to be close to the full capacity so that herd immunity can be reached in a relatively short period with minimal cost. DRL derived more stringent adaptive NPIs in Beijing. Interpretation DRL is a feasible method to identify the optimal adaptive NPIs that lead to minimal health and economic cost by facilitating gradual herd immunity of COVID-19 and mitigating the other RIDs outbreaks without overwhelming the hospitals. The insights can be extended to other countries/regions.
引用
收藏
页码:1543 / 1551
页数:9
相关论文