A model-based hybrid soft actor-critic deep reinforcement learning algorithm for optimal ventilator settings

Cited by: 14
Authors
Chen, Shaotao [1]
Qiu, Xihe [1]
Tan, Xiaoyu [2]
Fang, Zhijun [1]
Jin, Yaochu [3]
Affiliations
[1] Shanghai University of Engineering Science, School of Electronic and Electrical Engineering, Shanghai, People's Republic of China
[2] Ant Group, Hangzhou, People's Republic of China
[3] Bielefeld University, Faculty of Technology, D-33619 Bielefeld, Germany
Funding
National Natural Science Foundation of China
Keywords
Optimal ventilator settings; Reinforcement learning; Hybrid action space; Optimal strategy; Machine learning; SYSTEM
DOI
10.1016/j.ins.2022.08.028
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
A ventilator is a device that mechanically assists in pumping air into the lungs, which is a life-saving supportive therapy in an intensive care unit (ICU). In clinical scenarios, each patient has unique physiological circumstances and specific respiratory diseases, thus requiring individualized ventilator settings. Long-term supervision by experienced clinicians is essential to perform the task of precisely adjusting ventilator parameters and making timely modifications. Moreover, a tiny clinical error can result in severe lung injury, induce multi-system organ dysfunction, and increase mortality. To reduce the workload of clinicians and prevent medical errors, machine learning (ML), or more specifically, reinforcement learning (RL) methods, have been developed to automatically adjust the ventilator's parameters and select optimal strategies. However, the ventilator settings contain both continuous (e.g., frequency) and discrete parameters (e.g., ventilation mode), making it challenging for conventional RL-based approaches to handle such problems. Meanwhile, it is necessary to develop models with high data efficiency to overcome medical data insufficiency. In this paper, we propose a model-based hybrid soft actor-critic (MHSAC) algorithm that is developed based on the classic soft actor-critic (SAC) and model-based policy optimization (MBPO) framework. This algorithm can learn both continuous and discrete policies according to the current and predictive state of patient's physiological information with high data efficiency. Results reveal that our proposed model significantly outperforms the baseline models, achieving superior efficiency and high accuracy in the OpenAI Gym simulation environment. Our proposed model is capable of resolving mixed action space problems, enhancing data efficiency, and accelerating convergence, which can generate practical optimal ventilator settings, minimize possible medical errors, and provide clinical decision support. (c) 2022 Elsevier Inc. All rights reserved.
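To make the "hybrid action space" in the abstract concrete, the following is a minimal sketch of drawing one action that pairs a discrete choice (such as a ventilation mode) with a continuous value (such as a frequency). It is not the authors' MHSAC implementation. The function name `sample_hybrid_action`, its parameters, the three-mode logits, and the frequency bounds are all hypothetical, and the tanh-squashed Gaussian is simply the usual SAC-style choice for a bounded continuous action.

```python
import math
import random


def sample_hybrid_action(mode_logits, freq_mean, freq_log_std,
                         freq_low, freq_high, rng):
    """Sample one (discrete mode, continuous frequency) pair.

    Illustrative only: the argument names and the frequency bounds are
    assumptions, not values taken from the paper.
    """
    # Discrete head: numerically stable softmax, then a categorical draw.
    max_logit = max(mode_logits)
    weights = [math.exp(logit - max_logit) for logit in mode_logits]
    total = sum(weights)
    probs = [w / total for w in weights]
    mode = rng.choices(range(len(probs)), weights=probs, k=1)[0]

    # Continuous head: Gaussian sample squashed by tanh into [-1, 1],
    # then rescaled linearly to [freq_low, freq_high].
    raw = rng.gauss(freq_mean, math.exp(freq_log_std))
    squashed = math.tanh(raw)
    freq = freq_low + 0.5 * (squashed + 1.0) * (freq_high - freq_low)
    return mode, freq, probs


# Hypothetical usage: three modes, frequency bounded to 10-30 (assumed units).
rng = random.Random(0)
mode, freq, probs = sample_hybrid_action(
    [0.1, 0.5, -0.2], 0.0, -1.0, 10.0, 30.0, rng
)
```

In a real agent the logits, mean, and log standard deviation would come from a policy network conditioned on the patient state; here they are fixed numbers so the sampling step can be read in isolation.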
Pages: 47-64
Page count: 18