Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications

Cited by: 4
Authors
Nambiar, Mila [1]
Ghosh, Supriyo [1, 2]
Ong, Priscilla [1]
Chan, Yu En [1]
Bee, Yong Mong [3]
Krishnaswamy, Pavitra [1]
Affiliations
[1] ASTAR, Inst Infocomm Res I2R, Singapore, Singapore
[2] Microsoft, Bengaluru, Karnataka, India
[3] Singapore Gen Hosp, Dept Endocrinol, Singapore, Singapore
Keywords
Offline reinforcement learning; treatment optimization; sepsis treatment; type 2 diabetes treatment; sampling; safety constraints
DOI
10.1145/3580305.3599800
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
There is increasing interest in data-driven approaches for recommending optimal treatment strategies in many chronic disease management and critical care applications. Reinforcement learning methods are well-suited to this sequential decision-making problem, but must be trained and evaluated exclusively on retrospective medical record datasets as direct online exploration is unsafe and infeasible. Despite this requirement, the vast majority of treatment optimization studies use off-policy RL methods (e.g., Double Deep Q Networks (DDQN) or its variants) that are known to perform poorly in purely offline settings. Recent advances in offline RL, such as Conservative Q-Learning (CQL), offer a suitable alternative. But there remain challenges in adapting these approaches to real-world applications where suboptimal examples dominate the retrospective dataset and strict safety constraints need to be satisfied. In this work, we introduce a practical and theoretically grounded transition sampling approach to address action imbalance during offline RL training. We perform extensive experiments on two real-world tasks for diabetes and sepsis treatment optimization to compare performance of the proposed approach against prominent off-policy and offline RL baselines (DDQN and CQL). Across a range of principled and clinically relevant metrics, we show that our proposed approach enables substantial improvements in expected health outcomes and in consistency with relevant practice and safety guidelines.
Pages: 4673-4684
Number of pages: 12