Projection based inverse reinforcement learning for the analysis of dynamic treatment regimes

Authors
Syed Ihtesham Hussain Shah
Giuseppe De Pietro
Giovanni Paragliola
Antonio Coronato
Affiliations
[1] Parthenope University, Department of ICT and Engineering
[2] National Research Council, ICAR
[3] Università Telematica Giustino Fortunato
Source
Applied Intelligence | 2023 / Vol. 53
Keywords
Inverse Reinforcement Learning (IRL); Dynamic Treatment Regime (DTR); Reinforcement Learning (RL); Decision Support System (DSS)
DOI
Not available
Abstract
Dynamic Treatment Regimes (DTRs) are adaptive treatment strategies that allow clinicians to dynamically personalize treatment for each patient based on the patient's step-by-step response. For each disease there is a series of predefined alternative treatments, and a patient may be assigned to one of them according to his or her demographics. DTRs for a given disease are studied and evaluated with statistical approaches in which patients are randomized at each step of the treatment and their responses are observed. Recently, the Reinforcement Learning (RL) paradigm has also been applied to determine DTRs. However, such approaches may be limited by the need to design a true reward function, which can be difficult to formalize when expert knowledge is not well established, for example when the DTR is still in the design phase. To address this limitation, an extension of the RL paradigm, Inverse Reinforcement Learning (IRL), has been adopted to learn the reward function from data, such as those derived from DTR trials. In this paper, we define a Projection Based Inverse Reinforcement Learning (PB-IRL) approach to learn the true underlying reward function from given demonstrations (DTR trials). Such a reward function can be used both to evaluate the set of DTRs determined for a certain disease and to enable an RL-based intelligent agent to learn the best course of action on its own and then act as a decision support system for the clinician.
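The abstract does not spell out the PB-IRL update, so the following is only a minimal sketch of the classical projection step from apprenticeship learning (Abbeel and Ng, 2004), which a projection based IRL method may plausibly build on. It assumes a reward that is linear in state features, R(s) = w · φ(s); the function names, the feature vectors, and the discount factor are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equally sized vectors."""
    return sum(x * y for x, y in zip(a, b))


def feature_expectations(trajectories: Sequence[Sequence[Vector]],
                         gamma: float) -> Vector:
    """Empirical discounted feature expectations.

    Each trajectory is a list of feature vectors phi(s_t), e.g. one
    vector per treatment step of a patient (hypothetical encoding).
    """
    dim = len(trajectories[0][0])
    mu = [0.0] * dim
    for traj in trajectories:
        for t, phi in enumerate(traj):
            for k in range(dim):
                mu[k] += (gamma ** t) * phi[k]
    return [m / len(trajectories) for m in mu]


def projection_step(mu_expert: Vector, mu_bar_prev: Vector,
                    mu_new: Vector) -> Tuple[Vector, Vector, float]:
    """One projection update (assumed form, after Abbeel and Ng, 2004).

    Projects mu_expert orthogonally onto the line through mu_bar_prev
    and mu_new, then returns the projected point, the reward weights
    w = mu_expert - mu_bar, and the margin ||w||.
    """
    d = [n - b for n, b in zip(mu_new, mu_bar_prev)]
    denom = dot(d, d)
    if denom == 0.0:
        # The new policy adds no direction to project along.
        mu_bar = list(mu_bar_prev)
    else:
        gap = [e - b for e, b in zip(mu_expert, mu_bar_prev)]
        step = dot(d, gap) / denom
        mu_bar = [b + step * x for b, x in zip(mu_bar_prev, d)]
    w = [e - b for e, b in zip(mu_expert, mu_bar)]
    margin = dot(w, w) ** 0.5
    return mu_bar, w, margin


# Toy usage with made-up two-dimensional feature expectations.
mu_bar, w, margin = projection_step([1.0, 1.0], [0.0, 0.0], [2.0, 0.0])
```

In a full loop under these assumptions, `w` would define the reward handed to an RL solver, the resulting policy's feature expectations would become the next `mu_new`, and iteration would stop once `margin` falls below a chosen tolerance.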
Pages: 14072 - 14084
Number of pages: 12
Related papers
  • [1] Projection based inverse reinforcement learning for the analysis of dynamic treatment regimes
    Shah, Syed Ihtesham Hussain
    De Pietro, Giuseppe
    Paragliola, Giovanni
    Coronato, Antonio
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14072 - 14084
  • [2] Adversarial reinforcement learning for dynamic treatment regimes
    Sun, Zhaohong
    Dong, Wei
    Li, Haomin
    Huang, Zhengxing
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 137
  • [3] Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach
    Saghafian, Soroush
    MANAGEMENT SCIENCE, 2024, 70 (09) : 5667 - 5690
  • [4] TREE-BASED REINFORCEMENT LEARNING FOR ESTIMATING OPTIMAL DYNAMIC TREATMENT REGIMES
    Tao, Yebin
    Wang, Lu
    Almirall, Daniel
    ANNALS OF APPLIED STATISTICS, 2018, 12 (03) : 1914 - 1938
  • [5] Near-Optimal Reinforcement Learning in Dynamic Treatment Regimes
    Zhang, Junzhe
    Bareinboim, Elias
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Multiobjective tree-based reinforcement learning for estimating tolerant dynamic treatment regimes
    Song, Yao
    Wang, Lu
    BIOMETRICS, 2024, 80 (01)
  • [7] Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data
    Liu, Ying
    Logan, Brent
    Liu, Ning
    Xu, Zhiyuan
    Tang, Jian
    Wang, Yanzhi
    2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017 : 380 - 385
  • [8] Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
    Chao Yu
    Yinzhao Dong
    Jiming Liu
    Guoqi Ren
    BMC Medical Informatics and Decision Making, 19
  • [9] Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV
    Yu, Chao
    Dong, Yinzhao
    Liu, Jiming
    Ren, Guoqi
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)
  • [10] A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data
    Sun, Xiaoqiang
    Sun, Zhiwei
    Wang, Ting
    Feng, Jie
    Wei, Jiakai
    Hu, Guangwu
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021