Projection based inverse reinforcement learning for the analysis of dynamic treatment regimes

被引:0
|
作者
Syed Ihtesham Hussain Shah
Giuseppe De Pietro
Giovanni Paragliola
Antonio Coronato
机构
[1] Parthenope University,Department of ICT and Engineering
[2] National Research Council,ICAR
[3] Università Telematica Giustino Fortunato,undefined
来源
Applied Intelligence | 2023年 / 53卷
关键词
Inverse Reinforcement Learning (IRL); Dynamic Treatment Regime (DTR); Reinforcement Learning (RL); Decision Support System (DSS);
D O I
暂无
中图分类号
学科分类号
摘要
Dynamic Treatment Regimes (DTRs) are adaptive treatment strategies that allow clinicians to personalize dynamically the treatment for each patient based on their step-by-step response to their treatment. There are a series of predefined alternative treatments for each disease and any patient may associate with one of these treatments according to his/her demographics. DTRs for a certain disease are studied and evaluated by means of statistical approaches where patients are randomized at each step of the treatment and their responses are observed. Recently, the Reinforcement Learning (RL) paradigm has also been applied to determine DTRs. However, such approaches may be limited by the need to design a true reward function, which may be difficult to formalize when the expert knowledge is not well assessed, as when the DTR is in the design phase. To address this limitation, an extension of the RL paradigm, namely Inverse Reinforcement Learning (IRL), has been adopted to learn the reward function from data, such as those derived from DTR trials. In this paper, we define a Projection Based Inverse Reinforcement Learning (PB-IRL) approach to learn the true underlying reward function for given demonstrations (DTR trials). Such a reward function can be used both to evaluate the set of DTRs determined for a certain disease, as well as to enable an RL-based intelligent agent to self-learn the best way and then act as a decision support system for the clinician.
引用
收藏
页码:14072 / 14084
页数:12
相关论文
共 50 条
  • [21] Analysis of Inverse Reinforcement Learning with Perturbed Demonstrations
    Melo, Francisco S.
    Lopes, Manuel
    Ferreira, Ricardo
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 349 - 354
  • [22] Score-based Inverse Reinforcement Learning
    El Asri, Layla
    Piot, Bilal
    Geist, Matthieu
    Laroche, Romain
    Pietquin, Olivier
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 457 - 465
  • [23] Sensitivity-Based Inverse Reinforcement Learning
    Tao, Zhaorong
    Chen, Zhichao
    Li, Yanjie
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2856 - 2861
  • [24] Neuroevolution-Based Inverse Reinforcement Learning
    Budhraja, Karan K.
    Oates, Tim
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 67 - 76
  • [25] Inverse Reinforcement Learning based on Critical State
    Hwang, Kao-Shing
    Cheng, Tien-Yu
    Jiang, Wei-Cheng
    PROCEEDINGS OF THE 2015 CONFERENCE OF THE INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY, 2015, 89 : 771 - 775
  • [26] Comparison of dynamic treatment regimes via inverse probability weighting
    Hernán, MA
    Lanoy, E
    Costagliola, D
    Robins, JM
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2006, 98 (03) : 237 - 242
  • [27] Semi-Supervised Off-Policy Reinforcement Learning and Value Estimation for Dynamic Treatment Regimes
    Sonabend-W, Aaron
    Laha, Nilanjana
    Ananthakrishnan, Ashwin N.
    Cai, Tianxi
    Mukherjee, Rajarshi
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [28] Inverse Reinforcement Learning Based Stochastic Driver Behavior Learning
    Ozkan, Mehmet F.
    Rocque, Abishek J.
    Ma, Yao
    IFAC PAPERSONLINE, 2021, 54 (20): : 882 - 888
  • [29] INVERSE REINFORCEMENT LEARNING BASED DRIVER BEHAVIOR ANALYSIS AND FUEL ECONOMY ASSESSMENT
    Ozkan, Mehmet Fatih
    Ma, Yao
    PROCEEDINGS OF THE ASME DYNAMIC SYSTEMS AND CONTROL CONFERENCE, DSCC2020, VOL 1, 2020,
  • [30] User Behavior Analysis in Online Health Community Based on Inverse Reinforcement Learning
    Zhang, Yaqi
    Wang, Xi
    Zuo, Zhiya
    Fan, Dan
    E-BUSINESS: NEW CHALLENGES AND OPPORTUNITIES FOR DIGITAL-ENABLED INTELLIGENT FUTURE, PT III, WHICEB 2024, 2024, 517 : 250 - 259