Multi-Objective Markov Decision Processes for Data-Driven Decision Support

被引:0
|
作者
Lizotte, Daniel J. [1 ]
Laber, Eric B. [2 ]
机构
[1] Univ Western Ontario, Dept Comp Sci, Dept Epidemiol & Biostat, 1151 Richmond St, London, ON N6A 3K7, Canada
[2] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA
基金
加拿大自然科学与工程研究理事会;
关键词
multi-objective optimization; reinforcement learning; Markov decision processes; clinical decision support; evidence-based medicine; DYNAMIC TREATMENT REGIMES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present new methodology based on Multi-Objective Markov Decision Processes for developing sequential decision support systems from data. Our approach uses sequential decision-making data to provide support that is useful to many different decision-makers, each with different, potentially time-varying preference. To accomplish this, we develop an extension of fitted-Q iteration for multiple objectives that computes policies for all scalarization functions, i.e. preference functions, simultaneously from continuous-state, finite-horizon data. We identify and address several conceptual and computational challenges along the way, and we introduce a new solution concept that is appropriate when different actions have similar expected outcomes. Finally, we demonstrate an application of our method using data from the Clinical Antipsychotic Trials of Intervention Effectiveness and show that our approach offers decision-makers increased choice by a larger class of optimal policies.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] MULTI-OBJECTIVE MODEL CHECKING OF MARKOV DECISION PROCESSES
    Etessami, Kousha
    Kwiatkowska, Marta
    Vardi, Moshe Y.
    Yannakakis, Mihalis
    [J]. LOGICAL METHODS IN COMPUTER SCIENCE, 2008, 4 (04)
  • [2] Multi-objective model checking of Markov decision processes
    Etessami, K.
    Kwiatkowska, M.
    Vardi, M. Y.
    Yannakakis, M.
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PROCEEDINGS, 2007, 4424 : 50 - +
  • [3] Markov Decision Processes For Multi-Objective Satellite Task Planning
    Eddy, Duncan
    Kochenderfer, Mykel
    [J]. 2020 IEEE AEROSPACE CONFERENCE (AEROCONF 2020), 2020,
  • [4] Computing optimal stationary policies for multi-objective Markov decision processes
    Wiering, Marco A.
    de Jong, Edwin D.
    [J]. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, : 158 - +
  • [5] Multi-objective Robust Strategy Synthesis for Interval Markov Decision Processes
    Hahn, Ernst Moritz
    Hashemi, Vahid
    Hermanns, Holger
    Lahijanian, Morteza
    Turrini, Andrea
    [J]. QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2017), 2017, 10503 : 207 - 223
  • [6] Fuzzy Multi-objective Markov Decision Programming
    曾庆宁
    [J]. 桂林电子科技大学学报, 1999, (01) : 45 - 48
  • [8] Multi-objective discounted semi-Markov decision processes with multiple constraints
    Wang, YH
    Zhang, S
    Zhang, JH
    [J]. PROCEEDINGS OF THE SECOND ASIAN MATHEMATICAL CONFERENCE 1995, 1998, : 551 - 555
  • [9] Data-Driven Decision Support for Business Processes: Causal Reasoning and Discovery
    Alaee, Ali J.
    Weidlich, Matthias
    Senderovich, Arik
    [J]. BUSINESS PROCESS MANAGEMENT FORUM, BPM 2024, 2024, 526 : 90 - 106
  • [10] Understanding data-driven decision support systems
    Power, Daniel J.
    [J]. INFORMATION SYSTEMS MANAGEMENT, 2008, 25 (02) : 149 - 154