Multi-Objective Markov Decision Processes for Data-Driven Decision Support

Cited by: 0

Authors:

Lizotte, Daniel J. [1]

Laber, Eric B. [2]

Affiliations:

[1] Univ Western Ontario, Dept Comp Sci, Dept Epidemiol & Biostat, 1151 Richmond St, London, ON N6A 3K7, Canada

[2] North Carolina State Univ, Dept Stat, Raleigh, NC 27695 USA

Source:

JOURNAL OF MACHINE LEARNING RESEARCH | 2016 / Vol. 17

Funding:

Natural Sciences and Engineering Research Council of Canada;

Keywords:

multi-objective optimization; reinforcement learning; Markov decision processes; clinical decision support; evidence-based medicine; dynamic treatment regimes;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

We present new methodology based on Multi-Objective Markov Decision Processes for developing sequential decision support systems from data. Our approach uses sequential decision-making data to provide support that is useful to many different decision-makers, each with different, potentially time-varying preference. To accomplish this, we develop an extension of fitted-Q iteration for multiple objectives that computes policies for all scalarization functions, i.e. preference functions, simultaneously from continuous-state, finite-horizon data. We identify and address several conceptual and computational challenges along the way, and we introduce a new solution concept that is appropriate when different actions have similar expected outcomes. Finally, we demonstrate an application of our method using data from the Clinical Antipsychotic Trials of Intervention Effectiveness and show that our approach offers decision-makers increased choice by a larger class of optimal policies.
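As a rough illustration of the idea the abstract describes, the sketch below runs backward fitted-Q iteration on a finite-horizon problem with two reward signals, combined by a linear scalarization weight `delta`. It is not the authors' algorithm: the paper computes policies for all scalarization functions simultaneously, whereas this sketch handles one weight per call and would have to be repeated over a grid of weights. The feature map, the two-action setup, the linear regression model, and the synthetic data generator `make_synthetic_stages` are all assumptions made for illustration.

```python
import numpy as np

def features(s, a):
    """Hypothetical feature map: intercept, state, action, interaction."""
    return np.column_stack([np.ones_like(s), s, a, s * a])

def fit_linear(X, y):
    """Ordinary least-squares fit; returns the weight vector."""
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w

def scalarized_fqi(stages, delta, actions=(0.0, 1.0)):
    """Backward fitted-Q iteration for the reward delta*r1 + (1-delta)*r2.

    stages: list of (s, a, r1, r2, s_next) arrays, one tuple per decision point.
    Returns one Q-function weight vector per stage.
    """
    weights = [None] * len(stages)
    next_w = None
    for t in reversed(range(len(stages))):
        s, a, r1, r2, s_next = stages[t]
        target = delta * r1 + (1.0 - delta) * r2
        if next_w is not None:
            # Add the estimated value of acting greedily at the next stage.
            q_next = np.stack(
                [features(s_next, np.full_like(s_next, b)) @ next_w for b in actions]
            )
            target = target + q_next.max(axis=0)
        next_w = fit_linear(features(s, a), target)
        weights[t] = next_w
    return weights

def greedy_action(w, s, actions=(0.0, 1.0)):
    """Action with the largest fitted Q-value at scalar state s."""
    q = [features(np.array([s]), np.array([b]))[0] @ w for b in actions]
    return actions[int(np.argmax(q))]

def make_synthetic_stages(n=500, horizon=2, seed=0):
    """Assumed toy data: action 1 favours objective 1, action 0 favours objective 2."""
    rng = np.random.default_rng(seed)
    stages = []
    s = rng.normal(size=n)
    for _ in range(horizon):
        a = rng.integers(0, 2, size=n).astype(float)
        r1 = a + 0.1 * rng.normal(size=n)
        r2 = (1.0 - a) + 0.1 * rng.normal(size=n)
        s_next = s + 0.1 * rng.normal(size=n)
        stages.append((s, a, r1, r2, s_next))
        s = s_next
    return stages
```

Calling `scalarized_fqi(make_synthetic_stages(), delta)` for several values of `delta` between 0 and 1 gives one fitted policy per preference weight, which is the brute-force counterpart of computing the policies for all preferences at once.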


Pages: 28

