A Survey of Multi-Objective Sequential Decision-Making

被引:302
|
作者
Roijers, Diederik M. [1 ]
Vamplew, Peter [2 ]
Whiteson, Shimon [1 ]
Dazeley, Richard [2 ]
机构
[1] Univ Amsterdam, Inst Informat, Amsterdam, Netherlands
[2] Univ Ballarat, Sch Sci Informat Technol & Engn, Ballarat, Vic 3353, Australia
关键词
MANY-OBJECTIVE OPTIMIZATION; OBSERVABLE MARKOV-PROCESSES; MULTI-POLICY OPTIMIZATION; INFINITE-HORIZON; REINFORCEMENT; ITERATION; UNCERTAINTY; ALGORITHM; NETWORKS; MODELS;
D O I
10.1613/jair.3987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
引用
收藏
页码:67 / 113
页数:47
相关论文
共 50 条
  • [21] Policy Gradient Approaches for Multi-Objective Sequential Decision Making: A Comparison
    Parisi, Simone
    Pirotta, Matteo
    Smacchia, Nicola
    Bascetta, Luca
    Restelli, Marcello
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 79 - 86
  • [22] Decision-making for new technology: A multi-actor, multi-objective method
    Cunningham, Scott W.
    van der Lei, Telli E.
    [J]. PICMET '07: PORTLAND INTERNATIONAL CENTER FOR MANAGEMENT OF ENGINEERING AND TECHNOLOGY, VOLS 1-6, PROCEEDINGS: MANAGEMENT OF CONVERGING TECHNOLOGIES, 2007, : 1176 - 1185
  • [23] Decision-making for new technology: A multi-actor, multi-objective method
    Cunningham, Scott W.
    van der Lei, Telli E.
    [J]. TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2009, 76 (01) : 26 - 38
  • [24] Hierarchical multi-objective decision making
    Univ of Mannheim, Mannheim, Germany
    [J]. Eur J Oper Res, 1 (155-161):
  • [25] Hierarchical multi-objective decision making
    Homburg, C
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 105 (01) : 155 - 161
  • [26] Towards Many-Objective Optimization: Objective Analysis, Multi-Objective Optimization and Decision-Making
    Zheng, J. H.
    Kou, Y. N.
    Jing, Z. X.
    Wu, Q. H.
    [J]. IEEE ACCESS, 2019, 7 : 93742 - 93751
  • [27] The Multi-Objective Decision-Making Model Based On Grey Correlation Degree
    Yang, Zhijun
    [J]. PROCEEDINGS OF 2013 IEEE INTERNATIONAL CONFERENCE ON GREY SYSTEMS AND INTELLIGENT SERVICES (GSIS), 2013, : 26 - 28
  • [28] A kind of layered affective cognition model for multi-objective decision-making
    School of Information Science and Technology, Beijing University of Chemical Technology, Beijing
    100029, China
    [J]. Kongzhi yu Juece Control Decis, 12 (2129-2136):
  • [29] The Multi-Objective Decision-Making Based on DEA and Analytic Hierarchy Process
    Chen, Guodong
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SERVICE SYSTEM (CSSS), 2014, 109 : 402 - 405
  • [30] A multi-objective decision-making process for reuse selection of historic buildings
    Wang, Huey-Jiun
    Zeng, Zhi-Teng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (02) : 1241 - 1249