A Survey of Multi-Objective Sequential Decision-Making

被引:302
|
作者
Roijers, Diederik M. [1 ]
Vamplew, Peter [2 ]
Whiteson, Shimon [1 ]
Dazeley, Richard [2 ]
机构
[1] Univ Amsterdam, Inst Informat, Amsterdam, Netherlands
[2] Univ Ballarat, Sch Sci Informat Technol & Engn, Ballarat, Vic 3353, Australia
关键词
MANY-OBJECTIVE OPTIMIZATION; OBSERVABLE MARKOV-PROCESSES; MULTI-POLICY OPTIMIZATION; INFINITE-HORIZON; REINFORCEMENT; ITERATION; UNCERTAINTY; ALGORITHM; NETWORKS; MODELS;
D O I
10.1613/jair.3987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
引用
收藏
页码:67 / 113
页数:47
相关论文
共 50 条
  • [1] Multi-Objective Decision-Making for Mobile Cloud Offloading: A Survey
    Wu, Huaming
    [J]. IEEE ACCESS, 2018, 6 : 3962 - 3976
  • [2] Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making
    Ikenaga, Akiko
    Arai, Sachiyo
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (02) : 393 - 402
  • [3] Multi-objective decision-making for road design
    Brauers, Willem Karel M.
    Zavadskas, Edmundas Kazimieras
    Peldschus, Friedel
    Turskis, Zenonas
    [J]. TRANSPORT, 2008, 23 (03) : 183 - 193
  • [4] MULTI-OBJECTIVE DECISION-MAKING IN WATER MANAGEMENT
    FLECKSEDER, H
    [J]. WATER SCIENCE AND TECHNOLOGY, 1981, 13 (03) : 115 - 127
  • [5] Multi-objective decision-making method of IS outsourcing
    Wang Zuzhu
    Zhou Xiaoxi
    [J]. PROCEEDINGS OF THE 2007 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING - MANAGEMENT AND ORGANIZATION STUDIES SECTION, 2007, : 1170 - 1175
  • [6] Construction Project Risk Decision-making Based on Grey Multi-objective Decision-making
    Li, Hong
    Yao, Zhong
    [J]. ADVANCES IN COMPUTING, CONTROL AND INDUSTRIAL ENGINEERING, 2012, 235 : 323 - 328
  • [7] Multi-Objective Optimization and Decision-Making in Context Steering
    Dockhorn, Alexander
    Mostaghim, Sanaz
    Kirst, Martin
    Zettwitz, Martin
    [J]. 2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 308 - 315
  • [8] A multi-objective decision-making method for loan portfolio
    Guo, ZQ
    Zhou, ZF
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS 1 AND 2, 2004, : 1943 - 1948
  • [9] Policy Gradient Approaches for Multi-Objective Sequential Decision Making
    Parisi, Simone
    Pirotta, Matteo
    Smacchia, Nicola
    Bascetta, Luca
    Restelli, Marcello
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2323 - 2330
  • [10] A multi-objective model for computer-mediated decision-making
    Wang, J
    Li, Y
    [J]. Proceedings of 2005 International Conference on Public Administration, 2005, : 712 - 718