A Survey of Multi-Objective Sequential Decision-Making

Cited by: 302
Authors
Roijers, Diederik M. [1 ]
Vamplew, Peter [2 ]
Whiteson, Shimon [1 ]
Dazeley, Richard [2 ]
Affiliations
[1] Univ Amsterdam, Inst Informat, Amsterdam, Netherlands
[2] Univ Ballarat, Sch Sci Informat Technol & Engn, Ballarat, Vic 3353, Australia
Keywords
MANY-OBJECTIVE OPTIMIZATION; OBSERVABLE MARKOV-PROCESSES; MULTI-POLICY OPTIMIZATION; INFINITE-HORIZON; REINFORCEMENT; ITERATION; UNCERTAINTY; ALGORITHM; NETWORKS; MODELS;
DOI
10.1613/jair.3987
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
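The distinction the abstract draws between a Pareto front and a convex hull can be illustrated with a small sketch. It assumes maximization, a linear scalarization function, and a made-up set of two-objective policy values; the names `policy_values`, `pareto_front`, and `best_under_weights` are hypothetical and are not taken from the article. The convex hull is treated here only informally, as the set of values that are optimal for some linear weighting.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]


def dominates(a: Vector, b: Vector) -> bool:
    """True if a is at least as good as b in every objective and
    strictly better in at least one (maximization)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(values: Sequence[Vector]) -> List[Vector]:
    """Keep only the value vectors that no other vector dominates."""
    return [v for v in values if not any(dominates(u, v) for u in values)]


def linear_scalarize(value: Vector, weights: Sequence[float]) -> float:
    """Project a multi-objective value to a scalar via a weighted sum."""
    return sum(w * x for w, x in zip(weights, value))


def best_under_weights(values: Sequence[Vector], weights: Sequence[float]) -> Vector:
    """Return the value vector that maximizes the weighted sum."""
    return max(values, key=lambda v: linear_scalarize(v, weights))


# Hypothetical values of four policies on two objectives (illustrative only).
policy_values: List[Vector] = [(1.0, 9.0), (4.0, 4.0), (9.0, 1.0), (3.0, 3.0)]

front = pareto_front(policy_values)

# Sweep linear weights (w, 1 - w) and collect every policy value that wins.
linear_winners = {
    best_under_weights(policy_values, (w / 10, 1 - w / 10)) for w in range(11)
}

print("Pareto front:", front)
print("Optimal for some linear weighting:", sorted(linear_winners))
```

In this toy data, (4.0, 4.0) is Pareto-optimal but never wins under any linear weighting in the sweep, which is why the set of solutions one needs depends on the kind of scalarization function assumed.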
Pages: 67 - 113
Number of pages: 47
Related papers
50 in total
  • [31] Designing innovation policy mix: a multi-objective decision-making approach
    Ghazinoory, Sepehr
    Amiri, Maghsoud
    Ghazinoori, Soroush
    Alizadeh, Parisa
    [J]. ECONOMICS OF INNOVATION AND NEW TECHNOLOGY, 2019, 28 (04) : 365 - 385
  • [32] MULTI-OBJECTIVE PROGRAMMING MODELS - NEW WAYS IN REGIONAL DECISION-MAKING
    NIJKAMP, P
    RIETVELD, P
    [J]. REGIONAL SCIENCE AND URBAN ECONOMICS, 1976, 6 (03) : 253 - 274
  • [33] Multi-objective decision-making model based on CBM for an aircraft fleet
    Luo, Bin
    Lin, Lin
    [J]. ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [34] Decision-making analysis of multi-objective optimal operation of power system
    Shi, LB
    Zhang, Y
    [J]. POWERCON 2002: INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY, VOLS 1-4, PROCEEDINGS, 2002, : 804 - 807
  • [35] A multi-objective decision-making method for the treatment scheme of landslide hazard
    Xie, QM
    Xia, YY
    [J]. JOURNAL OF UNIVERSITY OF SCIENCE AND TECHNOLOGY BEIJING, 2004, 11 (02): : 101 - 105
  • [36] Multi-objective collaborative decision-making for flood resource utilization in a reservoir
    Xinyu Wan
    Yuting Xue
    Lijuan Hua
    Qingyang Wu
    [J]. Stochastic Environmental Research and Risk Assessment, 2023, 37 : 4629 - 4640
  • [37] A multi-objective decision-making method for commercial banks loan portfolio
    Guo, ZQ
    Zhou, ZF
    [J]. DATA MINING AND KNOWLEDGE MANAGEMENT, 2004, 3327 : 221 - 228
  • [38] OBJECTIVES AND MULTI-OBJECTIVE DECISION-MAKING UNDER UNCERTAINTY - WILHELM,J
    WALTER, J
    [J]. EKONOMICKO-MATEMATICKY OBZOR, 1976, 12 (04): : 472 - 472
  • [39] Multi-objective collaborative decision-making for flood resource utilization in a reservoir
    Wan, Xinyu
    Xue, Yuting
    Hua, Lijuan
    Wu, Qingyang
    [J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2023, 37 (12) : 4629 - 4640
  • [40] Distributional Multi-Objective Decision Making
    Ropke, Willem
    Hayes, Conor F.
    Mannion, Patrick
    Howley, Enda
    Nowe, Ann
    Roijers, Diederik M.
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5711 - 5719