Policy Gradient Approaches for Multi-Objective Sequential Decision Making

被引:0
|
作者
Parisi, Simone [1 ]
Pirotta, Matteo [1 ]
Smacchia, Nicola [1 ]
Bascetta, Luca [1 ]
Restelli, Marcello [1 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Piazza Leonardo da Vinci 32, I-20133 Milan, Italy
关键词
FITTED-Q-ITERATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of policy gradient techniques to approximate the Pareto frontier in Multi-Objective Markov Decision Processes (MOMDPs). Despite the popularity of policy gradient algorithms and the fact that gradient ascent algorithms have been already proposed to numerically solve multi-objective optimization problems, especially in combination with multi-objective evolutionary algorithms, so far little attention has been paid to the use of gradient information to face multi-objective sequential decision problems. Two different Multi-Objective Reinforcement-Learning (MORL) approaches, called radial and Pareto following, that, starting from an initial policy, perform gradient-based policy-search procedures aimed at finding a set of non-dominated policies are here presented. Both algorithms are empirically evaluated and compared to state-of-the-art MORL algorithms on three MORL benchmark problems.
引用
收藏
页码:2323 / 2330
页数:8
相关论文
共 50 条
  • [1] Policy Gradient Approaches for Multi-Objective Sequential Decision Making: A Comparison
    Parisi, Simone
    Pirotta, Matteo
    Smacchia, Nicola
    Bascetta, Luca
    Restelli, Marcello
    [J]. 2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 79 - 86
  • [2] A Survey of Multi-Objective Sequential Decision-Making
    Roijers, Diederik M.
    Vamplew, Peter
    Whiteson, Shimon
    Dazeley, Richard
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 48 : 67 - 113
  • [3] Multi-objective optimization and decision making approaches to cricket team selection
    Ahmed, Faez
    Deb, Kalyanmoy
    Jindal, Abhilash
    [J]. APPLIED SOFT COMPUTING, 2013, 13 (01) : 402 - 414
  • [4] Distributional Multi-Objective Decision Making
    Ropke, Willem
    Hayes, Conor F.
    Mannion, Patrick
    Howley, Enda
    Nowe, Ann
    Roijers, Diederik M.
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5711 - 5719
  • [5] Hierarchical multi-objective decision making
    Homburg, C
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 105 (01) : 155 - 161
  • [6] Equilibrium approaches for Construction Processes - Multi-objective Decision Making for Construction Projects
    Peldschus, F.
    Zavadskas, E. K.
    [J]. BAUINGENIEUR, 2012, 87 : 210 - 215
  • [7] Designing innovation policy mix: a multi-objective decision-making approach
    Ghazinoory, Sepehr
    Amiri, Maghsoud
    Ghazinoori, Soroush
    Alizadeh, Parisa
    [J]. ECONOMICS OF INNOVATION AND NEW TECHNOLOGY, 2019, 28 (04) : 365 - 385
  • [8] Estimating Objective Weights of Pareto-Optimal Policies for Multi-Objective Sequential Decision-Making
    Ikenaga, Akiko
    Arai, Sachiyo
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (02) : 393 - 402
  • [9] Interactive bilevel multi-objective decision making
    Shi, X
    Xia, H
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1997, 48 (09) : 943 - 949
  • [10] Multi-objective optimisation and multi-criteria decision making in SLS using evolutionary approaches
    Padhye, Nikhil
    Deb, Kalyanmoy
    [J]. RAPID PROTOTYPING JOURNAL, 2011, 17 (06) : 458 - 478