Policy Gradient Approaches for Multi-Objective Sequential Decision Making

被引:0
|
作者
Parisi, Simone [1 ]
Pirotta, Matteo [1 ]
Smacchia, Nicola [1 ]
Bascetta, Luca [1 ]
Restelli, Marcello [1 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Piazza Leonardo da Vinci 32, I-20133 Milan, Italy
关键词
FITTED-Q-ITERATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates the use of policy gradient techniques to approximate the Pareto frontier in Multi-Objective Markov Decision Processes (MOMDPs). Despite the popularity of policy gradient algorithms and the fact that gradient ascent algorithms have been already proposed to numerically solve multi-objective optimization problems, especially in combination with multi-objective evolutionary algorithms, so far little attention has been paid to the use of gradient information to face multi-objective sequential decision problems. Two different Multi-Objective Reinforcement-Learning (MORL) approaches, called radial and Pareto following, that, starting from an initial policy, perform gradient-based policy-search procedures aimed at finding a set of non-dominated policies are here presented. Both algorithms are empirically evaluated and compared to state-of-the-art MORL algorithms on three MORL benchmark problems.
引用
收藏
页码:2323 / 2330
页数:8
相关论文
共 50 条
  • [41] Environmentally conscious process planning and its multi-objective decision making
    Cao, HJ
    Liu, F
    Li, CB
    Chitrakar, A
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2: INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT IN THE GLOBAL ECONOMY, 2005, : 985 - 989
  • [42] Application of fuzzy multi-objective decision making in spatial load forecasting
    North Carolina State Univ, Raleigh, United States
    [J]. IEEE Trans Power Syst, 3 (1185-1190):
  • [43] Multi-objective Coevolution and Decision-making for Cooperative and Competitive Environments
    Suresh, Anirudh
    Kongmanee, Jaturong
    Deb, Kalyanmoy
    Boddeti, Vishnu Naresh
    [J]. 2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 1601 - 1608
  • [44] Multi-Objective Decision-Making for Mobile Cloud Offloading: A Survey
    Wu, Huaming
    [J]. IEEE ACCESS, 2018, 6 : 3962 - 3976
  • [45] Supporting Multi-objective Decision Making Within a Supervisory Control Environment
    Sibley, Ciara
    Coyne, Joseph
    Avvari, Gopi Vinod
    Mishra, Manisha
    Pattipati, Krishna R.
    [J]. FOUNDATIONS OF AUGMENTED COGNITION: NEUROERGONOMICS AND OPERATIONAL NEUROSCIENCE, PT II, 2016, 9744 : 210 - 221
  • [46] An integrated multi-objective decision making process for the performance evaluation of the vendors
    Parthiban, P.
    Zubar, H. Abdul
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2013, 51 (13) : 3836 - 3848
  • [47] Application of Improved SAW in the Multi-objective Decision-making Process
    Zhang Hengquan
    Lin Bin
    [J]. SYSTEMS, ORGANIZATIONS AND MANAGEMENT: PROCEEDINGS OF THE 3RD WORKSHOP OF INTERNATIONAL SOCIETY IN SCIENTIFIC INVENTIONS, 2009, : 212 - 216
  • [49] Interactive decision making algorithm for multi-objective nonlinear knapsack problem
    Hikita, M
    Isada, Y
    [J]. ISIM'2000: PROCEEDINGS OF THE FIFTH CHINA-JAPAN INTERNATIONAL SYMPOSIUM ON INDUSTRIAL MANAGEMENT, 2000, : 350 - 355
  • [50] A Multi-Objective Decision-Making Approach for the Sustainable Maintenance of Roadways
    Shoghli, Omidreza
    De La Garza, Jesus M.
    [J]. CONSTRUCTION RESEARCH CONGRESS 2016: OLD AND NEW CONSTRUCTION TECHNOLOGIES CONVERGE IN HISTORIC SAN JUAN, 2016, : 1424 - 1434