A multi-objective deep reinforcement learning framework

被引:47
|
作者
Thanh Thi Nguyen [1 ]
Ngoc Duy Nguyen [2 ]
Vamplew, Peter [3 ]
Nahavandi, Saeid [2 ]
Dazeley, Richard [1 ]
Lim, Chee Peng [2 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Geelong, Vic, Australia
[3] Federation Univ, Sch Sci Engn & Informat Technol, Mt Helen, Australia
关键词
Reinforcement learning; Multi-objective; Deep learning; Single-policy; Multi-policy;
D O I
10.1016/j.engappai.2020.103915
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment and three-objective Mountain Car problem) indicate that the proposed framework is able to find the Pareto-optimal solutions effectively. The proposed framework is generic and highly modularized, which allows the integration of different deep reinforcement learning algorithms in different complex problem domains. This therefore overcomes many disadvantages involved with standard multi-objective reinforcement learning methods in the current literature. The proposed framework acts as a testbed platform that accelerates the development of MODRL for solving increasingly complicated multi-objective problems.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A Two-Stage Multi-Objective Deep Reinforcement Learning Framework
    Chen, Diqi
    Wang, Yizhou
    Gao, Wen
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1063 - 1070
  • [2] Dynamic Weights in Multi-Objective Deep Reinforcement Learning
    Abels, Axel
    Roijers, Diederik M.
    Lenaerts, Tom
    Nowe, Ann
    Steckelmacher, Denis
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [3] Urban Driving with Multi-Objective Deep Reinforcement Learning
    Li, Changjian
    Czarnecki, Krzysztof
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 359 - 367
  • [4] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Horie, Naoto
    Matsui, Tohgoroh
    Moriyama, Koichi
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2019, 24 (03) : 352 - 359
  • [5] Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning
    Naoto Horie
    Tohgoroh Matsui
    Koichi Moriyama
    Atsuko Mutoh
    Nobuhiro Inuzuka
    [J]. Artificial Life and Robotics, 2019, 24 : 352 - 359
  • [6] Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework
    Felten, Florian
    Talbi, El-Ghazali
    Danoy, Gregoire
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 79 : 679 - 723
  • [7] Multi-objective path planning based on deep reinforcement learning
    Xu, Jian
    Huang, Fei
    Cui, Yunfei
    Du, Xue
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3273 - 3279
  • [8] Modular Multi-Objective Deep Reinforcement Learning with Decision Values
    Tajmajer, Tomasz
    [J]. PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 85 - 93
  • [9] Deep reinforcement learning for multi-objective game strategy selection
    Jiang, Ruhao
    Deng, Yanchen
    Chen, Yingying
    Luo, He
    An, Bo
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2024, 168
  • [10] Dynamic multi-objective sequence-wise recommendation framework via deep reinforcement learning
    Zhang, Xiankun
    Shang, Yuhu
    Ren, Yimeng
    Liang, Kun
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) : 1891 - 1911