A multi-objective deep reinforcement learning framework

Cited by: 47

Authors:

Thanh Thi Nguyen [1]

Ngoc Duy Nguyen [2]

Vamplew, Peter [3]

Nahavandi, Saeid [2]

Dazeley, Richard [1]

Lim, Chee Peng [2]

Affiliations:

[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia

[2] Deakin Univ, Inst Intelligent Syst Res & Innovat, Geelong, Vic, Australia

[3] Federation Univ, Sch Sci Engn & Informat Technol, Mt Helen, Australia

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2020 / Vol. 96

Keywords:

Reinforcement learning; Multi-objective; Deep learning; Single-policy; Multi-policy

DOI:

10.1016/j.engappai.2020.103915

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This paper introduces a new scalable multi-objective deep reinforcement learning (MODRL) framework based on deep Q-networks. We develop a high-performance MODRL framework that supports both single-policy and multi-policy strategies, as well as both linear and non-linear approaches to action selection. The experimental results on two benchmark problems (two-objective deep sea treasure environment and three-objective Mountain Car problem) indicate that the proposed framework is able to find the Pareto-optimal solutions effectively. The proposed framework is generic and highly modularized, which allows the integration of different deep reinforcement learning algorithms in different complex problem domains. This therefore overcomes many disadvantages involved with standard multi-objective reinforcement learning methods in the current literature. The proposed framework acts as a testbed platform that accelerates the development of MODRL for solving increasingly complicated multi-objective problems.
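The abstract mentions linear and non-linear action selection and Pareto-optimal solutions without giving details. The following is a minimal sketch of what those terms commonly mean, not the paper's implementation. It assumes each action has one Q-value per objective; the function names, the example Q-values, and the choice of a weighted Chebyshev distance as the non-linear rule are all illustrative assumptions.

```python
from typing import List, Sequence


def linear_action(q_values: Sequence[Sequence[float]],
                  weights: Sequence[float]) -> int:
    """Linear selection: pick the action with the largest weighted sum
    of its per-objective Q-values."""
    scores = [sum(w * q for w, q in zip(weights, q_a)) for q_a in q_values]
    return max(range(len(scores)), key=scores.__getitem__)


def chebyshev_action(q_values: Sequence[Sequence[float]],
                     weights: Sequence[float],
                     utopia: Sequence[float]) -> int:
    """Non-linear selection (assumed here: weighted Chebyshev): pick the
    action whose worst weighted gap to a reference 'utopia' point is smallest."""
    gaps = [max(w * abs(z - q) for w, q, z in zip(weights, q_a, utopia))
            for q_a in q_values]
    return min(range(len(gaps)), key=gaps.__getitem__)


def pareto_front(points: Sequence[Sequence[float]]) -> List[Sequence[float]]:
    """Return the non-dominated points, assuming every objective is maximized."""
    def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
        return (all(x >= y for x, y in zip(a, b))
                and any(x > y for x, y in zip(a, b)))
    return [p for p in points if not any(dominates(o, p) for o in points)]


# Hypothetical Q-values for 4 actions and 2 objectives
# (for example: treasure value, negative time cost). Made-up numbers.
q = [[1.0, -1.0], [5.0, -6.0], [3.0, -2.0], [2.0, -3.0]]

balanced = linear_action(q, weights=[0.5, 0.5])      # trades off both objectives
greedy = linear_action(q, weights=[1.0, 0.0])        # only the first objective
cheb = chebyshev_action(q, weights=[0.5, 0.5], utopia=[5.0, -1.0])
front = pareto_front(q)                              # drops the dominated action
```

In this toy example, changing the weight vector changes which action is chosen, which is the usual way a single-policy method is steered toward one trade-off, while collecting the non-dominated outcomes across many weightings is one common route to a multi-policy result.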

Pages: 12
