Urban Driving with Multi-Objective Deep Reinforcement Learning

Cited by: 0

Authors:

Li, Changjian [1]

Czarnecki, Krzysztof [1]

Affiliations:

[1] Univ Waterloo, Waterloo, ON, Canada

Source:

AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS | 2019

Keywords:

reinforcement learning; multi-objective optimization; Markov decision process (MDP); deep learning; autonomous driving

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory and methods]

Discipline classification code:

081202

Abstract:

Autonomous driving is a challenging domain that entails multiple aspects: a vehicle should be able to drive to its destination as fast as possible while avoiding collision, obeying traffic rules and ensuring the comfort of passengers. In this paper, we present a deep learning variant of thresholded lexicographic Q-learning for the task of urban driving. Our multi-objective DQN agent learns to drive on multilane roads and intersections, yielding and changing lanes according to traffic rules. We also propose an extension for factored Markov Decision Processes to the DQN architecture that provides auxiliary features for the Q function. This is shown to significantly improve data efficiency. We then show that the learned policy is able to zero-shot transfer to a ring road without sacrificing performance.
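
The abstract names thresholded lexicographic Q-learning without describing it. As a rough illustration only, the sketch below shows one common textbook form of thresholded lexicographic action selection: objectives are ordered by priority, each higher-priority objective filters the action set down to actions that meet its threshold (or, if none do, to the best available ones), and the last objective picks among the survivors. The function name, the example Q-values, and the threshold are all hypothetical, and this is not claimed to be the exact rule or network used in the paper.

```python
from typing import Sequence


def thresholded_lexicographic_action(
    q_values: Sequence[Sequence[float]],
    thresholds: Sequence[float],
) -> int:
    """Pick an action from per-objective Q estimates (hypothetical helper).

    q_values[i][a] is the estimate for objective i and action a, with
    objective 0 having the highest priority.
    thresholds[i] is the "good enough" level for objective i; one value is
    expected for every objective except the last.
    """
    allowed = list(range(len(q_values[0])))
    # Filter the action set with each higher-priority objective in turn.
    for q_i, tau in zip(q_values[:-1], thresholds):
        best = max(q_i[a] for a in allowed)
        # If no remaining action reaches the threshold, fall back to the best one.
        cutoff = min(tau, best)
        allowed = [a for a in allowed if q_i[a] >= cutoff]
    # The lowest-priority objective chooses among the surviving actions.
    last = q_values[-1]
    return max(allowed, key=lambda a: last[a])


# Illustrative (made-up) values: objective 0 is safety, objective 1 is speed.
example_q = [
    [-0.9, -0.1, -0.05],  # safety estimates for three actions
    [1.0, 3.0, 2.0],      # speed estimates for the same actions
]
example_action = thresholded_lexicographic_action(example_q, thresholds=[-0.2])
```

With these made-up numbers, action 0 is filtered out for failing the safety threshold, and the speed objective then prefers action 1 over action 2.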


Pages: 359-367

Page count: 9

