Urban Driving with Multi-Objective Deep Reinforcement Learning

被引:0
|
作者
Li, Changjian [1 ]
Czarnecki, Krzysztof [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
关键词
reinforcement learning; multi-objective optimization; markov decision process (MDP); deep learning; autonomous driving;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Autonomous driving is a challenging domain that entails multiple aspects: a vehicle should be able to drive to its destination as fast as possible while avoiding collision, obeying traffic rules and ensuring the comfort of passengers. In this paper, we present a deep learning variant of thresholded lexicographic Q-learning for the task of urban driving. Our multi-objective DQN agent learns to drive on multilane roads and intersections, yielding and changing lanes according to traffic rules. We also propose an extension for factored Markov Decision Processes to the DQN architecture that provides auxiliary features for the Q function. This is shown to significantly improve data efficiency. 1 We then show that the learned policy is able to zero-shot transfer to a ring road without sacrificing performance.
引用
收藏
页码:359 / 367
页数:9
相关论文
共 50 条
  • [31] Investigating the multi-objective optimization of quality and efficiency using deep reinforcement learning
    Wang, Zhenhui
    Lu, Juan
    Chen, Chaoyi
    Ma, Junyan
    Liao, Xiaoping
    [J]. APPLIED INTELLIGENCE, 2022, 52 (11) : 12873 - 12887
  • [32] Multi-objective deep reinforcement learning for emergency scheduling in a water distribution network
    Chengyu Hu
    Qiuming Wang
    Wenyin Gong
    Xuesong Yan
    [J]. Memetic Computing, 2022, 14 : 211 - 223
  • [33] Dynamic multi-objective scheduling for flexible job shop by deep reinforcement learning
    Luo, Shu
    Zhang, Linxuan
    Fan, Yushun
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 159
  • [34] Deep reinforcement learning for multi-objective placement of virtual machines in cloud datacenters
    Luca Caviglione
    Mauro Gaggero
    Massimo Paolucci
    Roberto Ronco
    [J]. Soft Computing, 2021, 25 : 12569 - 12588
  • [35] Multi-objective deep reinforcement learning for emergency scheduling in a water distribution network
    Hu, Chengyu
    Wang, Qiuming
    Gong, Wenyin
    Yan, Xuesong
    [J]. MEMETIC COMPUTING, 2022, 14 (02) : 211 - 223
  • [36] Multi-objective Reinforcement Learning for Responsive Grids
    Perez, Julien
    Germain-Renaud, Cecile
    Kegl, Balazs
    Loomis, Charles
    [J]. JOURNAL OF GRID COMPUTING, 2010, 8 (03) : 473 - 492
  • [37] Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning
    Kanazawa, Takuya
    Gupta, Chetan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 63 - 76
  • [38] PMDRL: Pareto-front-based multi-objective deep reinforcement learning
    Yang F.
    Huang H.
    Shi W.
    Ma Y.
    Feng Y.
    Cheng G.
    Liu Z.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12663 - 12672
  • [39] Multi-Objective Deep Reinforcement Learning Assisted Service Function Chains Placement
    Bi, Yu
    Meixner, Carlos Colman
    Bunyakitanon, Monchai
    Vasilakos, Xenofon
    Nejabati, Reza
    Simeonidou, Dimitra
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (04): : 4134 - 4150
  • [40] An Improved Multi-Objective Deep Reinforcement Learning Algorithm Based on Envelope Update
    Hu, Can
    Zhu, Zhengwei
    Wang, Lijia
    Zhu, Chenyang
    Yang, Yanfei
    [J]. ELECTRONICS, 2022, 11 (16)