Safe Reinforcement Learning for an Energy-Efficient Driver Assistance System

Cited by: 2
Authors
Hailemichael, Habtamu [1]
Ayalew, Beshah [1]
Kerbel, Lindsey [1]
Ivanco, Andrej [2]
Loiselle, Keith [2]
Affiliations
[1] Clemson Univ, Automot Engn, Greenville, SC 29607 USA
[2] Allison Transmiss Inc, One Allison Way, Indianapolis, IN 46222 USA
Source
IFAC PAPERSONLINE | 2022 / Volume 55 / Issue 37
Keywords
RL driver-assist; Safe reinforcement learning; Safety filtering; Control barrier functions;
DOI
10.1016/j.ifacol.2022.11.250
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Reinforcement learning (RL)-based driver assistance systems seek to improve fuel consumption via continual improvement of powertrain control actions considering experiential data from the field. However, the need to explore diverse experiences in order to learn optimal policies often limits the application of RL techniques in safety-critical systems like vehicle control. In this paper, an exponential control barrier function (ECBF) is derived and utilized to filter unsafe actions proposed by an RL-based driver assistance system. The RL agent freely explores and optimizes the performance objectives while unsafe actions are projected to the closest actions in the safe domain. The reward is structured so that the driver's acceleration requests are met in a manner that boosts fuel economy and does not compromise comfort. The optimal gear and traction torque control actions that maximize the cumulative reward are computed via the Maximum a Posteriori Policy Optimization (MPO) algorithm configured for a hybrid action space. The proposed safe-RL scheme is trained and evaluated in car-following scenarios, where it is shown to avoid collisions effectively during both training and evaluation while delivering the expected fuel economy improvements for the driver assistance system. Copyright (c) 2022 The Authors. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0)
Pages: 615 - 620
Number of pages: 6
Related papers
50 items in total
  • [31] Reinforcement Learning Based MEC Architecture with Energy-Efficient Optimization for ARANs
    He, Qiang
    Lv, Yingjie
    Zhen, Li
    Yu, Keping
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [32] Deep Reinforcement Learning for Energy-Efficient Power Control in Heterogeneous Networks
    Peng, Jianhao
    Zheng, Jiabao
    Zhang, Lin
    Xiao, Ming
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 141 - 146
  • [33] Decentralised reinforcement learning for energy-efficient scheduling in wireless sensor networks
    Mihaylov, Mihail
    Le Borgne, Yann-Ael
    INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2012, 9 (3-4) : 207 - 224
  • [34] Reinforcement Learning based Energy-Efficient Routing with Latency Constraints for FANETs
    Qi, Xuchen
    Li, Jieling
    Lv, Zefang
    Xiao, Liang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2638 - 2643
  • [35] Deep Reinforcement Learning for Energy-Efficient Networking with Reconfigurable Intelligent Surfaces
    Lee, Gilsoo
    Jung, Minchae
    Kasgari, Ali Taleb Zadeh
    Saad, Walid
    Bennis, Mehdi
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [36] EER-RL: Energy-Efficient Routing Based on Reinforcement Learning
    Mutombo, Vially Kazadi
    Lee, Seungyeon
    Lee, Jusuk
    Hong, Jiman
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [37] Energy-Efficient Register Caching with Compiler Assistance
    Jones, Timothy M.
    O'Boyle, Michael F. P.
    Abella, Jaume
    Gonzalez, Antonio
    Ergin, Oguz
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2009, 6 (04) : 13
  • [38] Modular Approach To Energy Efficient Driver Assistance Incorporating Driver Acceptance
    Themann, Philipp
    Eckstein, Lutz
    2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2012, : 1023 - 1028
  • [39] An Energy-Efficient Heterogeneous System for Embedded Learning and Classification
    Majumdar, Abhinandan
    Cadambi, Srihari
    Chakradhar, Srimat T.
    IEEE EMBEDDED SYSTEMS LETTERS, 2011, 3 (01) : 42 - 45
  • [40] Developing HMI components for a driver assistance system for safe speed and safe distance
    Alonso, M.
    Plaza, J.
    Advances in Transportation Studies, 2010, (21) : 5 - 14