Guiding Attention in End-to-End Driving Models

Authors
Porres, Diego [1]
Xiao, Yi [1]
Villalonga, Gabriel [1]
Levy, Alexandre [1]
Lopez, Antonio M. [1, 2]
Affiliations
[1] Univ Autonoma Barcelona UAB, Comp Vis Ctr CVC, Barcelona, Spain
[2] Univ Autonoma Barcelona UAB, Dept Ciencies Computac, Barcelona, Spain
DOI
10.1109/IV55156.2024.10588598
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Vision-based end-to-end driving models trained by imitation learning can lead to affordable solutions for autonomous driving. However, training well-performing models of this kind usually requires a huge amount of data, and the resulting models still lack explicit and intuitive activation maps that reveal their inner workings while driving. In this paper, we study how to guide the attention of these models to improve their driving quality and obtain more intuitive activation maps by adding a loss term during training using salient semantic maps. In contrast to previous work, our method does not require these salient semantic maps to be available at test time, nor does it require modifying the architecture of the model to which it is applied. We perform tests using both perfect and noisy salient semantic maps, the latter inspired by possible errors encountered with real data, with encouraging results in both cases. Using CIL++ as a representative state-of-the-art model and the CARLA simulator with its standard benchmarks, we conduct experiments that show the effectiveness of our method in training better autonomous driving models, especially when data and computational resources are scarce.
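The idea of adding an attention-guiding loss term during training can be illustrated with a minimal sketch. The abstract does not state the exact form of the loss, so everything below is an assumption made for illustration: the L1 distance between sum-normalized maps, the L1 action loss, the `weight` parameter, and all function names are hypothetical and are not taken from the paper or from CIL++.

```python
import numpy as np


def normalize_map(m, eps=1e-8):
    """Scale a non-negative 2D map so that its entries sum to (about) 1."""
    m = np.asarray(m, dtype=np.float64)
    return m / (m.sum() + eps)


def attention_guidance_loss(attention, saliency_mask):
    """L1 distance between the normalized attention map and the normalized
    salient semantic map. Assumed form, chosen only for illustration."""
    a = normalize_map(attention)
    s = normalize_map(saliency_mask)
    return float(np.abs(a - s).sum())


def total_loss(action_pred, action_target, attention, saliency_mask, weight=1.0):
    """Imitation (action) loss plus a weighted attention-guidance term.
    The salient semantic map is only consumed here, at training time."""
    pred = np.asarray(action_pred, dtype=np.float64)
    target = np.asarray(action_target, dtype=np.float64)
    action_loss = float(np.abs(pred - target).mean())
    return action_loss + weight * attention_guidance_loss(attention, saliency_mask)


# Hypothetical 2x2 example: the salient region is the right-hand column.
mask = np.array([[0.0, 1.0], [0.0, 1.0]])
aligned = np.array([[0.0, 2.0], [0.0, 2.0]])    # attention on the salient region
misplaced = np.array([[1.0, 0.0], [1.0, 0.0]])  # attention on the wrong region

loss_aligned = attention_guidance_loss(aligned, mask)
loss_misplaced = attention_guidance_loss(misplaced, mask)
combined = total_loss([0.1, 0.5], [0.0, 0.5], misplaced, mask, weight=0.5)
```

In this sketch the guidance term is close to zero when attention falls on the salient region and grows when it does not. Because the salient semantic map enters only through the loss, a model trained this way would need no such map at test time and no architectural change, which matches the property described in the abstract.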
Pages: 2353-2360
Number of pages: 8