Transformer-Based Sensor Fusion for Autonomous Vehicles: A Comprehensive Review

被引:0
|
作者
Abdulmaksoud, Ahmed [1 ]
Ahmed, Ryan [1 ]
机构
[1] McMaster Univ, Ctr Mechatron & Hybrid Technol CMHT, Dept Mech Engn, Hamilton, ON L8S 4L8, Canada
来源
IEEE ACCESS | 2025年 / 13卷
关键词
Sensor fusion; Transformers; Laser radar; Autonomous vehicles; Cameras; Computational modeling; Reviews; Three-dimensional displays; Object detection; Location awareness; Autonomous driving; artificial intelligence (AI); computer vision; deep learning; machine learning; sensor fusion; transformers;
D O I
10.1109/ACCESS.2025.3545032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sensor fusion is vital for many critical applications, including robotics, autonomous driving, aerospace, and beyond. Integrating data streams from different sensors enables us to overcome the intrinsic limitations of each sensor, providing more reliable measurements and reducing uncertainty. Moreover, deep learning-based sensor fusion unlocked the possibility of multimodal learning, which utilizes different sensor modalities to boost object detection. Yet, adverse weather conditions remain a significant challenge to the reliability of sensor fusion. However, introducing the Transformer deep learning model in sensor fusion presents a promising avenue for advancing its sensing capabilities, potentially overcoming that challenge. Transformer models proved powerful in modeling vision, language, and numerous other domains. However, these models suffer from high latency and heavy computation requirements. This paper aims to provide: 1) an extensive overview of sensor fusion and transformer models; 2) an in-depth survey of the state-of-the-art (SoTA) methods for Transformer-based sensor fusion, focusing on camera-LiDAR and camera-radar methods; and 3) a quantitative analysis of the SoTA methods, uncovering research gaps and stimulating future work.
引用
收藏
页码:41822 / 41838
页数:17
相关论文
共 50 条
  • [41] FormerUnify: Transformer-Based Unified Fusion for Efficient Image Matting
    Wang, Jiaquan
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 412 - 425
  • [42] A Transformer-based Multi-Platform Sequential Estimation Fusion
    Zhai, Xupeng
    Yang, Yanbo
    Liu, Zhunga
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 144
  • [43] Public acceptance and perception of autonomous vehicles: a comprehensive review
    Kareem Othman
    AI and Ethics, 2021, 1 (3): : 355 - 387
  • [44] Comprehensive Review of Traffic Modeling: Towards Autonomous Vehicles
    Lach, Lukasz
    Svyetlichnyy, Dmytro
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [45] Transformer-based Multimodal Information Fusion for Facial Expression Analysis
    Zhang, Wei
    Qiu, Feng
    Wang, Suzhen
    Zeng, Hao
    Zhang, Zhimeng
    An, Rudong
    Ma, Bowen
    Ding, Yu
    arXiv, 2022,
  • [46] A Temporal Transformer-Based Fusion Framework for Morphological Arrhythmia Classification
    Anjum, Nafisa
    Sathi, Khaleda Akhter
    Hossain, Md. Azad
    Dewan, M. Ali Akber
    COMPUTERS, 2023, 12 (03)
  • [47] Transformer-based Multimodal Information Fusion for Facial Expression Analysis
    Zhang, Wei
    Qiu, Feng
    Wang, Suzhen
    Zeng, Hao
    Zhang, Zhimeng
    An, Rudong
    Ma, Bowen
    Ding, Yu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2427 - 2436
  • [48] TFTN: A Transformer-Based Fusion Tracking Framework of Hyperspectral and RGB
    Zhao, Chunhui
    Liu, Hongjiao
    Su, Nan
    Yan, Yiming
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [49] Vision transformer-based autonomous crack detection on asphalt and concrete surfaces
    Shamsabadi, Elyas Asadi
    Xu, Chang
    Rao, Aravinda S.
    Nguyen, Tuan
    Ngo, Tuan
    Dias-da-Costa, Daniel
    AUTOMATION IN CONSTRUCTION, 2022, 140
  • [50] Transformer-Based Reinforcement Learning for Multi-Robot Autonomous Exploration
    Chen, Qihong
    Wang, Rui
    Lyu, Ming
    Zhang, Jie
    SENSORS, 2024, 24 (16)