SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving

被引:31
|
作者
Tian, Daxin [1 ]
Lin, Chunmian [1 ]
Zhou, Jianshan [1 ]
Duan, Xuting [1 ]
Cao, Yue [1 ]
Zhao, Dezong [2 ]
Cao, Dongpu [3 ]
机构
[1] Beihang Univ, Beijing Adv Innovat Ctr Big Data & Brain Comp, Sch Transportat Sci & Engn, Beijing Key Lab Cooperat Vehicle Infrastruct Syst, Beijing 100191, Peoples R China
[2] Loughborough Univ, Dept Aeronaut & Automot Engn, Loughborough LE11 3TU, Leics, England
[3] Univ Waterloo, Dept Mech & Mechatron Engn, Waterloo, ON N2L 3G1, Canada
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Autonomous driving; object detection; attention mechanism; deep learning; YOLOv3; intelligent transportation systems; VEHICLE DETECTION;
D O I
10.1109/TITS.2020.3041278
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Object detection is becoming increasingly significant for autonomous-driving system. However, poor accuracy or low inference performance limits current object detectors in applying to autonomous driving. In this work, a fast and accurate object detector termed as SA-YOLOv3, is proposed by introducing dilated convolution and self-attention module (SAM) into the architecture of YOLOv3. Furthermore, loss function based on GIoU and focal loss is reconstructed to further optimize detection performance. With an input size of 512 x 512, our proposed SA-YOLOv3 improves YOLOv3 by 2.58 mAP and 2.63 mAP on KITTI and BDD100K benchmarks, with real-time inference (more than 40 FPS). When compared with other state-of-the-art detectors, it reports better trade-off in terms of detection accuracy and speed, indicating the suitability for autonomous-driving application. To our hest knowledge, it is the first method that incorporates YOLOv3 with attention mechanism, and we expect this work would guide for autonomous-driving research in the future.
引用
收藏
页码:4099 / 4110
页数:12
相关论文
共 29 条
  • [1] Gaussian YOLOv3: An Accurate and Fast Object Detector Using Localization Uncertainty for Autonomous Driving
    Choi, Jiwoong
    Chun, Dayoung
    Kim, Hyun
    Lee, Hyuk-Jae
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 502 - 511
  • [2] YOLO-SA: An Efficient Object Detection Model Based on Self-attention Mechanism
    Li, Ang
    Song, Xiangyu
    Sun, ShiJie
    Zhang, Zhaoyang
    Cai, Taotao
    Song, Huansheng
    [J]. WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 1 - 15
  • [3] Multi-Camera 3D Object Detection for Autonomous Driving Using Deep Learning and Self-Attention Mechanism
    Hazarika, Ananya
    Vyas, Amit
    Rahmati, Mehdi
    Wang, Yan
    [J]. IEEE ACCESS, 2023, 11 : 64608 - 64620
  • [4] Self-Attention blocks in UNet and FCN for accurate semantic segmentation of difficult object classes in autonomous driving
    Mousavi, Seyed-Hamid
    Yow, Kin-Choong
    [J]. 2023 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE, 2023,
  • [5] Fast and accurate object detector for autonomous driving based on improved YOLOv5
    Xiang Jia
    Ying Tong
    Hongming Qiao
    Man Li
    Jiangang Tong
    Baoling Liang
    [J]. Scientific Reports, 13
  • [6] Fast and accurate object detector for autonomous driving based on improved YOLOv5
    Jia, Xiang
    Tong, Ying
    Qiao, Hongming
    Li, Man
    Tong, Jiangang
    Liang, Baoling
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving
    Cai, Yingfeng
    Luan, Tianyu
    Gao, Hongbo
    Wang, Hai
    Chen, Long
    Li, Yicheng
    Sotelo, Miguel Angel
    Li, Zhixiong
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [8] SEMANTIC IMAGES SEGMENTATION FOR AUTONOMOUS DRIVING USING SELF-ATTENTION KNOWLEDGE DISTILLATION
    Karine, Ayoub
    Napoleon, Thibault
    Jridi, Maher
    [J]. 2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 198 - 202
  • [9] Real-Time Semantic Segmentation with Dual Encoder and Self-Attention Mechanism for Autonomous Driving
    Chang, Yu-Bang
    Tsai, Chieh
    Lin, Chang-Hong
    Chen, Poki
    [J]. SENSORS, 2021, 21 (23)
  • [10] LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving
    Meyer, Gregory P.
    Laddha, Ankit
    Kee, Eric
    Vallespi-Gonzalez, Carlos
    Wellington, Carl K.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12669 - 12678