STrans-YOLOX: Fusing Swin Transformer and YOLOX for Automatic Pavement Crack Detection

被引:17
|
作者
Luo, Hui [1 ]
Li, Jiamin [1 ]
Cai, Lianming [1 ]
Wu, Mingquan [1 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期
基金
中国国家自然科学基金;
关键词
pavement crack detection; object detection; Swin Transformer; YOLOX; global guidance attention; multi-scale feature fusion; NMS; complex scenes;
D O I
10.3390/app13031999
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic pavement crack detection is crucial for reducing road maintenance costs and ensuring transportation safety. Although convolutional neural networks (CNNs) have been widely used in automatic pavement crack detection, they cannot adequately model the long-range dependencies between pixels and easily lose edge detail information in complex scenes. Moreover, irregular crack shapes also make the detection task challenging. To address these issues, an automatic pavement crack detection architecture named STrans-YOLOX is proposed. Specifically, the architecture first exploits the CNN backbone to extract feature information, preserving the local modeling ability of the CNN. Then, Swin Transformer is introduced to enhance the long-range dependencies through a self-attention mechanism by supplying each pixel with global features. A new global attention guidance module (GAGM) is used to ensure effective information propagation in the feature pyramid network (FPN) by using high-level semantic information to guide the low-level spatial information, thereby enhancing the multi-class and multi-scale features of cracks. During the post-processing stage, we utilize alpha-IoU-NMS to achieve the accurate suppression of the detection boxes in the case of occlusion and overlapping objects by introducing an adjustable power parameter. The experiments demonstrate that the proposed STrans-YOLOX achieves 63.37% mAP and surpasses the state-of-the-art models on the challenging pavement crack dataset.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] ST-YOLOX: a lightweight and accurate object detection network based on Swin Transformer
    Jingjing Han
    Guangqi Yang
    Hongyang Wei
    Weijun Gong
    Yurong Qian
    The Journal of Supercomputing, 2024, 80 : 8038 - 8059
  • [2] ST-YOLOX: a lightweight and accurate object detection network based on Swin Transformer
    Han, Jingjing
    Yang, Guangqi
    Wei, Hongyang
    Gong, Weijun
    Qian, Yurong
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (06): : 8038 - 8059
  • [3] Obstacle detection: improved YOLOX-S based on swin transformer-tiny
    Zhang, Hongying
    Lu, Chengjian
    Chen, Enyao
    OPTOELECTRONICS LETTERS, 2023, 19 (11) : 698 - 704
  • [4] Obstacle detection: improved YOLOX-S based on swin transformer-tiny
    ZHANG Hongying
    LU Chengjian
    CHEN Enyao
    Optoelectronics Letters, 2023, 19 (11) : 698 - 704
  • [5] Obstacle detection: improved YOLOX-S based on swin transformer-tiny
    Hongying Zhang
    Chengjian Lu
    Enyao Chen
    Optoelectronics Letters, 2023, 19 : 698 - 704
  • [6] Foreign object detection for transmission lines based on Swin Transformer V2 and YOLOX
    Chaoli Tang
    Huiyuan Dong
    Yourui Huang
    Tao Han
    Mingshuai Fang
    Jiahao Fu
    The Visual Computer, 2024, 40 : 3003 - 3021
  • [7] Foreign object detection for transmission lines based on Swin Transformer V2 and YOLOX
    Tang, Chaoli
    Dong, Huiyuan
    Huang, Yourui
    Han, Tao
    Fang, Mingshuai
    Fu, Jiahao
    VISUAL COMPUTER, 2024, 40 (05): : 3003 - 3021
  • [8] 基于Swin-Transformer的YOLOX交通标志检测
    嵇文
    刘全金
    黄崇文
    杨瑞
    黄汇磊
    徐光豪
    无线电通信技术, 2023, 49 (03) : 547 - 555
  • [9] 基于YOLOX和Swin Transformer的车载红外目标检测
    楼哲航
    罗素云
    红外技术, 2022, 44 (11) : 1167 - 1175
  • [10] Automatic Pavement Crack Detection Fusing Attention Mechanism
    Ren, Junhua
    Zhao, Guowu
    Ma, Yadong
    Zhao, De
    Liu, Tao
    Yan, Jun
    ELECTRONICS, 2022, 11 (21)