STrans-YOLOX: Fusing Swin Transformer and YOLOX for Automatic Pavement Crack Detection

被引:17
|
作者
Luo, Hui [1 ]
Li, Jiamin [1 ]
Cai, Lianming [1 ]
Wu, Mingquan [1 ]
机构
[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期
基金
中国国家自然科学基金;
关键词
pavement crack detection; object detection; Swin Transformer; YOLOX; global guidance attention; multi-scale feature fusion; NMS; complex scenes;
D O I
10.3390/app13031999
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Automatic pavement crack detection is crucial for reducing road maintenance costs and ensuring transportation safety. Although convolutional neural networks (CNNs) have been widely used in automatic pavement crack detection, they cannot adequately model the long-range dependencies between pixels and easily lose edge detail information in complex scenes. Moreover, irregular crack shapes also make the detection task challenging. To address these issues, an automatic pavement crack detection architecture named STrans-YOLOX is proposed. Specifically, the architecture first exploits the CNN backbone to extract feature information, preserving the local modeling ability of the CNN. Then, Swin Transformer is introduced to enhance the long-range dependencies through a self-attention mechanism by supplying each pixel with global features. A new global attention guidance module (GAGM) is used to ensure effective information propagation in the feature pyramid network (FPN) by using high-level semantic information to guide the low-level spatial information, thereby enhancing the multi-class and multi-scale features of cracks. During the post-processing stage, we utilize alpha-IoU-NMS to achieve the accurate suppression of the detection boxes in the case of occlusion and overlapping objects by introducing an adjustable power parameter. The experiments demonstrate that the proposed STrans-YOLOX achieves 63.37% mAP and surpasses the state-of-the-art models on the challenging pavement crack dataset.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A novel transformer-based network with attention mechanism for automatic pavement crack detection
    Guo, Feng
    Liu, Jian
    Lv, Chengshun
    Yu, Huayang
    CONSTRUCTION AND BUILDING MATERIALS, 2023, 391
  • [22] The Combination of Transformer and You Only Look Once for Automatic Concrete Pavement Crack Detection
    Zheng, Xin
    Qian, Songrong
    Wei, Shaodong
    Zhou, Shiyun
    Hou, Yi
    APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [23] Pavement crack detection based on transformer network
    Guo, Feng
    Qian, Yu
    Liu, Jian
    Yu, Huayang
    AUTOMATION IN CONSTRUCTION, 2023, 145
  • [24] Pavement Crack Detection Based on the Improved Swin-Unet Model
    Chen, Song
    Feng, Zhixuan
    Xiao, Guangqing
    Chen, Xilong
    Gao, Chuxiang
    Zhao, Mingming
    Yu, Huayang
    BUILDINGS, 2024, 14 (05)
  • [25] Automatic Detection of Water Supply Pipe Defects Based on Underwater Image Enhancement and Improved YOLOX
    Su, Changwang
    Hu, Shaowei
    Zhang, Haifen
    Pan, Fuqu
    Shan, Changxi
    Qi, Hao
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2024, 150 (10)
  • [26] Sw-YoloX: An anchor-free detector based transformer for sea surface object detection
    Ding, Jiangang
    Li, Wei
    Pei, Lili
    Yang, Ming
    Ye, Chao
    Yuan, Bo
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
  • [27] Crack Tree: Automatic crack detection from pavement images
    Zou, Qin
    Cao, Yu
    Li, Qingquan
    Mao, Qingzhou
    Wang, Song
    PATTERN RECOGNITION LETTERS, 2012, 33 (03) : 227 - 238
  • [28] Crack _PSTU: Crack detection based on the U-Net framework combined with Swin Transformer
    Lu, Weizhong
    Qian, Meiling
    Xia, Yiyi
    Lu, Yiming
    Shen, Jiyun
    Fu, Qiming
    Lu, You
    STRUCTURES, 2024, 62
  • [29] Road Pavement Crack Automatic Detection by MMS Images
    Mancini, A.
    Malinverni, E. S.
    Frontoni, E.
    Zingaretti, P.
    2013 21ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2013, : 1589 - 1596
  • [30] An Automatic Pavement Crack Detection System with FocusCrack Dataset
    Yan, Xinyun
    Shi, Shang
    Xu, Xiaohu
    He, Zhengran
    Zhou, Xiaofeng
    Wang, Chishe
    Lu, Zhiyi
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,