STrans-YOLOX: Fusing Swin Transformer and YOLOX for Automatic Pavement Crack Detection

被引：17

作者：

Luo, Hui ^{[1
]}

Li, Jiamin ^{[1
]}

Cai, Lianming ^{[1
]}

Wu, Mingquan ^{[1
]}

机构：

[1] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 03期

基金：

中国国家自然科学基金;

关键词：

pavement crack detection; object detection; Swin Transformer; YOLOX; global guidance attention; multi-scale feature fusion; NMS; complex scenes;

D O I：

10.3390/app13031999

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Automatic pavement crack detection is crucial for reducing road maintenance costs and ensuring transportation safety. Although convolutional neural networks (CNNs) have been widely used in automatic pavement crack detection, they cannot adequately model the long-range dependencies between pixels and easily lose edge detail information in complex scenes. Moreover, irregular crack shapes also make the detection task challenging. To address these issues, an automatic pavement crack detection architecture named STrans-YOLOX is proposed. Specifically, the architecture first exploits the CNN backbone to extract feature information, preserving the local modeling ability of the CNN. Then, Swin Transformer is introduced to enhance the long-range dependencies through a self-attention mechanism by supplying each pixel with global features. A new global attention guidance module (GAGM) is used to ensure effective information propagation in the feature pyramid network (FPN) by using high-level semantic information to guide the low-level spatial information, thereby enhancing the multi-class and multi-scale features of cracks. During the post-processing stage, we utilize alpha-IoU-NMS to achieve the accurate suppression of the detection boxes in the case of occlusion and overlapping objects by introducing an adjustable power parameter. The experiments demonstrate that the proposed STrans-YOLOX achieves 63.37% mAP and surpasses the state-of-the-art models on the challenging pavement crack dataset.

引用

页数：17

共 50 条

[21] A novel transformer-based network with attention mechanism for automatic pavement crack detection
Guo, Feng
Liu, Jian
Lv, Chengshun
Yu, Huayang
CONSTRUCTION AND BUILDING MATERIALS, 2023, 391
[22] The Combination of Transformer and You Only Look Once for Automatic Concrete Pavement Crack Detection
Zheng, Xin
Qian, Songrong
Wei, Shaodong
Zhou, Shiyun
Hou, Yi
APPLIED SCIENCES-BASEL, 2023, 13 (16):
[23] Pavement crack detection based on transformer network
Guo, Feng
Qian, Yu
Liu, Jian
Yu, Huayang
AUTOMATION IN CONSTRUCTION, 2023, 145
[24] Pavement Crack Detection Based on the Improved Swin-Unet Model
Chen, Song
Feng, Zhixuan
Xiao, Guangqing
Chen, Xilong
Gao, Chuxiang
Zhao, Mingming
Yu, Huayang
BUILDINGS, 2024, 14 (05)
[25] Automatic Detection of Water Supply Pipe Defects Based on Underwater Image Enhancement and Improved YOLOX
Su, Changwang
Hu, Shaowei
Zhang, Haifen
Pan, Fuqu
Shan, Changxi
Qi, Hao
JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2024, 150 (10)
[26] Sw-YoloX: An anchor-free detector based transformer for sea surface object detection
Ding, Jiangang
Li, Wei
Pei, Lili
Yang, Ming
Ye, Chao
Yuan, Bo
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
[27] Crack Tree: Automatic crack detection from pavement images
Zou, Qin
Cao, Yu
Li, Qingquan
Mao, Qingzhou
Wang, Song
PATTERN RECOGNITION LETTERS, 2012, 33 (03) : 227 - 238
[28] Crack _PSTU: Crack detection based on the U-Net framework combined with Swin Transformer
Lu, Weizhong
Qian, Meiling
Xia, Yiyi
Lu, Yiming
Shen, Jiyun
Fu, Qiming
Lu, You
STRUCTURES, 2024, 62
[29] Road Pavement Crack Automatic Detection by MMS Images
Mancini, A.
Malinverni, E. S.
Frontoni, E.
Zingaretti, P.
2013 21ST MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2013, : 1589 - 1596
[30] An Automatic Pavement Crack Detection System with FocusCrack Dataset
Yan, Xinyun
Shi, Shang
Xu, Xiaohu
He, Zhengran
Zhou, Xiaofeng
Wang, Chishe
Lu, Zhiyi
2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,

← 1 2 3 4 5 →