Pavement Distress Detection Using Street View Images Captured via Action Camera

被引:10
|
作者
Liu, Yuchen [1 ]
Liu, Fang [2 ]
Liu, Wei [1 ]
Huang, Yucheng [1 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215031, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Acad Creat Technol, Suzhou 215123, Peoples R China
关键词
Feature extraction; Roads; Computational modeling; Object detection; Cameras; Task analysis; Neck; Pavement distress; YOLOv5; shuffle attention; swin-transformer; transfer learning;
D O I
10.1109/TITS.2023.3306578
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Timely and accurately detection as well as rehabilitation of road surface defects are of utmost importance for ensuring road safety and minimizing maintenance cost. However, the variety of pavement distress types and forms makes it difficult to accurately classify and detect them. To tackle the issue, this paper proposes a novel target detection model YOLO-SST based on YOLOv5 with the improvement in pavement distress features. First, a Shuffle Attention mechanism is introduced in the feature extraction backbone network to enhance the detection ability without significantly increasing the computational cost. Secondly, we add a detection layer and embed Swin-Transformer encoder blocks into the C3 module to capture global and contextual information. Finally, to improve the model's detection ability, transfer learning is employed on a self-made dataset called RDDdect_2023, which consists of street view images captured via a DJI Action camera mounted on the car. Experimental results demonstrate that the YOLO-SST model outperforms YOLOv5 and other target detection models in terms of accuracy, recall rate, and mAP@0.5 value for detecting pavement distresses. This confirms that the YOLO-SST model has stronger feature extraction and fusion capabilities, resulting in better detection performance.
引用
收藏
页码:738 / 747
页数:10
相关论文
共 50 条
  • [21] Building instance classification using street view images
    Kang, Jian
    Koerner, Marco
    Wang, Yuanyuan
    Taubenboeck, Hannes
    Zhu, Xiao Xiang
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 44 - 59
  • [22] Front-View Vehicle Damage Detection using Roadway Surveillance Camera Images
    Balci, Burak
    Artan, Yusuf
    Alkan, Bensu
    Elihos, Alperen
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS 2019), 2019, : 193 - 198
  • [23] A novel approach for pavement distress detection and quantification using RGB-D camera and deep learning algorithm
    Lin, Wuguang
    Li, Xiaolong
    Han, Hao
    Yu, Qifeng
    Cho, Yoon-Ho
    CONSTRUCTION AND BUILDING MATERIALS, 2023, 407
  • [24] Pavement Distress Detection Using Random Decision Forests
    Cui, Limeng
    Qi, Zhiquan
    Chen, Zhensong
    Meng, Fan
    Shi, Yong
    DATA SCIENCE, 2015, 9208 : 95 - 102
  • [25] Text-Line Detection in Camera-Captured Document Images Using the State Estimation of Connected Components
    Koo, Hyung Il
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (11) : 5358 - 5368
  • [26] Automatic large scale detection of red palm weevil infestation using street view images
    Kagan, Dima
    Alpert, Galit Fuhrmann
    Fire, Michael
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 182 : 122 - 133
  • [27] Deep learning-based visual crack detection using Google Street View images
    Maniat, Mohsen
    Camp, Charles V.
    Kashani, Ali R.
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21): : 14565 - 14582
  • [28] Deep learning-based visual crack detection using Google Street View images
    Mohsen Maniat
    Charles V. Camp
    Ali R. Kashani
    Neural Computing and Applications, 2021, 33 : 14565 - 14582
  • [29] New trends on moving object detection in video images captured by a moving camera: A survey
    Yazdi, Mehran
    Bouwmans, Thierry
    COMPUTER SCIENCE REVIEW, 2018, 28 : 157 - 177
  • [30] Fast watermark detection scheme from camera-captured images on mobile phones
    Nakamura, Takao
    Katayama, Atsushi
    Yamamuro, Masashi
    Sonehara, Noboru
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2006, 20 (04) : 543 - 564