Pavement Distress Detection Using Street View Images Captured via Action Camera

被引:10
|
作者
Liu, Yuchen [1 ]
Liu, Fang [2 ]
Liu, Wei [1 ]
Huang, Yucheng [1 ]
机构
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215031, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Acad Creat Technol, Suzhou 215123, Peoples R China
关键词
Feature extraction; Roads; Computational modeling; Object detection; Cameras; Task analysis; Neck; Pavement distress; YOLOv5; shuffle attention; swin-transformer; transfer learning;
D O I
10.1109/TITS.2023.3306578
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Timely and accurately detection as well as rehabilitation of road surface defects are of utmost importance for ensuring road safety and minimizing maintenance cost. However, the variety of pavement distress types and forms makes it difficult to accurately classify and detect them. To tackle the issue, this paper proposes a novel target detection model YOLO-SST based on YOLOv5 with the improvement in pavement distress features. First, a Shuffle Attention mechanism is introduced in the feature extraction backbone network to enhance the detection ability without significantly increasing the computational cost. Secondly, we add a detection layer and embed Swin-Transformer encoder blocks into the C3 module to capture global and contextual information. Finally, to improve the model's detection ability, transfer learning is employed on a self-made dataset called RDDdect_2023, which consists of street view images captured via a DJI Action camera mounted on the car. Experimental results demonstrate that the YOLO-SST model outperforms YOLOv5 and other target detection models in terms of accuracy, recall rate, and mAP@0.5 value for detecting pavement distresses. This confirms that the YOLO-SST model has stronger feature extraction and fusion capabilities, resulting in better detection performance.
引用
收藏
页码:738 / 747
页数:10
相关论文
共 50 条
  • [41] Episode detection in videos captured using a head-mounted camera
    Aneesh Chauhan
    Sameer Singh
    Dave Grosvenor
    Pattern Analysis and Applications, 2004, 7 : 176 - 189
  • [42] Developing Sidewalk Inventory Data Using Street View Images
    Kang, Bumjoon
    Lee, Sangwon
    Zou, Shengyuan
    SENSORS, 2021, 21 (09)
  • [43] Episode detection in videos captured using a head-mounted camera
    Chauhan, A
    Singh, S
    Grosvenor, D
    PATTERN ANALYSIS AND APPLICATIONS, 2004, 7 (02) : 176 - 189
  • [44] Research on Methods of Pavement Distress Detection using Convolutional Neural Network based on Highway Rapid Inspection Images
    Shu, Donglin
    Deng, Wenhao
    Li, Zibing
    Zhao, Chihang
    Wu, Jialun
    Zhao, Yong
    Zhang, Ziyi
    Huang, Yaxin
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 623 - 627
  • [45] Automatic pavement distress severity detection using deep learning
    Valipour, Parisa Setayesh
    Golroo, Amir
    Kheirati, Afarin
    Fahmani, Mohammadsadegh
    Amani, Mohammad Javad
    ROAD MATERIALS AND PAVEMENT DESIGN, 2024, 25 (08) : 1830 - 1846
  • [46] Automated Pavement Distress Detection Using Image Processing Techniques
    Abbas, Iman Hashim
    Ismael, Mohammed Qadir
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2021, 11 (05) : 7702 - 7708
  • [47] Detection of Biotic or Abiotic Stress in Vineyards Using Thermal and RGB Images Captured via IoT Sensors
    Fevgas, Georgios
    Lagkas, Thomas
    Argyriou, Vasileios
    Sarigiannidis, Panagiotis
    IEEE ACCESS, 2023, 11 : 105902 - 105915
  • [48] Block-based feature detection and matching for mosaicing of camera-captured document images
    Kasar, T.
    Ramakrishnan, A. G.
    TENCON 2007 - 2007 IEEE REGION 10 CONFERENCE, VOLS 1-3, 2007, : 1280 - 1283
  • [49] Detection of Wet-Road Conditions from Images Captured by a Vehicle-Mounted Camera
    Yamada, Muneo
    Ueda, Koji
    Horiba, Isao
    Yamamoto, Shin
    Tsugawa, Sadayuki
    JOURNAL OF ROBOTICS AND MECHATRONICS, 2005, 17 (03) : 269 - 276
  • [50] Mobile camera localization using aerial-view images
    Toriya, Hisatoshi
    Kitahara, Itaru
    Ohta, Yuichi
    IPSJ Transactions on Computer Vision and Applications, 2014, 6 : 111 - 119