Pavement Distress Detection Using Street View Images Captured via Action Camera

Cited by: 10
Authors
Liu, Yuchen [1]
Liu, Fang [2]
Liu, Wei [1]
Huang, Yucheng [1]
Affiliations
[1] Soochow Univ, Sch Rail Transportat, Suzhou 215031, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Acad Creat Technol, Suzhou 215123, Peoples R China
Keywords
Feature extraction; Roads; Computational modeling; Object detection; Cameras; Task analysis; Neck; Pavement distress; YOLOv5; shuffle attention; swin-transformer; transfer learning
DOI
10.1109/TITS.2023.3306578
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
Timely and accurate detection and rehabilitation of road surface defects are of utmost importance for ensuring road safety and minimizing maintenance costs. However, the variety of pavement distress types and forms makes them difficult to classify and detect accurately. To tackle this issue, this paper proposes YOLO-SST, a novel target detection model based on YOLOv5 with improvements targeting pavement distress features. First, a Shuffle Attention mechanism is introduced into the feature extraction backbone network to enhance detection ability without significantly increasing computational cost. Second, a detection layer is added and Swin-Transformer encoder blocks are embedded into the C3 module to capture global and contextual information. Finally, to further improve the model's detection ability, transfer learning is employed on a self-made dataset called RDDdect_2023, which consists of street view images captured via a DJI Action camera mounted on a car. Experimental results demonstrate that the YOLO-SST model outperforms YOLOv5 and other target detection models in terms of accuracy, recall rate, and mAP@0.5 for detecting pavement distresses. This confirms that the YOLO-SST model has stronger feature extraction and fusion capabilities, resulting in better detection performance.
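The mAP@0.5 metric named in the abstract rests on a matching rule: a predicted box counts as a correct detection when its intersection over union (IoU) with a ground-truth box of the same class is at least 0.5. The following is a minimal sketch of that rule only, assuming boxes given as (x1, y1, x2, y2) corner coordinates. The function names `iou` and `is_true_positive` are hypothetical and are not taken from the paper or from YOLOv5.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    # Corners of the overlapping rectangle, if any.
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero so disjoint boxes yield no intersection area.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, truth_box, threshold=0.5):
    """Assumed matching rule behind mAP@0.5: IoU at or above the threshold."""
    return iou(pred_box, truth_box) >= threshold
```

Average precision is then computed per class from the precision-recall curve built with this rule, and mAP is the mean over classes; that part is omitted here.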
Pages: 738-747
Page count: 10