GM-DETR: Research on a Defect Detection Method Based on Improved DETR

被引:3
|
作者
Liu, Xin [1 ]
Yang, Xudong [1 ]
Shao, Lianhe [1 ]
Wang, Xihan [1 ]
Gao, Quanli [1 ]
Shi, Hongbo [2 ]
机构
[1] Xian Polytech Univ, State & Local Joint Engn Res Ctr Adv Networking &, Sch Comp Sci, Xian 710048, Peoples R China
[2] Shaanxi Prov Inst Water Resources & Elect Power In, Xian 710001, Peoples R China
关键词
transformer; DETR; GAM; defect detection; REAL-TIME DETECTION; SURFACE-DEFECTS;
D O I
10.3390/s24113610
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Defect detection is an indispensable part of the industrial intelligence process. The introduction of the DETR model marked the successful application of a transformer for defect detection, achieving true end-to-end detection. However, due to the complexity of defective backgrounds, low resolutions can lead to a lack of image detail control and slow convergence of the DETR model. To address these issues, we proposed a defect detection method based on an improved DETR model, called the GM-DETR. We optimized the DETR model by integrating GAM global attention with CNN feature extraction and matching features. This optimization process reduces the defect information diffusion and enhances the global feature interaction, improving the neural network's performance and ability to recognize target defects in complex backgrounds. Next, to filter out unnecessary model parameters, we proposed a layer pruning strategy to optimize the decoding layer, thereby reducing the model's parameter count. In addition, to address the issue of poor sensitivity of the original loss function to small differences in defect targets, we replaced the L1 loss in the original loss function with MSE loss to accelerate the network's convergence speed and improve the model's recognition accuracy. We conducted experiments on a dataset of road pothole defects to further validate the effectiveness of the GM-DETR model. The results demonstrate that the improved model exhibits better performance, with an increase in average precision of 4.9% (mAP@0.5), while reducing the parameter count by 12.9%.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] RT-DETR-Tomato: Tomato Target Detection Algorithm Based on Improved RT-DETR for Agricultural Safety Production
    Zhao, Zhimin
    Chen, Shuo
    Ge, Yuheng
    Yang, Penghao
    Wang, Yunkun
    Song, Yunsheng
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [22] Railway rutting defects detection based on improved RT-DETR
    Yu, Chenghai
    Chen, Xiangwei
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (04)
  • [23] SA-DETR: Saliency Attention-based DETR for salient object detection
    Nam, Kwangwoon
    Kim, Jeeheon
    Kim, Heeyeon
    Chung, Minyoung
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [24] LR-DETR: a lightweight real-time traffic sign detection model based on improved RT-DETR
    Zhang, Longzhen
    Wang, Mingyang
    Zhao, Xianhao
    Wang, Xianjie
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2025, 22 (02)
  • [25] Surface defect detection method of aluminium alloy castings based on data enhancement and CRT-DETR
    Li, Long
    Gao, Mingyang
    Tian, Xing
    Wang, Chengjun
    Yu, Jiangli
    IET IMAGE PROCESSING, 2024, 18 (13) : 4275 - 4286
  • [26] Wood Veneer Defect Detection Based on Multiscale DETR with Position Encoder Net
    Ge, Yilin
    Jiang, Dapeng
    Sun, Liping
    SENSORS, 2023, 23 (10)
  • [27] Lightweight tomato ripeness detection algorithm based on the improved RT-DETR
    Wang, Sen
    Jiang, Huiping
    Yang, Jixiang
    Ma, Xuan
    Chen, Jiamin
    Li, Zhongjie
    Tang, Xingqun
    FRONTIERS IN PLANT SCIENCE, 2024, 15
  • [28] PAPER DFAM-DETR: Deformable Feature Based Attention Mechanism DETR on Slender Object Detection
    Wen, Feng
    Wang, Mei
    Hu, Xiaojie
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (03) : 401 - 409
  • [29] DETR-ORD: An Improved DETR Detector for Oriented Remote Sensing Object Detection with Feature Reconstruction and Dynamic Query
    He, Xiaohai
    Liang, Kaiwen
    Zhang, Weimin
    Li, Fangxing
    Jiang, Zhou
    Zuo, Zhengqing
    Tan, Xinyan
    REMOTE SENSING, 2024, 16 (18)
  • [30] IDPD: improved deformable-DETR for crowd pedestrian detection
    Wenjing Han
    Ning He
    Xin Wang
    Fengxi Sun
    Shengjie Liu
    Signal, Image and Video Processing, 2024, 18 : 2243 - 2253