Swin Transformer Combined with Convolution Neural Network for Surface Defect Detection

Cited by: 8
Authors
Li, Yinghao [1]
Xiang, Yihao [1]
Guo, Haogong [1]
Liu, Panpan [1]
Liu, Chengming [1]
Affiliations
[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450052, Peoples R China
Keywords
surface defect detection; Swin transformer; convolutional neural networks; flange
DOI
10.3390/machines10111083
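Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract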
Surface defect detection aims to classify and locate defects in a target surface area, and it is an important part of industrial quality inspection. Most research on surface defect detection is currently based on convolutional neural networks (CNNs), which focus on local information and lack global perception. Thus, CNNs are unable to effectively extract defect features. In this paper, a defect detection method based on the Swin transformer is proposed. The structure of the Swin transformer has been fine-tuned so that it has five scales of output, making it more suitable for defect detection tasks with large variations in target size. A bi-directional feature pyramid network is used as the feature fusion part to efficiently fuse the extracted features. Focal loss is used as the loss function to reweight hard- and easy-to-distinguish samples, potentially helping the model fit the surface defect data better. To reduce the number of parameters in the model, a shared detection head is used for result prediction. Experiments were conducted on a flange surface defect dataset and a steel surface defect dataset. Compared with classical CNN-based object detection algorithms, our method improves the mean average precision (mAP) by about 15.4%, while the model size and detection speed are essentially the same as those of the CNN-based methods. The experimental results show that the proposed method is more competitive than CNN-based methods and generalizes to some extent across different types of defects.
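The abstract's point about focal loss weighting hard and easy samples can be illustrated with a minimal sketch of the standard binary focal loss, FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t). This is not the authors' implementation: the function name `focal_loss` and the values alpha = 0.25 and gamma = 2.0 are assumptions chosen for illustration, since the abstract does not state the hyperparameters used.

```python
import math


def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """Binary focal loss for a single prediction (illustrative sketch).

    p     -- predicted probability of the positive (defect) class
    y     -- ground-truth label, 1 (defect) or 0 (background)
    alpha -- class-balance weight (assumed value, not from the paper)
    gamma -- focusing parameter (assumed value, not from the paper)
    """
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)              # clamp to avoid log(0)
    p_t = p if y == 1 else 1.0 - p               # probability given to the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha   # weight for the true class
    # (1 - p_t) ** gamma shrinks the loss of well-classified (easy) samples
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


easy = focal_loss(0.95, 1)  # confident and correct: strongly down-weighted
hard = focal_loss(0.30, 1)  # mostly wrong: keeps most of its loss
print(f"easy={easy:.6f} hard={hard:.6f}")
```

With gamma = 0 and alpha = 0.5 the expression reduces to half the ordinary cross-entropy, which is a convenient sanity check when experimenting with the two parameters.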
Pages: 15
Related papers
50 in total
  • [21] CSwinDoubleU-Net: A double U-shaped network combined with convolution and Swin Transformer for colorectal polyp segmentation
    Lin, Yuanjie
    Han, Xiaoxiang
    Chen, Keyan
    Zhang, Weikun
    Liu, Qiaohong
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 89
  • [22] RoadFormer: Road Extraction Using a Swin Transformer Combined with a Spatial and Channel Separable Convolution
    Liu, Xiangzeng
    Wang, Ziyao
    Wan, Jinting
    Zhang, Juli
    Xi, Yue
    Liu, Ruyi
    Miao, Qiguang
    [J]. REMOTE SENSING, 2023, 15 (04)
  • [23] Transmission Line Insulator Defect Detection Based on Swin Transformer and Context
    Xi, Yu
    Zhou, Ke
    Meng, Ling-Wen
    Chen, Bo
    Chen, Hao-Min
    Zhang, Jing-Yi
    [J]. MACHINE INTELLIGENCE RESEARCH, 2023, 20 (05) : 729 - 740
  • [24] Defect-aware transformer network for intelligent visual surface defect detection
    Shang, Hongbing
    Sun, Chuang
    Liu, Jinxin
    Chen, Xuefeng
    Yan, Ruqiang
    [J]. ADVANCED ENGINEERING INFORMATICS, 2023, 55
  • [25] Transmission Line Insulator Defect Detection Based on Swin Transformer and Context
    Yu Xi
    Ke Zhou
    Ling-Wen Meng
    Bo Chen
    Hao-Min Chen
    Jing-Yi Zhang
    [J]. Machine Intelligence Research, 2023, 20 : 729 - 740
  • [26] Domain adaptation via Transferable Swin Transformer for tire defect detection
    Zhang, Yulong
    Wang, Yilin
    Jiang, Zhiqiang
    Zheng, Li
    Chen, Jinshui
    Lu, Jiangang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [27] Reliable and Lightweight Adaptive Convolution Network for PCB Surface Defect Detection
    Lei, Lei
    Li, Han-Xiong
    Yang, Hai-Dong
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 8
  • [28] Chip Surface Character Detection Based On Convolution Neural Network
    Li, Jiaxin
    Zhang, Hesheng
    Fang, Yubin
    Zhu, Xiaojin
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018 : 2818 - 2823
  • [29] STMG: Swin transformer for multi-label image recognition with graph convolution network
    Wang, Yangtao
    Xie, Yanzhao
    Fan, Lisheng
    Hu, Guangxing
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) : 10051 - 10063
  • [30] STMG: Swin transformer for multi-label image recognition with graph convolution network
    Yangtao Wang
    Yanzhao Xie
    Lisheng Fan
    Guangxing Hu
    [J]. Neural Computing and Applications, 2022, 34 : 10051 - 10063