Spiking ViT: spiking neural networks with transformer-attention for steel surface defect classification

被引:2
|
作者
Gong, Liang [1 ]
Dong, Hang [1 ]
Zhang, Xinyu [1 ]
Cheng, Xin [2 ]
Ye, Fan [3 ]
Guo, Liangchao [1 ]
Ge, Zhenghui [1 ]
机构
[1] Yangzhou Univ, Sch Mech Engn, Yangzhou, Jiangsu, Peoples R China
[2] Yangzhou Univ, Sch Informat Engn, Sch Artificial Intelligence, Yangzhou, Jiangsu, Peoples R China
[3] Tongling Nonferrous Met Grp Co Ltd, Tongling, Peoples R China
基金
中国博士后科学基金;
关键词
surface defect classification; vision transformer; spiking neural network; deep learning; data modeling;
D O I
10.1117/1.JEI.33.3.033001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
. Throughout the steel production process, a variety of surface defects inevitably occur. These defects can impair the quality of steel products and reduce manufacturing efficiency. Therefore, it is crucial to study and categorize the multiple defects on the surface of steel strips. Vision transformer (ViT) is a unique neural network model based on a self-attention mechanism that is widely used in many different disciplines. Conventional ViT ignores the specifics of brain signaling and instead uses activation functions to simulate genuine neurons. One of the fundamental building blocks of a spiking neural network is leaky integration and fire (LIF), which has biodynamic characteristics akin to those of a genuine neuron. LIF neurons work in an event-driven manner such that higher performance can be achieved with less power. The goal of this work is to integrate ViT and LIF neurons to build and train an end-to-end hybrid network architecture, spiking vision transformer (S-ViT), for the classification of steel surface defects. The framework relies on the ViT architecture by replacing the activation functions used in ViT with LIF neurons, constructing a global spike feature fusion module spiking transformer encoder as well as a spiking-MLP classification head for implementing the classification functionality and using it as a basic building block of S-ViT. Based on the experimental results, our method has demonstrated outstanding classification performance across all metrics. The overall test accuracies of S-ViT are 99.41%, 99.65%, 99.54%, and 99.77% on NEU-CLSs, and 95.70%, 95.93%, 96.94%, and 97.19% on XSDD. S-ViT achieves superior classification performance compared to convolutional neural networks and recent findings. Its performance is also improved relative to the original ViT model. Furthermore, the robustness test results of S-ViT show that S-ViT still maintains reliable accuracy when recognizing images that contain Gaussian noise.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Attention Spiking Neural Networks
    Yao, Man
    Zhao, Guangshe
    Zhang, Hengyu
    Hu, Yifan
    Deng, Lei
    Tian, Yonghong
    Xu, Bo
    Li, Guoqi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9393 - 9410
  • [2] Transformer-Based Spiking Neural Networks for Multimodal Audiovisual Classification
    Guo, Lingyue
    Gao, Zeyu
    Qu, Jinye
    Zheng, Suiwu
    Jiang, Runhao
    Lu, Yanfeng
    Qiao, Hong
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1077 - 1086
  • [3] Spike Attention Coding for Spiking Neural Networks
    Liu, Jiawen
    Hu, Yifan
    Li, Guoqi
    Pei, Jing
    Deng, Lei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18892 - 18898
  • [4] Temporal-wise Attention Spiking Neural Networks for Event Streams Classification
    Yao, Man
    Gao, Huanhuan
    Zhao, Guangshe
    Wang, Dingheng
    Lin, Yihan
    Yang, Zhaoxu
    Li, Guoqi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10201 - 10210
  • [5] Adaptive Feature Self-Attention in Spiking Neural Networks for Hyperspectral Classification
    Li, Heng
    Tu, Bing
    Liu, Bo
    Li, Jun
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [6] Image Classification with Recurrent Spiking Neural Networks
    Cureno Ramirez, Andres
    Garcia Morgado, Balam
    Gerardo de la Fraga, Luis
    PATTERN RECOGNITION, MCPR 2024, 2024, 14755 : 368 - 376
  • [7] File Classification Based on Spiking Neural Networks
    Stanojevic, Ana
    Cherubini, Giovanni
    Moraitis, Timoleon
    Abu Sebastian
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [8] Robustness of classification ability of spiking neural networks
    Jie Yang
    Pingping Zhang
    Yan Liu
    Nonlinear Dynamics, 2015, 82 : 723 - 730
  • [9] Robustness of classification ability of spiking neural networks
    Yang, Jie
    Zhang, Pingping
    Liu, Yan
    NONLINEAR DYNAMICS, 2015, 82 (1-2) : 723 - 730
  • [10] Classification of spiking events with wavelet neural networks
    Nazimov, Alexey I.
    Pavlov, Alexey N.
    DYNAMICS AND FLUCTUATIONS IN BIOMEDICAL PHOTONICS VIII, 2011, 7898