Spiking ViT: spiking neural networks with transformer-attention for steel surface defect classification

Cited by: 2

Authors:

Gong, Liang [1]

Dong, Hang [1]

Zhang, Xinyu [1]

Cheng, Xin [2]

Ye, Fan [3]

Guo, Liangchao [1]

Ge, Zhenghui [1]

Affiliations:

[1] Yangzhou Univ, Sch Mech Engn, Yangzhou, Jiangsu, Peoples R China

[2] Yangzhou Univ, Sch Informat Engn, Sch Artificial Intelligence, Yangzhou, Jiangsu, Peoples R China

[3] Tongling Nonferrous Met Grp Co Ltd, Tongling, Peoples R China

Source:

JOURNAL OF ELECTRONIC IMAGING | 2024 / Vol. 33 / Issue 03

Funding:

China Postdoctoral Science Foundation;

Keywords:

surface defect classification; vision transformer; spiking neural network; deep learning; data modeling;

DOI:

10.1117/1.JEI.33.3.033001

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Throughout the steel production process, a variety of surface defects inevitably occur. These defects can impair the quality of steel products and reduce manufacturing efficiency. Therefore, it is crucial to study and categorize the multiple defects on the surface of steel strips. Vision transformer (ViT) is a neural network model based on a self-attention mechanism that is widely used in many different disciplines. Conventional ViT ignores the specifics of brain signaling and instead uses activation functions to simulate genuine neurons. One of the fundamental building blocks of a spiking neural network is the leaky integrate-and-fire (LIF) neuron, which has biodynamic characteristics akin to those of a genuine neuron. LIF neurons work in an event-driven manner such that higher performance can be achieved with less power. The goal of this work is to integrate ViT and LIF neurons to build and train an end-to-end hybrid network architecture, spiking vision transformer (S-ViT), for the classification of steel surface defects. The framework builds on the ViT architecture by replacing the activation functions used in ViT with LIF neurons, constructing a global spike feature fusion module, the spiking transformer encoder, which serves as a basic building block of S-ViT, as well as a spiking-MLP classification head that implements the classification functionality. Based on the experimental results, our method has demonstrated outstanding classification performance across all metrics. The overall test accuracies of S-ViT are 99.41%, 99.65%, 99.54%, and 99.77% on NEU-CLSs, and 95.70%, 95.93%, 96.94%, and 97.19% on XSDD. S-ViT achieves superior classification performance compared to convolutional neural networks and recent findings. Its performance is also improved relative to the original ViT model. Furthermore, the robustness test results of S-ViT show that S-ViT still maintains reliable accuracy when recognizing images that contain Gaussian noise.
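
As an illustration of the LIF neuron mentioned in the abstract, the following is a minimal sketch of a common discrete-time leaky integrate-and-fire update with a hard reset. It is not taken from the paper: the function name `lif_forward`, the time constant `tau`, the threshold `v_threshold`, and the reset value `v_reset` are all assumptions chosen for illustration, and the S-ViT implementation may differ.

```python
def lif_forward(inputs, tau=2.0, v_threshold=1.0, v_reset=0.0):
    """Simulate one discrete-time LIF neuron over a sequence of input currents.

    Hypothetical helper for illustration only. Returns one 0/1 spike per step.
    """
    v = v_reset  # membrane potential starts at the reset value
    spikes = []
    for x in inputs:
        # Leaky integration: the potential decays toward v_reset
        # while being driven by the input current x.
        v = v + (x - (v - v_reset)) / tau
        if v >= v_threshold:
            spikes.append(1)  # fire a spike
            v = v_reset       # hard reset after firing
        else:
            spikes.append(0)  # stay silent this step
    return spikes


if __name__ == "__main__":
    # A constant supra-threshold input yields a regular spike train.
    print(lif_forward([1.5, 1.5, 1.5, 1.5]))
```

With these assumed parameters, a weak constant input never reaches the threshold, so the neuron emits no spikes and produces no downstream events, which is the sense in which such neurons are described as event-driven.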


Pages: 20

50 items in total

[1] Attention Spiking Neural Networks
Yao, Man
Zhao, Guangshe
Zhang, Hengyu
Hu, Yifan
Deng, Lei
Tian, Yonghong
Xu, Bo
Li, Guoqi
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9393 - 9410
[2] Transformer-Based Spiking Neural Networks for Multimodal Audiovisual Classification
Guo, Lingyue
Gao, Zeyu
Qu, Jinye
Zheng, Suiwu
Jiang, Runhao
Lu, Yanfeng
Qiao, Hong
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2024, 16 (03) : 1077 - 1086
[3] Spike Attention Coding for Spiking Neural Networks
Liu, Jiawen
Hu, Yifan
Li, Guoqi
Pei, Jing
Deng, Lei
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (12) : 18892 - 18898
[4] Temporal-wise Attention Spiking Neural Networks for Event Streams Classification
Yao, Man
Gao, Huanhuan
Zhao, Guangshe
Wang, Dingheng
Lin, Yihan
Yang, Zhaoxu
Li, Guoqi
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10201 - 10210
[5] Adaptive Feature Self-Attention in Spiking Neural Networks for Hyperspectral Classification
Li, Heng
Tu, Bing
Liu, Bo
Li, Jun
Plaza, Antonio
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[6] Image Classification with Recurrent Spiking Neural Networks
Cureno Ramirez, Andres
Garcia Morgado, Balam
Gerardo de la Fraga, Luis
PATTERN RECOGNITION, MCPR 2024, 2024, 14755 : 368 - 376
[7] File Classification Based on Spiking Neural Networks
Stanojevic, Ana
Cherubini, Giovanni
Moraitis, Timoleon
Sebastian, Abu
2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020
[8] Robustness of classification ability of spiking neural networks
Jie Yang
Pingping Zhang
Yan Liu
Nonlinear Dynamics, 2015, 82 : 723 - 730
[9] Robustness of classification ability of spiking neural networks
Yang, Jie
Zhang, Pingping
Liu, Yan
NONLINEAR DYNAMICS, 2015, 82 (1-2) : 723 - 730
[10] Classification of spiking events with wavelet neural networks
Nazimov, Alexey I.
Pavlov, Alexey N.
DYNAMICS AND FLUCTUATIONS IN BIOMEDICAL PHOTONICS VIII, 2011, 7898
