Variational Attention-Based Interpretable Transformer Network for Rotary Machine Fault Diagnosis

被引:46
|
作者
Li, Yasong [1 ,2 ]
Zhou, Zheng [1 ,2 ]
Sun, Chuang [1 ,2 ]
Chen, Xuefeng [1 ,2 ]
Yan, Ruqiang [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, State Key Lab Mfg Syst Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Mech Engn, Xian 710049, Peoples R China
关键词
Vibrations; Transformers; Analytical models; Fault diagnosis; Feature extraction; Convolutional neural networks; Heating systems; Interpretability; rotary machine fault diagnosis (RMFD); transformer; variational attention; FAULT-DIAGNOSIS;
D O I
10.1109/TNNLS.2022.3202234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning technology provides a promising approach for rotary machine fault diagnosis (RMFD), where vibration signals are commonly utilized as input of a deep network model to reveal the internal state of machinery. However, most existing methods fail to mine association relationships within signals. Unlike deep neural networks, transformer networks are capable of capturing association relationships through the global self-attention mechanism to enhance feature representations from vibration signals. Despite this, transformer networks cannot explicitly establish the causal association between signal patterns and fault types, resulting in poor interpretability. To tackle these problems, an interpretable deep learning model named the variational attention-based transformer network (VATN) is proposed for RMFD. VATN is improved from transformer encoder to mine the association relationships within signals. To embed the prior knowledge of the fault type, which can be recognized based on several key features of vibration signals, a sparse constraint is designed for attention weights. Variational inference is employed to force attention weights to samples from Dirichlet distributions, and Laplace approximation is applied to realize reparameterization. Finally, two experimental studies conducted on bevel gear and bearing datasets demonstrate the effectiveness of VATN to other comparison methods, and the heat map of attention weights illustrates the causal association between fault types and signal patterns.
引用
收藏
页码:6878 / 6892
页数:14
相关论文
共 50 条
  • [1] An Intelligent Fault Diagnosis Method Based on Attention-Based Bidirectional LSTM Network
    Wang, Taiyong
    Wang, Tinghu
    Wang, Peng
    Qiao, Huihui
    Xu, Mingda
    [J]. Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2020, 53 (06): : 601 - 608
  • [2] Interpretable clinical prediction via attention-based neural network
    Chen, Peipei
    Dong, Wei
    Wang, Jinliang
    Lu, Xudong
    Kaymak, Uzay
    Huang, Zhengxing
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
  • [3] Interpretable clinical prediction via attention-based neural network
    Peipei Chen
    Wei Dong
    Jinliang Wang
    Xudong Lu
    Uzay Kaymak
    Zhengxing Huang
    [J]. BMC Medical Informatics and Decision Making, 20
  • [4] Interpretable Predictions of Clinical Outcomes with An Attention-based Recurrent Neural Network
    Sha, Ying
    Wang, May D.
    [J]. ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 233 - 240
  • [5] Attention-based interpretable neural network for building cooling load prediction
    Li, Ao
    Xiao, Fu
    Zhang, Chong
    Fan, Cheng
    [J]. APPLIED ENERGY, 2021, 299
  • [6] Fault diagnosis based on the quantification of the fault features in a rotary machine
    Lee, Jongsu
    Park, Byeonghui
    Lee, Changwoo
    [J]. APPLIED SOFT COMPUTING, 2020, 97
  • [7] The Attention-Based Autoencoder for Network Traffic Classification with Interpretable Feature Representation
    Cui, Jun
    Bai, Longkun
    Zhang, Xiaofeng
    Lin, Zhigui
    Liu, Qi
    [J]. SYMMETRY-BASEL, 2024, 16 (05):
  • [8] A residual denoising and multiscale attention-based weighted domain adaptation network for tunnel boring machine main bearing fault diagnosis
    Zhong, Tao
    Qin, ChengJin
    Shi, Gang
    Zhang, ZhiNan
    Tao, JianFeng
    Liu, ChengLiang
    [J]. SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2024, 67 (08) : 2594 - 2618
  • [9] A residual denoising and multiscale attention-based weighted domain adaptation network for tunnel boring machine main bearing fault diagnosis
    ZHONG Tao
    QIN ChengJin
    SHI Gang
    ZHANG ZhiNan
    TAO JianFeng
    LIU ChengLiang
    [J]. ScienceChina(TechnologicalSciences)., 2024, 67 (08) - 2618
  • [10] Machine Translation Using Improved Attention-based Transformer with Hybrid Input
    Abrishami, Mahsa
    Rashti, Mohammad Javad
    Naderan, Marjan
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 52 - 57