Hydrophobicity-Based Grading of Industrial Composite Insulators Images Using Cross Attention Vision Transformer With Knowledge Distillation

被引:0
|
作者
Das, Samiran [1 ]
Chatterjee, Sujoy [2 ]
Basu, Mainak [3 ]
机构
[1] Helmholtz Zentrum Dresden Rossendorf, Helmholtz Inst Freiberg, D-09599 Freiberg, Germany
[2] Amity Univ, Amity Sch Engn & Technol, Dept CSE & Res & Innovat Cell, Kolkata 700135, India
[3] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
关键词
Power transformer insulation; Insulators; Visualization; Shape; Feature extraction; Deep learning; Surface treatment; Composite insulator (CI); deep learning; depthwise separable convolution; explainable AI; grad-CAM; hydrophobicity; industrial automation; vision transformer (ViT);
D O I
10.1109/TDEI.2023.3347377
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The hydrophobic property of composite insulators (CIs) is a crucial attribute that prevents the accumulation of significant water content on the surface and ensures smooth operation and prevention of power line surges. However, since the weathering effect gradually reduces the hydrophobicity, periodic monitoring is necessary to ensure the smooth operation of the powerline. Efficient machine learning-based analysis of the CI images collected by the spray method can potentially achieve very high efficacy. However, inconsistency in the shape and size of droplets, the intricate pattern of the droplets, and the lack of color variation of the images, pose a challenge to the prevalent computer vision techniques for categorizing the CIs. This treatise explored a novel, efficient deep-learning paradigm, termed cross-attention vision transformer (CA-ViT) for grading the CI images. The CA-ViT uses both small and large patches to blend spatial features corresponding to a different scale. The network uses a token fusion module to effectively combine the tokens obtained from small and large patches and understand the visual pattern, and characterize droplets of different shapes and sizes by exchanging cross-attentions, which demonstrates its ability to precisely detect small and large droplets, and accurately classify the droplet images. Further, the work also introduces a knowledge distillation (KD) strategy to reduce the computational run time of the model. Exhaustive experimental results have confirmed that our proposed work surpasses state-of-the-art methods.
引用
收藏
页码:523 / 532
页数:10
相关论文
共 6 条
  • [1] DHT: Dynamic Vision Transformer Using Hybrid Window Attention for Industrial Defect Images Classification
    Ding, Chao
    Tang, Donglin
    Zheng, Xianghua
    Wang, Qiang
    He, Yuanyuan
    Long, Zhang
    IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2023, 26 (02) : 19 - 28
  • [2] Squeeze-and-excitation-attention-based mobile vision transformer for grading recognition of bladder prolapse in pelvic MRI images
    Zhu, Shaojun
    Chen, Guotao
    Chen, Hongguang
    Lu, Ying
    Wu, Maonian
    Zheng, Bo
    Liu, Dongquan
    Qian, Cheng
    Chen, Yun
    MEDICAL PHYSICS, 2024, 51 (08) : 5236 - 5249
  • [3] Two-stream cross-attention vision Transformer based on RGB-D images for pig weight estimation
    He, Wei
    Mi, Yang
    Ding, Xiangdong
    Liu, Gang
    Li, Tao
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
  • [4] A generalised vision transformer-based self-supervised model for diagnosing and grading prostate cancer using histological images
    Chaurasia, Abadh K.
    Harris, Helen C.
    Toohey, Patrick W.
    Hewitt, Alex W.
    PROSTATE CANCER AND PROSTATIC DISEASES, 2025,
  • [5] Grading diffuse glioma based on 2021 WHO grade using self-attention-base deep learning architecture: variable Vision Transformer (vViT)
    Usuzaki, Takuma
    Takahashi, Kengo
    Inamori, Ryusei
    Morishita, Yohei
    Takagi, Hidenobu
    Shizukuishi, Takashi
    Toyama, Yoshitaka
    Abe, Mirei
    Ishikuro, Mami
    Obara, Taku
    Majima, Kazuhiro
    Takase, Kei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 91
  • [6] AN INTERPRETABLE GLAUCOMA DETECTION USING DUAL SCALE CROSS-ATTENTION VISION TRANSFORMER-BASED LONG SHORT TERM MEMORY WITH OPTICAL CUP AND DISK SEGMENTATION
    Krishnamoorthy, V.
    Logeswari, S.
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2025, 25 (01)