Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer

被引:0
|
作者
Kim, Jun-Hyung [1 ]
Kwon, Goo-Rak [1 ]
机构
[1] Chosun Univ, Dept Informat & Commun Engn, Gwangju 61452, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Image reconstruction; Image segmentation; Transformers; Computational modeling; Location awareness; Feature extraction; Anomaly detection; Data augmentation; Self-supervised learning; data-augmentation; self-supervised learning; transformer;
D O I
10.1109/ACCESS.2024.3454753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the various industrial manufacturing processes, the automatic visual inspection system is an essential part as it reduces the chances of delivering defective products and the cost of training and hiring experts for manual inspection. In this work, we propose a new unsupervised anomaly detection method inspired by the masked language model for the automatic visual inspection system. The proposed method consists of an image tokenizer and two subnetworks, a reconstruction subnetwork, and a segmentation subnetwork. We adopt a pre-trained self-supervised vision Transformer model to use it as an image tokenizer. Our first subnetwork is trained to predict the anomaly-free patch tokens and the second subnetwork is trained to produce anomaly segmentation results from both the reconstructed and input patch tokens. During training, only the two subnetworks are optimized, and parameters of an image tokenizer are frozen. Experimental results show that the proposed method exhibits better performance than conventional methods in detecting defective products by achieving 99.05% I-AUROC on MVTecAD dataset and 94.8% I-AUROC on BTAD.
引用
收藏
页码:127604 / 127613
页数:10
相关论文
共 50 条
  • [41] Self-Supervised Graph Transformer for Deepfake Detection
    Khormali, Aminollah
    Yuan, Jiann-Shiun
    [J]. IEEE ACCESS, 2024, 12 : 58114 - 58127
  • [42] A self-supervised anomaly detection algorithm with interpretability
    Wu, Zhichao
    Yang, Xin
    Wei, Xiaopeng
    Yuan, Peijun
    Zhang, Yuanping
    Bai, Jianming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [43] Anomaly Detection on Electroencephalography with Self-supervised Learning
    Xu, Junjie
    Zheng, Yaojia
    Mao, Yifan
    Wang, Ruixuan
    Zheng, Wei-Shi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 363 - 368
  • [44] Self-supervised Visual Anomaly Detection with Image Patch Generation and Comparison Networks
    Huang, Jianfeng
    Zhao, Kaikai
    Li, Chenyang
    Lin, Yimin
    Liu, Zhaoxiang
    Wang, Kai
    Lian, Shiguo
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024, 2024, 14871 : 96 - 113
  • [45] Underwater Image Enhancement Using Pre-trained Transformer
    Boudiaf, Abderrahmene
    Guo, Yuhang
    Ghimire, Adarsh
    Werghi, Naoufel
    De Masi, Giulia
    Javed, Sajid
    Dias, Jorge
    [J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 480 - 488
  • [46] MST: Masked Self-Supervised Transformer for Visual Representation
    Li, Zhaowen
    Chen, Zhiyang
    Yang, Fan
    Li, Wei
    Zhu, Yousong
    Zhao, Chaoyang
    Deng, Rui
    Wu, Liwei
    Zhao, Rui
    Tang, Ming
    Wang, Jinqiao
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [47] Understanding writing style in social media with a supervised contrastively pre-trained transformer
    Huertas-Tato, Javier
    Martin, Alejandro
    Camacho, David
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [48] Pre-Trained Image Processing Transformer
    Chen, Hanting
    Wang, Yunhe
    Guo, Tianyu
    Xu, Chang
    Deng, Yiping
    Liu, Zhenhua
    Ma, Siwei
    Xu, Chunjing
    Xu, Chao
    Gao, Wen
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12294 - 12305
  • [49] Exploiting temporal coherence for self-supervised visual tracking by using vision transformer
    Zhu, Wenjun
    Wang, Zuyi
    Xu, Li
    Meng, Jun
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [50] SPot-the-Difference Self-supervised Pre-training for Anomaly Detection and Segmentation
    Zou, Yang
    Jeong, Jongheon
    Pemula, Latha
    Zhang, Dongqing
    Dabeer, Onkar
    [J]. COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 392 - 408