Unsupervised Visual Anomaly Detection Using Self-Supervised Pre-Trained Transformer

被引：0

作者：

Kim, Jun-Hyung ^{[1
]}

Kwon, Goo-Rak ^{[1
]}

机构：

[1] Chosun Univ, Dept Informat & Commun Engn, Gwangju 61452, South Korea

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Image reconstruction; Image segmentation; Transformers; Computational modeling; Location awareness; Feature extraction; Anomaly detection; Data augmentation; Self-supervised learning; data-augmentation; self-supervised learning; transformer;

D O I：

10.1109/ACCESS.2024.3454753

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the various industrial manufacturing processes, the automatic visual inspection system is an essential part as it reduces the chances of delivering defective products and the cost of training and hiring experts for manual inspection. In this work, we propose a new unsupervised anomaly detection method inspired by the masked language model for the automatic visual inspection system. The proposed method consists of an image tokenizer and two subnetworks, a reconstruction subnetwork, and a segmentation subnetwork. We adopt a pre-trained self-supervised vision Transformer model to use it as an image tokenizer. Our first subnetwork is trained to predict the anomaly-free patch tokens and the second subnetwork is trained to produce anomaly segmentation results from both the reconstructed and input patch tokens. During training, only the two subnetworks are optimized, and parameters of an image tokenizer are frozen. Experimental results show that the proposed method exhibits better performance than conventional methods in detecting defective products by achieving 99.05% I-AUROC on MVTecAD dataset and 94.8% I-AUROC on BTAD.

引用

页码：127604 / 127613

页数：10

共 50 条

[41] Self-Supervised Graph Transformer for Deepfake Detection
Khormali, Aminollah
Yuan, Jiann-Shiun
[J]. IEEE ACCESS, 2024, 12 : 58114 - 58127
[42] A self-supervised anomaly detection algorithm with interpretability
Wu, Zhichao
Yang, Xin
Wei, Xiaopeng
Yuan, Peijun
Zhang, Yuanping
Bai, Jianming
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
[43] Anomaly Detection on Electroencephalography with Self-supervised Learning
Xu, Junjie
Zheng, Yaojia
Mao, Yifan
Wang, Ruixuan
Zheng, Wei-Shi
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 363 - 368
[44] Self-supervised Visual Anomaly Detection with Image Patch Generation and Comparison Networks
Huang, Jianfeng
Zhao, Kaikai
Li, Chenyang
Lin, Yimin
Liu, Zhaoxiang
Wang, Kai
Lian, Shiguo
[J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT X, ICIC 2024, 2024, 14871 : 96 - 113
[45] Underwater Image Enhancement Using Pre-trained Transformer
Boudiaf, Abderrahmene
Guo, Yuhang
Ghimire, Adarsh
Werghi, Naoufel
De Masi, Giulia
Javed, Sajid
Dias, Jorge
[J]. IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 480 - 488
[46] MST: Masked Self-Supervised Transformer for Visual Representation
Li, Zhaowen
Chen, Zhiyang
Yang, Fan
Li, Wei
Zhu, Yousong
Zhao, Chaoyang
Deng, Rui
Wu, Liwei
Zhao, Rui
Tang, Ming
Wang, Jinqiao
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[47] Understanding writing style in social media with a supervised contrastively pre-trained transformer
Huertas-Tato, Javier
Martin, Alejandro
Camacho, David
[J]. KNOWLEDGE-BASED SYSTEMS, 2024, 296
[48] Pre-Trained Image Processing Transformer
Chen, Hanting
Wang, Yunhe
Guo, Tianyu
Xu, Chang
Deng, Yiping
Liu, Zhenhua
Ma, Siwei
Xu, Chunjing
Xu, Chao
Gao, Wen
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12294 - 12305
[49] Exploiting temporal coherence for self-supervised visual tracking by using vision transformer
Zhu, Wenjun
Wang, Zuyi
Xu, Li
Meng, Jun
[J]. KNOWLEDGE-BASED SYSTEMS, 2022, 251
[50] SPot-the-Difference Self-supervised Pre-training for Anomaly Detection and Segmentation
Zou, Yang
Jeong, Jongheon
Pemula, Latha
Zhang, Dongqing
Dabeer, Onkar
[J]. COMPUTER VISION - ECCV 2022, PT XXX, 2022, 13690 : 392 - 408

← 1 2 3 4 5 →