Fusion-restoration model for industrial multimodal anomaly detection

Authors
Wang, Jiaxun [1]
Niu, Yanchang [1]
Huang, Biqing [1]
Affiliations
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
Keywords
Anomaly detection; Multimodal fusion; Feature reconstruction; Unsupervised learning
DOI
10.1016/j.neucom.2025.130073
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Industrial anomaly detection based on multimodal data is receiving increasing attention, and feature mapping is currently the prevailing paradigm. However, existing feature mapping methods lack multimodal fusion, which hinders comprehensive interaction between RGB and point cloud features. In this paper, we introduce a novel feature reconstruction paradigm, the Fusion-Restoration Model (FRM), to address this problem. A fusion encoder integrates the information of the two domains into a fusion embedding. A pair of decoupled decoders then independently restore the embeddings of the corresponding domains from the fusion embedding. FRM learns nominal feature reconstruction from anomaly-free training samples and, in the inference phase, detects and localizes anomalies based on the reconstruction residuals. A joint loss that constrains both direction and magnitude is used to make the reconstruction more robust. Additionally, a semi-frozen training strategy is designed to adapt the batch normalization parameters of the 3D feature extractor to the target industrial dataset. Extensive experiments show that our method achieves effective and efficient multimodal anomaly detection on the MVTec 3D-AD dataset.
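
The abstract does not give the exact form of the joint loss or of the residual-based anomaly score, so the following is only a minimal sketch under stated assumptions: direction is constrained with a cosine distance, magnitude with a squared L2 distance, and the per-patch anomaly score is the sum of the RGB and point cloud residual norms. The function names, the `weight` parameter, and the toy feature values are all hypothetical and are not taken from the paper.

```python
import math

def cosine_distance(u, v, eps=1e-8):
    """Direction term: 1 - cosine similarity between two feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v + eps)

def squared_l2(u, v):
    """Magnitude term: squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def joint_loss(original, restored, weight=1.0):
    """Assumed joint loss: mean over patches of (direction + weight * magnitude)."""
    per_patch = [
        cosine_distance(o, r) + weight * squared_l2(o, r)
        for o, r in zip(original, restored)
    ]
    return sum(per_patch) / len(per_patch)

def anomaly_map(rgb, rgb_restored, pc, pc_restored):
    """Assumed per-patch score: RGB residual norm plus point cloud residual norm."""
    return [
        math.sqrt(squared_l2(a, a_hat)) + math.sqrt(squared_l2(b, b_hat))
        for a, a_hat, b, b_hat in zip(rgb, rgb_restored, pc, pc_restored)
    ]

def image_score(patch_scores):
    """Assumed image-level score: the largest per-patch residual."""
    return max(patch_scores)

# Toy example with three patches and two-dimensional features per modality.
# Only the third RGB patch is restored poorly, mimicking a local defect.
rgb = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
rgb_restored = [[1.0, 0.0], [0.0, 1.0], [1.0, -1.0]]
pc = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]
pc_restored = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]

scores = anomaly_map(rgb, rgb_restored, pc, pc_restored)
print("per-patch scores:", scores)
print("image-level score:", image_score(scores))
print("joint loss on RGB features:", joint_loss(rgb, rgb_restored))
```

In this toy setup the well-restored patches score zero and the poorly restored patch carries the whole residual, which is the behavior a reconstruction-based detector trained only on anomaly-free samples relies on. Combining a direction term with a magnitude term means a restored feature is penalized both for pointing the wrong way and for having the wrong scale.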
Pages: 10