Multi-scale Siamese prediction network for video anomaly detection

Cited by: 0
Authors
Jingxian Yang
Yiheng Cai
Dan Liu
Jin Xie
Affiliations
[1] Faculty of Information Technology
[2] Beijing University of Technology
Keywords
Conv-GRU; Inception module; Siamese network; Video anomaly detection
DOI
Not available
Abstract
Automatically detecting anomalous events in surveillance videos is crucial for maintaining security. Because the task is challenging, the performance of existing approaches remains limited. In this study, we propose a video anomaly detection method called the multi-scale Siamese prediction (MSSP) framework, in which a Siamese network exploits the information embedded in observed anomalous events without requiring any additional parameters. To extract spatiotemporal features, we introduce a multi-scale component that combines an improved inception module with a convolutional GRU (Conv-GRU) module. Both modules are used in each layer of the U-Net encoding stage to mitigate the information loss caused by subsampling. To further optimize the model, we propose a loss function that combines the prediction loss with a contrastive loss. We evaluate the method on three public datasets: CUHK Avenue, UCSD Ped2, and ShanghaiTech. Experimental results show that the MSSP framework achieves AUC values of 89.4%, 97.4%, and 73.83%, respectively, significantly outperforming the other methods compared.
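The abstract states only that the loss combines a prediction loss with a contrastive loss; it does not give the exact form. The following is a minimal sketch of one plausible combination, assuming a mean-squared-error prediction loss, the classic margin-based contrastive loss on a pair of feature vectors, and a weighted sum. The function names, the `weight` and `margin` parameters, and the flat-list representation of frames and features are all hypothetical and are not taken from the paper.

```python
import math


def prediction_loss(predicted, target):
    """Mean squared error between a predicted frame and the true frame.

    Assumption: frames are flat lists of pixel intensities of equal length.
    """
    if len(predicted) != len(target):
        raise ValueError("frames must have the same number of pixels")
    return sum((p - t) ** 2 for p, t in zip(predicted, target)) / len(predicted)


def contrastive_loss(feat_a, feat_b, same_class, margin=1.0):
    """Margin-based contrastive loss for one pair of feature vectors.

    Pairs from the same class are pulled together (squared distance);
    pairs from different classes are pushed at least `margin` apart.
    """
    dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(feat_a, feat_b)))
    if same_class:
        return dist ** 2
    return max(0.0, margin - dist) ** 2


def combined_loss(predicted, target, feat_a, feat_b, same_class,
                  weight=0.5, margin=1.0):
    """Weighted sum of the two terms (hypothetical weighting scheme)."""
    return (prediction_loss(predicted, target)
            + weight * contrastive_loss(feat_a, feat_b, same_class, margin))


if __name__ == "__main__":
    # Toy values: a 2-pixel "frame" and 2-dimensional features from the
    # two Siamese branches, treated here as a normal/anomalous pair.
    loss = combined_loss([0.2, 0.9], [0.0, 1.0],
                         [0.1, 0.1], [0.4, 0.5], same_class=False)
    print(f"combined loss: {loss:.4f}")
```

In a sketch like this, `weight` trades off reconstruction quality of the predicted frame against the separation of normal and anomalous features; the paper may balance the terms differently.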
Pages: 671-678
Page count: 7