Multi-level feature splicing 3D network based on multi-task joint learning for video anomaly detection

被引:0
|
作者
Li, Yang [1 ]
Tong, Guoxiang [1 ]
机构
[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China
关键词
Video anomaly detection; Multi-task learning; Pseudo-anomaly; Feature splicing; Attention gating; ABNORMAL EVENT DETECTION;
D O I
10.1016/j.neucom.2025.129964
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In video anomaly detection research, deep learning is dedicated to identifying anomalous events accurately and efficiently. However, due to the scarcity and diversity of anomaly samples, previous methods have not adequately taken into account important information about location and timing. In addition, the overpowered generalization ability of the models leads to the fact that anomalies can also be well reconstructed or predicted. To address the above challenges, we propose a 3D network based on multi-level feature splicing with joint multi-task learning. The network is improved by the autoencoder (AE) as a backbone network. Firstly, we design a normal sample training task and a Gaussian noise task from a spatial perspective to enhance the reconstruction of positive samples. The frame-skipping task and the inverse sequence task of the video are designed from the temporal perspective to suppress the reconstruction ability of negative samples. Secondly, we use multi-level feature splicing in the encoding and decoding process to equip the network with the ability to explore sufficient information from the full scale. At the same time, we use an attention gating module to filter redundant features. The results show that our network is competitive with state-of-the-art methods. In terms of AUC, UCSD Ped2 achieves 99.3%, CUHK Avenue achieves 88.4%, and ShanghaiTech Campus achieves 74.2%.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Probabilistic Joint Feature Selection for Multi-task Learning
    Xiong, Tao
    Bi, Jinbo
    Rao, Bharat
    Cherkassky, Vladimir
    PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 332 - +
  • [22] Multi-Task Network Combing Multi-Level Information for Object Localization
    Tian, Yan
    Wang, Huiyan
    Wang, Xun
    Huang, Gang
    Zhang, Guofeng
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2017, 29 (07): : 1275 - 1282
  • [23] FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
    Huang, Zhijian
    Lin, Sihao
    Liu, Guiyu
    Luo, Mukun
    Ye, Chaoqiang
    Xu, Hang
    Chang, Xiaojun
    Liang, Xiaodan
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3479 - 3488
  • [24] 3D-MMFN: Multi-level multimodal fusion network for 3D industrial image anomaly detection
    Asad, Mujtaba
    Azeem, Waqar
    Malik, Aftab Ahmad
    Jiang, He
    Ali, Ahmad
    Yang, Jie
    Liu, Wei
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [25] Predicting Taxi Demand Based on 3D Convolutional Neural Network and Multi-task Learning
    Kuang, Li
    Yan, Xuejin
    Tan, Xianhan
    Li, Shuqi
    Yang, Xiaoxian
    REMOTE SENSING, 2019, 11 (11)
  • [26] Multi-Task Adversarial Network for Disentangled Feature Learning
    Liu, Yang
    Wang, Zhaowen
    Jin, Hailin
    Wassell, Ian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3743 - 3751
  • [27] 3D facial landmark detection based on differential cylindrical projection and multi-task learning
    Terada, Takuma
    Kimura, Ryusuke
    Chen, Yen-Wei
    COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2020, 20 (04) : 443 - 459
  • [28] A Multi-level Feature Enhancement Network for Image Splicing Localization
    Zhang, Zeyu
    Cao, Yun
    Zhao, Xianfeng
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2021, 2022, 13180 : 3 - 16
  • [29] Multi-Task Learning Based Joint Pulse Detection and Modulation Classification
    Akyon, Fatih Cagatay
    Nuhoglu, Mustafa Atahan
    Alp, Yasar Kemal
    Arikan, Orhan
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [30] Multi-Task Multi-Sensor Fusion for 3D Object Detection
    Liang, Ming
    Yang, Bin
    Chen, Yun
    Hu, Rui
    Urtasun, Raquel
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7337 - 7345