Multi-level feature splicing 3D network based on multi-task joint learning for video anomaly detection

被引：0

作者：

Li, Yang ^{[1
]}

Tong, Guoxiang ^{[1
]}

机构：

[1] Univ Shanghai Sci & Technol, 516 Jungong Rd, Shanghai 200093, Peoples R China

来源：

NEUROCOMPUTING | 2025年 / 636卷

关键词：

Video anomaly detection; Multi-task learning; Pseudo-anomaly; Feature splicing; Attention gating; ABNORMAL EVENT DETECTION;

D O I：

10.1016/j.neucom.2025.129964

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In video anomaly detection research, deep learning is dedicated to identifying anomalous events accurately and efficiently. However, due to the scarcity and diversity of anomaly samples, previous methods have not adequately taken into account important information about location and timing. In addition, the overpowered generalization ability of the models leads to the fact that anomalies can also be well reconstructed or predicted. To address the above challenges, we propose a 3D network based on multi-level feature splicing with joint multi-task learning. The network is improved by the autoencoder (AE) as a backbone network. Firstly, we design a normal sample training task and a Gaussian noise task from a spatial perspective to enhance the reconstruction of positive samples. The frame-skipping task and the inverse sequence task of the video are designed from the temporal perspective to suppress the reconstruction ability of negative samples. Secondly, we use multi-level feature splicing in the encoding and decoding process to equip the network with the ability to explore sufficient information from the full scale. At the same time, we use an attention gating module to filter redundant features. The results show that our network is competitive with state-of-the-art methods. In terms of AUC, UCSD Ped2 achieves 99.3%, CUHK Avenue achieves 88.4%, and ShanghaiTech Campus achieves 74.2%.

引用

页数：13

共 50 条

[21] Probabilistic Joint Feature Selection for Multi-task Learning
Xiong, Tao
Bi, Jinbo
Rao, Bharat
Cherkassky, Vladimir
PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 332 - +
[22] Multi-Task Network Combing Multi-Level Information for Object Localization
Tian, Yan
Wang, Huiyan
Wang, Xun
Huang, Gang
Zhang, Guofeng
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2017, 29 (07): : 1275 - 1282
[23] FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Huang, Zhijian
Lin, Sihao
Liu, Guiyu
Luo, Mukun
Ye, Chaoqiang
Xu, Hang
Chang, Xiaojun
Liang, Xiaodan
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 3479 - 3488
[24] 3D-MMFN: Multi-level multimodal fusion network for 3D industrial image anomaly detection
Asad, Mujtaba
Azeem, Waqar
Malik, Aftab Ahmad
Jiang, He
Ali, Ahmad
Yang, Jie
Liu, Wei
ADVANCED ENGINEERING INFORMATICS, 2025, 65
[25] Predicting Taxi Demand Based on 3D Convolutional Neural Network and Multi-task Learning
Kuang, Li
Yan, Xuejin
Tan, Xianhan
Li, Shuqi
Yang, Xiaoxian
REMOTE SENSING, 2019, 11 (11)
[26] Multi-Task Adversarial Network for Disentangled Feature Learning
Liu, Yang
Wang, Zhaowen
Jin, Hailin
Wassell, Ian
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3743 - 3751
[27] 3D facial landmark detection based on differential cylindrical projection and multi-task learning
Terada, Takuma
Kimura, Ryusuke
Chen, Yen-Wei
COMMUNICATIONS IN INFORMATION AND SYSTEMS, 2020, 20 (04) : 443 - 459
[28] A Multi-level Feature Enhancement Network for Image Splicing Localization
Zhang, Zeyu
Cao, Yun
Zhao, Xianfeng
DIGITAL FORENSICS AND WATERMARKING, IWDW 2021, 2022, 13180 : 3 - 16
[29] Multi-Task Learning Based Joint Pulse Detection and Modulation Classification
Akyon, Fatih Cagatay
Nuhoglu, Mustafa Atahan
Alp, Yasar Kemal
Arikan, Orhan
2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
[30] Multi-Task Multi-Sensor Fusion for 3D Object Detection
Liang, Ming
Yang, Bin
Chen, Yun
Hu, Rui
Urtasun, Raquel
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7337 - 7345

← 1 2 3 4 5 →