Enhancing Feature Representation for Anomaly Detection via Local-and-Global Temporal Relations and a Multi-stage Memory

Cited by: 0

Authors:

Li, Xuan [1]

Ma, Ding [1]

Wu, Xiangqian [1]

Affiliations:

[1] Harbin Inst Technol, Fac Comp, Harbin, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI | 2024 / Vol. 14430

Funding:

Natural Science Foundation of Heilongjiang Province;

Keywords:

Video anomaly detection; Weak supervision; Feature representation enhancing; Temporal relations; Multi-stage memory;

DOI:

10.1007/978-981-99-8537-1_10

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Weakly supervised video anomaly detection is a challenging task because frame-level labels are not accessible at training time. Tackling this task effectively requires models to learn discriminative feature representations. To address this challenge, we propose a multi-stage memory-augmented feature discrimination learning (MMFDL) method. The first stage obtains preliminary abnormal probabilities for clip features. In the second stage, an easy normal pattern memory (ENPM) is proposed to store normal patterns with low abnormal probabilities. In the last stage, we pull clip features with high abnormal probabilities in normal videos close to the ENPM and push them away from clip features with high abnormal probabilities in abnormal videos, so that models learn more discriminative features for anomaly detection. Furthermore, we propose a local-and-global temporal relations modeling (LGTRM) module to enhance clip features by aggregating local and global contexts. Our LGTRM module consists of two subnetworks: DW-Net and TF-Net. DW-Net integrates the current clip feature with its adjacent clip features to capture local-range temporal dependencies. TF-Net uses the multi-head self-attention mechanism of the transformer to capture global-range temporal dependencies. Experiments on two datasets demonstrate that our method outperforms state-of-the-art approaches. The code is available at https://github.com/xuanli01/PRCV347.
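The local-and-global aggregation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that). It assumes a plain neighbor average as a stand-in for DW-Net, a single-head self-attention with identity projections as a stand-in for TF-Net, and a residual sum to combine them. The names `local_aggregate`, `global_attend`, `enhance`, and the `radius` parameter are hypothetical.

```python
import math


def local_aggregate(clips, radius=1):
    """Average each clip feature with its neighbors within `radius`.

    Assumed stand-in for local-range modeling; the abstract does not
    specify how DW-Net weights adjacent clips.
    """
    n = len(clips)
    out = []
    for t in range(n):
        lo, hi = max(0, t - radius), min(n, t + radius + 1)
        window = clips[lo:hi]
        out.append([sum(col) / len(window) for col in zip(*window)])
    return out


def global_attend(clips):
    """Single-head scaled dot-product self-attention over all clips.

    Assumed simplification: queries, keys, and values are the raw clip
    features (no learned projections, one head instead of several).
    """
    d = len(clips[0])
    out = []
    for q in clips:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in clips]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        weights = [w / z for w in weights]
        out.append([sum(w * v[j] for w, v in zip(weights, clips)) for j in range(d)])
    return out


def enhance(clips, radius=1):
    """Combine original, local, and global features with a residual sum.

    The combination rule is an assumption made for illustration only.
    """
    local = local_aggregate(clips, radius)
    glob = global_attend(clips)
    return [
        [x + l + g for x, l, g in zip(c, lc, gc)]
        for c, lc, gc in zip(clips, local, glob)
    ]


if __name__ == "__main__":
    clips = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three toy clip features
    for row in enhance(clips):
        print([round(v, 3) for v in row])
```

Each output row keeps the dimensionality of the input clip feature, so an enhanced sequence of this kind could be passed to a clip-level scoring stage unchanged.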


Pages: 121-133

Page count: 13

