Hybrid Attention and Motion Constraint for Anomaly Detection in Crowded Scenes

被引：7

作者：

Zhang, Xinfeng ^{[1
]}

Fang, Jinpeng ^{[1
]}

Yang, Baoqing ^{[1
]}

Chen, Shuhan ^{[1
]}

Li, Bin ^{[1
]}

机构：

[1] Yangzhou Univ, Coll Artificial Intelligence, Coll Informat Engn, Jiangsu Prov Engn Res Ctr Knowledge Management, Yangzhou 225127, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023年 / 33卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Videos; Anomaly detection; Training; Memory modules; Dictionaries; Testing; Surveillance; video surveillance; deep autoencoder; attention mechanism; ABNORMAL EVENT DETECTION;

D O I：

10.1109/TCSVT.2022.3221622

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Crowds often appear in surveillance videos in public places, from which anomaly detection is of great importance to public safety. Since the abnormal cases are rare, variable and unpredictable, autoencoders with encoder and decoder structures using only normal samples have become a hot topic among various approaches for anomaly detection. However, since autoencoders have excessive generalization ability, they can sometimes still reconstruct abnormal cases very well. Recently, some researchers construct memory modules under normal conditions and use these normal memory items to reconstruct test samples during inference to increase the reconstruction errors for anomalies. However, in practice, the errors of reconstructing normal samples with the memory items often increase as well, which makes it still difficult to distinguish between normal and abnormal cases. In addition, the memory-based autoencoder is usually available only in the specific scene where the memory module is constructed and almost loses the prospect of cross-scene applications. We mitigate the overgeneralization of autoencoders from a different perspective, namely, by reducing the prediction errors for normal cases rather than increasing the prediction errors for abnormal cases. To this end, we propose an autoencoder based on hybrid attention and motion constraint for anomaly detection. The hybrid attention includes the channel attention used in the encoding process and spatial attention added to the skip connection between the encoder and decoder. The hybrid attention is introduced to reduce the weight of the feature channels and regions representing the background in the feature matrix, which makes the autoencoder features more focused on optimizing the representation of the normal targets during training. Furthermore, we introduce motion constraint to improve the autoencoder's ability to predict normal activities in crowded scenes. We conduct experiments on real-world surveillance videos, UCSD, CUHK Avenue, and ShanghaiTech datasets. The experimental results indicate that the prediction errors of the proposed method for frequent normal crowd activities are smaller than those of other approaches, which increases the gap between the prediction errors for normal frames and the prediction errors for abnormal frames. In addition, the proposed method does not depend on a specific scene. Therefore, it balances good anomaly detection performance and strong cross-scene capability.

引用

页码：2259 / 2274

页数：16

共 50 条

[1] Anomaly detection in crowded scenes using motion energy model
Tianyu Chen
Chunping Hou
Zhipeng Wang
Hua Chen
[J]. Multimedia Tools and Applications, 2018, 77 : 14137 - 14152
[2] Anomaly Detection in Crowded Scenes Based on Group Motion Features
Guo, Shuqiang
Li, Dongxue
Yao, Lili
[J]. JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (03): : 871 - 879
[3] Anomaly detection in crowded scenes using motion energy model
Chen, Tianyu
Hou, Chunping
Wang, Zhipeng
Chen, Hua
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (11) : 14137 - 14152
[4] Anomaly Detection in Crowded Scenes
Mahadevan, Vijay
Li, Weixin
Bhalodia, Viral
Vasconcelos, Nuno
[J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 1975 - 1981
[5] Anomaly Detection and Localization in Crowded Scenes
Li, Weixin
Mahadevan, Vijay
Vasconcelos, Nuno
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (01) : 18 - 32
[6] Contextual anomaly detection in crowded surveillance scenes
Leach, Michael J. V.
Sparks, Ed. P.
Robertson, Neil M.
[J]. PATTERN RECOGNITION LETTERS, 2014, 44 : 71 - 79
[7] Density aware anomaly detection in crowded scenes
Gunduz, Ayse Elvan
Ongun, Cihan
Temizel, Tugba Taskaya
Temizel, Alptekin
[J]. IET COMPUTER VISION, 2016, 10 (05) : 374 - 381
[8] Gaussian mixtures for anomaly detection in crowded scenes
Ullah, Habib
Tenuti, Lorenza
Conci, Nicola
[J]. VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS, 2013, 8663
[9] Anomaly Detection in Crowded Scenes using Genetic Programming
Xie, Cheng
Shang, Lin
[J]. 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014, : 1832 - 1839
[10] Generative Neural Networks for Anomaly Detection in Crowded Scenes
Wang, Tian
Qiao, Meina
Lin, Zhiwei
Li, Ce
Snoussi, Hichem
Liu, Zhe
Choi, Chang
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (05) : 1390 - 1399

← 1 2 3 4 5 →