Hybrid Attention and Motion Constraint for Anomaly Detection in Crowded Scenes

Cited by: 7
Authors
Zhang, Xinfeng [1]
Fang, Jinpeng [1]
Yang, Baoqing [1]
Chen, Shuhan [1]
Li, Bin [1]
Affiliation
[1] Yangzhou Univ, Coll Artificial Intelligence, Coll Informat Engn, Jiangsu Prov Engn Res Ctr Knowledge Management, Yangzhou 225127, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Videos; Anomaly detection; Training; Memory modules; Dictionaries; Testing; Surveillance; video surveillance; deep autoencoder; attention mechanism; ABNORMAL EVENT DETECTION;
DOI
10.1109/TCSVT.2022.3221622
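Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract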
Crowds often appear in surveillance videos of public places, and detecting anomalies in such videos is of great importance to public safety. Because abnormal cases are rare, variable, and unpredictable, autoencoders with an encoder-decoder structure trained only on normal samples have become a popular approach to anomaly detection. However, autoencoders generalize so well that they can sometimes still reconstruct abnormal cases accurately. Recently, some researchers have constructed memory modules from normal data and used the stored normal memory items to reconstruct test samples during inference, which increases the reconstruction errors for anomalies. In practice, however, the errors of reconstructing normal samples with the memory items often increase as well, so normal and abnormal cases remain difficult to distinguish. In addition, a memory-based autoencoder is usually usable only in the specific scene where its memory module was constructed, which largely rules out cross-scene applications. We mitigate the overgeneralization of autoencoders from a different perspective, namely by reducing the prediction errors for normal cases rather than increasing the prediction errors for abnormal cases. To this end, we propose an autoencoder based on hybrid attention and a motion constraint for anomaly detection. The hybrid attention consists of channel attention used in the encoding process and spatial attention added to the skip connection between the encoder and decoder. It reduces the weight of the feature channels and regions that represent the background in the feature matrix, so that the autoencoder focuses on optimizing the representation of normal targets during training. Furthermore, we introduce a motion constraint to improve the autoencoder's ability to predict normal activities in crowded scenes.
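As a rough illustration of the two kinds of attention named in the abstract, the sketch below gates a small feature map first per channel and then per spatial location. The abstract does not specify the attention layers, so the parameter-free sigmoid gates, the function names, and the toy feature values are all assumptions made for illustration; a real model would presumably use learned layers.

```python
import math


def sigmoid(x):
    """Logistic function, used here as a simple gate in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat):
    """Scale each channel by a gate computed from its global average.

    feat is a nested list shaped [channels][height][width].
    Assumption: a parameter-free gate stands in for learned layers.
    """
    out = []
    for channel in feat:
        n = len(channel) * len(channel[0])
        mean = sum(sum(row) for row in channel) / n
        gate = sigmoid(mean)
        out.append([[v * gate for v in row] for row in channel])
    return out


def spatial_attention(feat):
    """Scale each location by a gate computed from the cross-channel mean."""
    c, h, w = len(feat), len(feat[0]), len(feat[0][0])
    gates = [
        [sigmoid(sum(feat[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]
    return [
        [[feat[k][i][j] * gates[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]


# Toy feature map: channel 0 has uniformly low activations (hypothetical
# "background"), channel 1 has one strong activation (hypothetical "target").
feat = [
    [[-2.0, -2.0], [-2.0, -2.0]],
    [[4.0, 0.0], [0.0, 0.0]],
]
channel_gated = channel_attention(feat)
spatially_gated = spatial_attention(feat)
```

In this toy case the low-activation channel and the low-activation locations receive smaller gates, which mirrors the stated goal of down-weighting background channels and regions.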
We conduct experiments on real-world surveillance videos, UCSD, CUHK Avenue, and ShanghaiTech datasets. The experimental results indicate that the prediction errors of the proposed method for frequent normal crowd activities are smaller than those of other approaches, which widens the gap between the prediction errors for normal frames and those for abnormal frames. In addition, the proposed method does not depend on a specific scene. It therefore balances good anomaly detection performance with strong cross-scene capability.
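The idea of separating normal and abnormal frames by their prediction errors can be sketched as follows. This is a minimal illustration, not the paper's scoring procedure: the mean squared error, the min-max normalization, the helper names, and the example numbers are all assumptions.

```python
def frame_error(pred, target):
    """Mean squared error between a predicted and a real frame.

    Frames are flat lists of pixel values of equal length.
    """
    if len(pred) != len(target):
        raise ValueError("frames must have the same number of pixels")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def anomaly_scores(errors):
    """Min-max normalize per-frame errors to [0, 1]; higher means more anomalous."""
    lo, hi = min(errors), max(errors)
    if hi == lo:
        return [0.0 for _ in errors]
    return [(e - lo) / (hi - lo) for e in errors]


def error_gap(errors, labels):
    """Mean error on abnormal frames (label 1) minus mean error on normal frames (label 0)."""
    normal = [e for e, y in zip(errors, labels) if y == 0]
    abnormal = [e for e, y in zip(errors, labels) if y == 1]
    if not normal or not abnormal:
        raise ValueError("need at least one normal and one abnormal frame")
    return sum(abnormal) / len(abnormal) - sum(normal) / len(normal)


# Hypothetical per-frame errors: two normal frames followed by one abnormal frame.
errors = [0.01, 0.02, 0.09]
labels = [0, 0, 1]
scores = anomaly_scores(errors)
gap = error_gap(errors, labels)
```

Under these assumptions, lowering the errors on normal frames while leaving the abnormal ones unchanged increases `gap`, which is the effect the abstract attributes to the proposed method.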
Pages: 2259-2274
Number of pages: 16
Related papers
50 in total
  • [1] Anomaly detection in crowded scenes using motion energy model
    Tianyu Chen
    Chunping Hou
    Zhipeng Wang
    Hua Chen
    [J]. Multimedia Tools and Applications, 2018, 77 : 14137 - 14152
  • [2] Anomaly Detection in Crowded Scenes Based on Group Motion Features
    Guo, Shuqiang
    Li, Dongxue
    Yao, Lili
    [J]. JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (03) : 871 - 879
  • [3] Anomaly detection in crowded scenes using motion energy model
    Chen, Tianyu
    Hou, Chunping
    Wang, Zhipeng
    Chen, Hua
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (11) : 14137 - 14152
  • [4] Anomaly Detection in Crowded Scenes
    Mahadevan, Vijay
    Li, Weixin
    Bhalodia, Viral
    Vasconcelos, Nuno
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010 : 1975 - 1981
  • [5] Anomaly Detection and Localization in Crowded Scenes
    Li, Weixin
    Mahadevan, Vijay
    Vasconcelos, Nuno
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (01) : 18 - 32
  • [6] Contextual anomaly detection in crowded surveillance scenes
    Leach, Michael J. V.
    Sparks, Ed. P.
    Robertson, Neil M.
    [J]. PATTERN RECOGNITION LETTERS, 2014, 44 : 71 - 79
  • [7] Density aware anomaly detection in crowded scenes
    Gunduz, Ayse Elvan
    Ongun, Cihan
    Temizel, Tugba Taskaya
    Temizel, Alptekin
    [J]. IET COMPUTER VISION, 2016, 10 (05) : 374 - 381
  • [8] Gaussian mixtures for anomaly detection in crowded scenes
    Ullah, Habib
    Tenuti, Lorenza
    Conci, Nicola
    [J]. VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS, 2013, 8663
  • [9] Anomaly Detection in Crowded Scenes using Genetic Programming
    Xie, Cheng
    Shang, Lin
    [J]. 2014 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2014 : 1832 - 1839
  • [10] Generative Neural Networks for Anomaly Detection in Crowded Scenes
    Wang, Tian
    Qiao, Meina
    Lin, Zhiwei
    Li, Ce
    Snoussi, Hichem
    Liu, Zhe
    Choi, Chang
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (05) : 1390 - 1399