Multiple instance-based video anomaly detection using deep temporal encoding-decoding

被引:26
|
作者
Kamoona, Ammar Mansoor [1 ,2 ]
Gostar, Amirali Khodadadian [1 ]
Bab-Hadiashar, Alireza [1 ]
Hoseinnezhad, Reza [1 ]
机构
[1] RMIT Univ, Sch Engn, Melbourne, Vic 3000, Australia
[2] Univ Kufa, Fac Engn, Dept Elect Engn, Najaf, Iraq
基金
澳大利亚研究理事会;
关键词
Anomaly detection; Surveillance videos; Weakly supervised multiple instance learning; EVENT DETECTION; BEHAVIOR DETECTION; CLASSIFICATION; LOCALIZATION; FRAMEWORK; NETWORKS;
D O I
10.1016/j.eswa.2022.119079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a weakly supervised deep temporal encoding-decoding solution for anomaly detection in surveillance videos using multiple instance learning. The proposed approach uses both abnormal and normal video clips during the training phase which is developed in the multiple instance learning framework where we treat the video as a bag and video clips as instances in the bag. Our main contribution lies in the proposed novel approach to consider temporal relations between video instances in a weakly supervised setting. We deal with video instances (clips) as sequential visual data rather than a set of independent instances. We employ a deep temporal encoding-decoding network that is designed to capture spatio-temporal evolution of video instances over time. We also propose a new loss function that maximizes the mean distance between normal and abnormal instance predictions. The new loss function ensures a low false alarm rate which is crucial in practical surveillance applications. The proposed temporal encoding-decoding approach with the modified loss is benchmarked against the state of the art. The results show that the proposed method performs similar to, or better than the state-of-the-art solutions for anomaly detection in video surveillance applications and achieves the lowest false alarm rate on UCF-Crime dataset.
引用
收藏
页数:13
相关论文
empty
未找到相关数据