Moving Object Detection in Video Sequences Based on a Two-Frame Temporal Information CNN

被引：2

作者：

Chacon-Murguia, Mario I. ^{[1
]}

Guzman-Pando, Abimael ^{[1
]}

机构：

[1] Tecnol Nacl Mex Inst Tecnol Chihuahua, Ave Tecnol 2909, CHIH, Chihuahua 31310, Mexico

来源：

NEURAL PROCESSING LETTERS | 2023年 / 55卷 / 05期

关键词：

Moving object detection; Dynamic object detection; Convolutional neural network; Unbalanced classes;

D O I：

10.1007/s11063-022-11092-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Moving object detection methods, MOD, must solve complex situations found in video scenarios related to bootstrapping, illumination changes, bad weather, PTZ, intermittent objects, color camouflage, camera jittering, low camera frame rate, noisy videos, shadows, thermal videos, night videos, etc. Some of the most promising MOD methods are based on convolutional neural networks, which are among the best-ranked algorithms in the CDnet14 dataset. Therefore, this paper presents a novel CNN to detect moving objects called Two-Frame CNN, 2FraCNN. Unlike best-ranked algorithms in CDnet14, 2FraCNN is a non-transfer learning model and employs temporal information to estimate the motion of moving objects. The architecture of 2FraCNN is inspired by how the optical flow helps to estimate motion, and its core is the FlowNet architecture. 2FraCNN processes temporal information through the concatenation of two consecutive frames and an Encoder-Decoder architecture. 2FraCNN includes a novel training scheme to deal with unbalanced pixel classes background/foreground. 2FraCNN was evaluated using three different schemes: the CDnet14 benchmark for a state-of-the-art comparison; against human performance metric intervals for a realistic evaluation; and for practical purposes with the performance instrument PVADN that considers the quantitative criteria of performance, speed, auto-adaptability, documentation, and novelty. Findings show that 2FraCNN has a performance comparable to the top ten algorithms in CDnet14 and is one of the best twelve in the PVADN evaluation. Also, 2FraCNN demonstrated that can solve many video challenges categories with human-like performance, such as dynamic backgrounds, jittering, shadow, bad weather, and thermal cameras, among others. Based on these findings, it can be concluded that 2FraCNN is a robust algorithm solving different video conditions with competent performance regarding state-of-the-art algorithms.

引用

页码：5425 / 5449

页数：25

共 50 条

[1] Moving Object Detection in Video Sequences Based on a Two-Frame Temporal Information CNN
Mario I. Chacon-Murguia
Abimael Guzman-Pando
[J]. Neural Processing Letters, 2023, 55 : 5425 - 5449
[2] Moving Object Detection Based on Temporal Information
Wang, Zhihu
Liao, Kai
Xiong, Jiulong
Zhang, Qi
[J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (11) : 1403 - 1407
[3] Moving object detection in video sequences
Gu, Xiaofeng
Yao, Hui
Fu, Yan
Kuang, Ping
Sun, Shixin
[J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 810 - +
[4] Moving object detection in video sequences
Ren, Di
Xu, Bing
[J]. PROCEEDINGS OF THE ADVANCES IN MATERIALS, MACHINERY, ELECTRICAL ENGINEERING (AMMEE 2017), 2017, 114 : 406 - 412
[5] Small moving object detection in video sequences
Zaibi, R
Çetin, AE
Yardimci, Y
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2071 - 2074
[6] A moving object detection and recognition method in video sequences
College of Engineering, Ocean University of China, Qingdao 266071, China
[J]. Moshi Shibie yu Rengong Zhineng, 2006, 2 (238-242):
[7] Video object segmentation via random walks on two-frame graphs comprising superpixels
Wang, Hui
Liu, Weibin
Xing, Weiwei
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
[8] Video object segmentation via random walks on two-frame graphs comprising superpixels
Wang, Hui
Liu, Weibin
Xing, Weiwei
[J]. Journal of Visual Communication and Image Representation, 2021, 80
[9] Spatio-temporal detection of video moving object
Ren, Ming-Yi
Li, Xiao-Feng
Li, Zai-Ming
[J]. Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2009, 20 (07): : 911 - 915
[10] Key Frame Extraction of Surveillance Video based on Moving Object Detection and Image Similarity
Luo Y.
Zhou H.
Tan Q.
Chen X.
Yun M.
[J]. Pattern Recognition and Image Analysis, 2018, 28 (2) : 225 - 231

← 1 2 3 4 5 →