Moving Object Detection in Video Sequences Based on a Two-Frame Temporal Information CNN

Cited by: 2
Authors
Chacon-Murguia, Mario I. [1]
Guzman-Pando, Abimael [1]
Affiliations
[1] Tecnol Nacl Mex Inst Tecnol Chihuahua, Ave Tecnol 2909, CHIH, Chihuahua 31310, Mexico
Keywords
Moving object detection; Dynamic object detection; Convolutional neural network; Unbalanced classes
DOI
10.1007/s11063-022-11092-1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Moving object detection (MOD) methods must handle complex situations found in video scenarios, including bootstrapping, illumination changes, bad weather, pan-tilt-zoom (PTZ) cameras, intermittent objects, color camouflage, camera jitter, low camera frame rates, noisy videos, shadows, thermal videos, and night videos. Some of the most promising MOD methods are based on convolutional neural networks (CNNs), which are among the best-ranked algorithms on the CDnet14 dataset. This paper therefore presents a novel CNN for detecting moving objects, called Two-Frame CNN (2FraCNN). Unlike the best-ranked algorithms on CDnet14, 2FraCNN is a non-transfer-learning model and employs temporal information to estimate the motion of moving objects. Its architecture is inspired by how optical flow helps to estimate motion, and its core is the FlowNet architecture. 2FraCNN processes temporal information through the concatenation of two consecutive frames and an encoder-decoder architecture, and it includes a novel training scheme to deal with the unbalanced background/foreground pixel classes. 2FraCNN was evaluated using three different schemes: the CDnet14 benchmark, for a state-of-the-art comparison; human performance metric intervals, for a realistic evaluation; and, for practical purposes, the performance instrument PVADN, which considers the quantitative criteria of performance, speed, auto-adaptability, documentation, and novelty. The findings show that 2FraCNN performs comparably to the top ten algorithms on CDnet14 and is among the best twelve in the PVADN evaluation. 2FraCNN also demonstrated that it can solve many video challenge categories with human-like performance, such as dynamic backgrounds, jittering, shadow, bad weather, and thermal cameras, among others. Based on these findings, it can be concluded that 2FraCNN is a robust algorithm that handles different video conditions with performance competitive with state-of-the-art algorithms.
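As an illustration of the two-frame input described in the abstract, the following is a minimal sketch of channel-wise concatenation of two consecutive frames. It is not the authors' implementation: the function names `stack_two_frames` and `make_frame`, the nested-list frame layout (height x width x channels), and the three-channel frames are all assumptions made for the example.

```python
from typing import List

# Hypothetical frame layout: height x width x channels, as nested lists.
Frame = List[List[List[float]]]


def stack_two_frames(prev_frame: Frame, curr_frame: Frame) -> Frame:
    """Concatenate two same-sized frames along the channel axis."""
    if len(prev_frame) != len(curr_frame):
        raise ValueError("frames must have the same height")
    stacked = []
    for row_prev, row_curr in zip(prev_frame, curr_frame):
        if len(row_prev) != len(row_curr):
            raise ValueError("frames must have the same width")
        # Each output pixel holds the channels of frame t-1 followed by frame t.
        stacked.append([px_prev + px_curr
                        for px_prev, px_curr in zip(row_prev, row_curr)])
    return stacked


def make_frame(height: int, width: int, value: float) -> Frame:
    """Build a constant three-channel frame (illustrative helper)."""
    return [[[value] * 3 for _ in range(width)] for _ in range(height)]


frame_t0 = make_frame(2, 2, 0.0)  # stands in for frame t-1
frame_t1 = make_frame(2, 2, 1.0)  # stands in for frame t
pair = stack_two_frames(frame_t0, frame_t1)
print(len(pair), len(pair[0]), len(pair[0][0]))  # 2 2 6
```

Under these assumptions, two three-channel frames yield one six-channel input, which an encoder-decoder network could then consume as a single tensor.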
Pages: 5425-5449
Number of pages: 25
Source
Neural Processing Letters, 2023, 55: 5425-5449