A cross frame post-processing strategy for video object detection

被引:2
|
作者
Song, Xin [1 ,2 ]
Qi, Ziqiang [1 ]
Zhu, Jianlin [1 ]
Li, Shuhua [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
[2] Northeastern Univ Qinhuangdao, Qinhuangdao 066004, Peoples R China
基金
中国国家自然科学基金;
关键词
Video object detection; Post-processing; Deep learning; Optimization algorithm;
D O I
10.1016/j.displa.2022.102230
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Video-based object detection plays an important role in the real world and scientific research. Compared with still images, video detection is more challenging due to occlusion, rare poses, high-speed movement, frames loss, etc. In order to improve the existing video stream detectors widely and with low coupling, a post-processing strategy, CFPP, is proposed in this work. The framework can establish a cross frame link based on deep learning, connect the proposals belonging to the same object, and improve the performance of the detector by optimizing the classification confidence and object coordinates. Furthermore, CFPP can connect the proposals in adjacent and non adjacent frames at the same time, which makes it exploit the context information of video stream more effectively than other post-processing strategies. Experiments shows that CFPP can improve the existing detectors (e.g. we improve the mAP of YOLOv4 on ImageNet VID dataset form 69.24% to 78.15%). In addition, experiments show that the designed framework can achieve better detection effect than other strategies in the case of high-speed moving object and frames loss.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Robust and efficient post-processing for video object detection
    Sabater, Alberto
    Montesano, Luis
    Murillo, Ana C.
    [J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
  • [2] An improved Gaussian Mixture Model with post-processing for multiple object detection in surveillance video analytics
    Joy, Fancy
    Vijayakumar, V.
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (08) : 653 - 660
  • [3] Efficient FPGA-based Accelerator for Post-Processing in Object Detection
    Guo, Zibo
    Liu, Kai
    Liu, Wei
    Li, Shangrong
    [J]. 2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 125 - 131
  • [4] A video pre/post-processing LSI for video capture
    Kinugasa, T
    Nishizawa, A
    Koshio, K
    Iguchi, T
    Kamimura, J
    Marumori, H
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1996, 42 (03) : 776 - 780
  • [5] Video pre/post-processing LSI for video capture
    Hitachi Ltd, Yokohama, Japan
    [J]. IEEE Trans Consum Electron, 3 (776-780):
  • [6] A video pre/post-processing LSI for video capture
    Kinugasa, T
    Nishizawa, A
    Koshio, K
    Iguchi, T
    Kamimura, J
    Marumori, H
    [J]. ICCE - INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 1996 DIGEST OF TECHNICAL PAPERS, 1996, : 396 - 397
  • [7] PBDE: an effective post-processing method based on box density for object detection
    Zhishan Li
    Baozhi Jia
    Yifan He
    Lei Xie
    [J]. Applied Intelligence, 2022, 52 : 2930 - 2941
  • [8] PBDE: an effective post-processing method based on box density for object detection
    Li, Zhishan
    Jia, Baozhi
    He, Yifan
    Xie, Lei
    [J]. APPLIED INTELLIGENCE, 2022, 52 (03) : 2930 - 2941
  • [9] Reducing the complexity of iterative post-processing of video
    Robertson, MA
    Stevenson, RL
    [J]. 1998 MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, 1999, : 399 - 402
  • [10] Post-processing of compressed video using a unified metric for digital video processing
    Boroczky, L
    Yang, YB
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2004, PTS 1 AND 2, 2004, 5308 : 124 - 131