A cross frame post-processing strategy for video object detection

被引：2

作者：

Song, Xin ^{[1
,2
]}

Qi, Ziqiang ^{[1
]}

Zhu, Jianlin ^{[1
]}

Li, Shuhua ^{[1
]}

机构：

[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China

[2] Northeastern Univ Qinhuangdao, Qinhuangdao 066004, Peoples R China

来源：

DISPLAYS | 2022年 / 73卷

基金：

中国国家自然科学基金;

关键词：

Video object detection; Post-processing; Deep learning; Optimization algorithm;

D O I：

10.1016/j.displa.2022.102230

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-based object detection plays an important role in the real world and scientific research. Compared with still images, video detection is more challenging due to occlusion, rare poses, high-speed movement, frames loss, etc. In order to improve the existing video stream detectors widely and with low coupling, a post-processing strategy, CFPP, is proposed in this work. The framework can establish a cross frame link based on deep learning, connect the proposals belonging to the same object, and improve the performance of the detector by optimizing the classification confidence and object coordinates. Furthermore, CFPP can connect the proposals in adjacent and non adjacent frames at the same time, which makes it exploit the context information of video stream more effectively than other post-processing strategies. Experiments shows that CFPP can improve the existing detectors (e.g. we improve the mAP of YOLOv4 on ImageNet VID dataset form 69.24% to 78.15%). In addition, experiments show that the designed framework can achieve better detection effect than other strategies in the case of high-speed moving object and frames loss.

引用

页数：10

共 50 条

[1] Robust and efficient post-processing for video object detection
Sabater, Alberto
Montesano, Luis
Murillo, Ana C.
[J]. 2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10536 - 10542
[2] An improved Gaussian Mixture Model with post-processing for multiple object detection in surveillance video analytics
Joy, Fancy
Vijayakumar, V.
[J]. INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (08) : 653 - 660
[3] Efficient FPGA-based Accelerator for Post-Processing in Object Detection
Guo, Zibo
Liu, Kai
Liu, Wei
Li, Shangrong
[J]. 2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT, 2023, : 125 - 131
[4] A video pre/post-processing LSI for video capture
Kinugasa, T
Nishizawa, A
Koshio, K
Iguchi, T
Kamimura, J
Marumori, H
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1996, 42 (03) : 776 - 780
[5] Video pre/post-processing LSI for video capture
Hitachi Ltd, Yokohama, Japan
[J]. IEEE Trans Consum Electron, 3 (776-780):
[6] A video pre/post-processing LSI for video capture
Kinugasa, T
Nishizawa, A
Koshio, K
Iguchi, T
Kamimura, J
Marumori, H
[J]. ICCE - INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 1996 DIGEST OF TECHNICAL PAPERS, 1996, : 396 - 397
[7] PBDE: an effective post-processing method based on box density for object detection
Zhishan Li
Baozhi Jia
Yifan He
Lei Xie
[J]. Applied Intelligence, 2022, 52 : 2930 - 2941
[8] PBDE: an effective post-processing method based on box density for object detection
Li, Zhishan
Jia, Baozhi
He, Yifan
Xie, Lei
[J]. APPLIED INTELLIGENCE, 2022, 52 (03) : 2930 - 2941
[9] Reducing the complexity of iterative post-processing of video
Robertson, MA
Stevenson, RL
[J]. 1998 MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, 1999, : 399 - 402
[10] Post-processing of compressed video using a unified metric for digital video processing
Boroczky, L
Yang, YB
[J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2004, PTS 1 AND 2, 2004, 5308 : 124 - 131

← 1 2 3 4 5 →