Global Memory and Local Continuity for Video Object Detection

Cited by: 13

Authors:

Han, Liang [1]

Yin, Zhaozheng [1,2]

Affiliations:

[1] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA

[2] SUNY Stony Brook, Dept Biomed Informat, Stony Brook, NY 11794 USA

Source:

IEEE TRANSACTIONS ON MULTIMEDIA | 2023 / Vol. 25

Funding:

National Science Foundation (USA);

Keywords:

Feature extraction; Object detection; Detectors; Proposals; Target tracking; Signal processing algorithms; Costs; Video object detection; global memory bank; feature aggregation; local continuity; object tracker;

DOI:

10.1109/TMM.2022.3164253

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

To deal with challenges in video object detection (VOD) such as occlusion and motion blur, many state-of-the-art video object detectors adopt a feature aggregation module that encodes long-range contextual information to support the current frame. These detectors have three main drawbacks: first, frame-wise detection slows down the detection speed; second, frame-wise detection usually ignores the local continuity of the objects in a video, resulting in temporally inconsistent detections; third, the feature aggregation module usually encodes temporal features from either a local video clip or a single video, without exploiting the features in other videos. In this work, we develop an online VOD algorithm that aims at a balance between high speed and high accuracy by exploiting global memory and local continuity. In the algorithm, an effective and efficient global memory bank (GMB) is designed to deposit and update object class features, which enables us to exploit support features from other videos to enhance object features in the current video frames. In addition, to further speed up detection, we design an object tracker that performs object detection on non-key frames based on the detection results of the key frame, leveraging the local continuity property of the video. When the trade-off between detection accuracy and speed is considered, the proposed framework achieves superior performance on the ImageNet VID dataset. Source code will be released to the public via our GitHub website.
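
The two ideas named in the abstract, a class-wise global memory bank and a key-frame detector paired with a tracker for non-key frames, can be illustrated with a minimal sketch. This is not the authors' implementation: the class `GlobalMemoryBank`, the function `detect_video`, the FIFO update, the softmax-weighted aggregation, and the fixed `key_interval` are all hypothetical choices made only for illustration, and the `detect` and `track` callables stand in for a real detector and tracker.

```python
import math
from collections import defaultdict, deque


class GlobalMemoryBank:
    """Hypothetical class-wise feature store (an assumption, not the paper's GMB)."""

    def __init__(self, capacity_per_class=4):
        # One bounded FIFO queue of feature vectors per object class.
        self.slots = defaultdict(lambda: deque(maxlen=capacity_per_class))

    def update(self, class_id, feature):
        # Deposit a feature; the oldest entry is dropped once the queue is full.
        self.slots[class_id].append(list(feature))

    def enhance(self, class_id, feature):
        # Return the feature unchanged when no support features exist yet.
        support = self.slots.get(class_id)
        if not support:
            return list(feature)
        # Softmax over dot-product similarity to each stored support feature.
        scores = [sum(a * b for a, b in zip(feature, s)) for s in support]
        peak = max(scores)
        weights = [math.exp(x - peak) for x in scores]
        total = sum(weights)
        aggregated = [
            sum(w * s[i] for w, s in zip(weights, support)) / total
            for i in range(len(feature))
        ]
        # Average the original feature with the aggregated support feature.
        return [(f + a) / 2 for f, a in zip(feature, aggregated)]


def detect_video(frames, detect, track, bank, key_interval=5):
    """Run the detector on key frames and the tracker on the frames in between."""
    results = []
    previous = []
    for index, frame in enumerate(frames):
        if index % key_interval == 0:
            # Key frame: full detection, with access to the memory bank.
            previous = detect(frame, bank)
        else:
            # Non-key frame: propagate the previous result (local continuity).
            previous = track(frame, previous)
        results.append(previous)
    return results
```

Because the bank is indexed by object class rather than by video, features deposited while processing one video remain available when a later video is processed, which is the cross-video property the abstract emphasizes. A fixed key-frame interval is the simplest possible schedule; the abstract does not say how key frames are chosen.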

Pages: 3681-3693

Page count: 13
