Dual-Memory Feature Aggregation for Video Object Detection

被引:0
|
作者
Fan, Diwei [1 ,2 ,3 ]
Zheng, Huicheng [1 ,2 ,3 ]
Dang, Jisheng [1 ,2 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
[3] Guangdong Prov Key Lab Informat Secur Technol, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
video object detection; feature aggregation; temporal information; global memory; local feature cache;
D O I
10.1007/978-981-99-8537-1_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent studies on video object detection have shown the advantages of aggregating features across frames to capture temporal information, which can mitigate appearance degradation, such as occlusion, motion blur, and defocus. However, these methods often employ a sliding window or memory queue to store temporal information frame by frame, leading to discarding features of earlier frames over time. To address this, we propose a dual-memory feature aggregation framework (DMFA). DMFA simultaneously constructs a local feature cache and a global feature memory in a feature-wise updating way at different granularities, i.e., pixel level and proposal level. This approach can partially preserve key features across frames. The local feature cache stores the spatio-temporal contexts from nearby frames to boost the localization capacity, while the global feature memory enhances semantic feature representation by capturing temporal information from all previous frames. Moreover, we introduce contrastive learning to improve the discriminability of temporal features, resulting in more accurate proposal-level feature aggregation. Extensive experiments demonstrate that our method achieves state-of-the-art performance on the ImageNet VID benchmark.
引用
收藏
页码:220 / 232
页数:13
相关论文
共 50 条
  • [21] Feature aggregation network for small object detection
    Jing, Rudong
    Zhang, Wei
    Li, Yuzhuo
    Li, Wenlin
    Liu, Yanyan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 255
  • [22] Class-Aware Dual-Supervised Aggregation Network for Video Object Detection
    Qi, Qiang
    Yan, Yan
    Wang, Hanzi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2109 - 2123
  • [23] MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection
    Sun, Guanxiong
    Hua, Yang
    Hu, Guosheng
    Robertson, Neil
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2620 - 2627
  • [24] Relation-Guided Multi-stage Feature Aggregation Network for Video Object Detection
    Yao, Tingting
    Cao, Fuxiao
    Mi, Fuheng
    Li, Danmeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 146 - 157
  • [25] DAFA: Diversity-Aware Feature Aggregation for Attention-Based Video Object Detection
    Roh, Si-Dong
    Chung, Ki-Seok
    IEEE ACCESS, 2022, 10 : 93453 - 93463
  • [26] RoI Feature Propagation for Video Object Detection
    Cores, Daniel
    Mucientes, Manuel
    Brea, Victor M.
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2680 - 2687
  • [27] Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation
    Wang, Dongsheng
    Jia, Xu
    Zhang, Yang
    Zhang, Xinyu
    Wang, Yaoyuan
    Zhang, Ziyang
    Wang, Dong
    Lu, Huchuan
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 2492 - 2500
  • [28] The specificity of learned parallelism in dual-memory retrieval
    Strobach, Tilo
    Schubert, Torsten
    Pashler, Harold
    Rickard, Timothy
    MEMORY & COGNITION, 2014, 42 (04) : 552 - 569
  • [29] The specificity of learned parallelism in dual-memory retrieval
    Tilo Strobach
    Torsten Schubert
    Harold Pashler
    Timothy Rickard
    Memory & Cognition, 2014, 42 : 552 - 569
  • [30] Design and Analysis of Magnet Proportioning for Dual-Memory Machines
    Li, Fuhua
    Chau, K. T.
    Liu, Chunhua
    Jiang, J. Z.
    Wang, Winson Yong
    IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY, 2012, 22 (03)