Efficient Semisupervised Object Segmentation for Long-Term Videos Using Adaptive Memory Network

被引:0
|
作者
Zhong, Shan [1 ,2 ,3 ]
Li, Guoqiang [2 ]
Ying, Wenhao [1 ]
Zhao, Fuzhou [4 ]
Xie, Gengsheng [5 ]
Gong, Shengrong [1 ,2 ,3 ]
机构
[1] Changshu Inst Technol, Sch Comp Sci & Engn, Changshu 215500, Jiangsu, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215000, Jiangsu, Peoples R China
[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130000, Peoples R China
[4] Changshu Inst Technol, Sch Automot Engn, Suzhou 215000, Jiangsu, Peoples R China
[5] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Jiangxi, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Feature extraction; Videos; Object recognition; Data mining; Adaptation models; Adaptive systems; Video sequences; Long-term videos; memory network; object segmentation; semisupervised learning;
D O I
10.1109/TCDS.2024.3385849
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video object segmentation (VOS) uses the first annotated video mask to achieve consistent and precise segmentation in subsequent frames. Recently, memory-based methods have received significant attention owing to their substantial performance enhancements. However, these approaches rely on a fixed global memory strategy, which poses a challenge to segmentation accuracy and speed in the context of longer videos. To alleviate this limitation, we propose a novel semisupervised VOS model, founded on the principles of the adaptive memory network. Our proposed model adaptively extracts object features by focusing on the object area while effectively filtering out extraneous background noise. An identification mechanism is also thoughtfully applied to discern each object in multiobject scenarios. To further reduce storage consumption without compromising the saliency of object information, the outdated features residing in the memory pool are compressed into salient features through the employment of a self-attention mechanism. Furthermore, we introduce a local matching module, specifically devised to refine object features by fusing the contextual information from historical frames. We demonstrate the efficiency of our approach through experiments, substantially augmenting both the speed and precision of segmentation for long-term videos, while maintaining comparable performance for short videos.
引用
收藏
页码:1789 / 1802
页数:14
相关论文
共 50 条
  • [31] Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment
    Liang, Shuxian
    Shen, Xu
    Huang, Jianqiang
    Hua, Xian-Sheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8045 - 8054
  • [32] Content-adaptive long-term prediction with reduced memory
    Kutka, R
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 817 - 820
  • [33] Prefrontal Cortex Represents Long-Term Memory of Object Values for Months
    Ghazizadeh, Ali
    Hong, Simon
    Hikosaka, Okihide
    CURRENT BIOLOGY, 2018, 28 (14) : 2206 - +
  • [34] Anterior retrosplenial cortex is required for long-term object recognition memory
    Ana Belén de Landeta
    Magdalena Pereyra
    Jorge H. Medina
    Cynthia Katche
    Scientific Reports, 10
  • [35] Modulation of long-term memory for object recognition via HDAC inhibition
    Stefanko, Daniel P.
    Barrett, Ruth M.
    Ly, Alexandra R.
    Reolon, Gustavo K.
    Wood, Marcelo A.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) : 9447 - 9452
  • [36] Robust long-term object tracking with adaptive scale and rotation estimation
    Lu, Huimin
    Xiong, Dan
    Xiao, Junhao
    Zheng, Zhiqiang
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02):
  • [37] Robust Long-term Object Tracking With Adaptive Scale and Rotation Estimation
    Xiong D.
    Lu H.-M.
    Xiao J.-H.
    Zheng Z.-Q.
    Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (02): : 289 - 304
  • [38] Anterior retrosplenial cortex is required for long-term object recognition memory
    Belen de Landeta, Ana
    Pereyra, Magdalena
    Medina, Jorge H.
    Katche, Cynthia
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [39] Conversion of short-term to long-term memory in the novel object recognition paradigm
    Moore, Shannon J.
    Deshpande, Kaivalya
    Stinnett, Gwen S.
    Seasholtz, Audrey F.
    Murphy, Geoffrey G.
    NEUROBIOLOGY OF LEARNING AND MEMORY, 2013, 105 : 174 - 185
  • [40] Handwritten character recognition using skewed line segmentation method and long short term memory network
    Asha Kathigi
    Krishnappa Honnamachanahalli Kariputtaiah
    International Journal of System Assurance Engineering and Management, 2022, 13 : 1733 - 1745