Efficient Semisupervised Object Segmentation for Long-Term Videos Using Adaptive Memory Network

被引：0

作者：

Zhong, Shan ^{[1
,2
,3
]}

Li, Guoqiang ^{[2
]}

Ying, Wenhao ^{[1
]}

Zhao, Fuzhou ^{[4
]}

Xie, Gengsheng ^{[5
]}

Gong, Shengrong ^{[1
,2
,3
]}

机构：

[1] Changshu Inst Technol, Sch Comp Sci & Engn, Changshu 215500, Jiangsu, Peoples R China

[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215000, Jiangsu, Peoples R China

[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130000, Peoples R China

[4] Changshu Inst Technol, Sch Automot Engn, Suzhou 215000, Jiangsu, Peoples R China

[5] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Jiangxi, Peoples R China

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2024年 / 16卷 / 05期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Feature extraction; Videos; Object recognition; Data mining; Adaptation models; Adaptive systems; Video sequences; Long-term videos; memory network; object segmentation; semisupervised learning;

D O I：

10.1109/TCDS.2024.3385849

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video object segmentation (VOS) uses the first annotated video mask to achieve consistent and precise segmentation in subsequent frames. Recently, memory-based methods have received significant attention owing to their substantial performance enhancements. However, these approaches rely on a fixed global memory strategy, which poses a challenge to segmentation accuracy and speed in the context of longer videos. To alleviate this limitation, we propose a novel semisupervised VOS model, founded on the principles of the adaptive memory network. Our proposed model adaptively extracts object features by focusing on the object area while effectively filtering out extraneous background noise. An identification mechanism is also thoughtfully applied to discern each object in multiobject scenarios. To further reduce storage consumption without compromising the saliency of object information, the outdated features residing in the memory pool are compressed into salient features through the employment of a self-attention mechanism. Furthermore, we introduce a local matching module, specifically devised to refine object features by fusing the contextual information from historical frames. We demonstrate the efficiency of our approach through experiments, substantially augmenting both the speed and precision of segmentation for long-term videos, while maintaining comparable performance for short videos.

引用

页码：1789 / 1802

页数：14

共 50 条

[31] Video Object Segmentation with Dynamic Memory Networks and Adaptive Object Alignment
Liang, Shuxian
Shen, Xu
Huang, Jianqiang
Hua, Xian-Sheng
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8045 - 8054
[32] Content-adaptive long-term prediction with reduced memory
Kutka, R
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 817 - 820
[33] Prefrontal Cortex Represents Long-Term Memory of Object Values for Months
Ghazizadeh, Ali
Hong, Simon
Hikosaka, Okihide
CURRENT BIOLOGY, 2018, 28 (14) : 2206 - +
[34] Anterior retrosplenial cortex is required for long-term object recognition memory
Ana Belén de Landeta
Magdalena Pereyra
Jorge H. Medina
Cynthia Katche
Scientific Reports, 10
[35] Modulation of long-term memory for object recognition via HDAC inhibition
Stefanko, Daniel P.
Barrett, Ruth M.
Ly, Alexandra R.
Reolon, Gustavo K.
Wood, Marcelo A.
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (23) : 9447 - 9452
[36] Robust long-term object tracking with adaptive scale and rotation estimation
Lu, Huimin
Xiong, Dan
Xiao, Junhao
Zheng, Zhiqiang
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (02):
[37] Robust Long-term Object Tracking With Adaptive Scale and Rotation Estimation
Xiong D.
Lu H.-M.
Xiao J.-H.
Zheng Z.-Q.
Zidonghua Xuebao/Acta Automatica Sinica, 2019, 45 (02): : 289 - 304
[38] Anterior retrosplenial cortex is required for long-term object recognition memory
Belen de Landeta, Ana
Pereyra, Magdalena
Medina, Jorge H.
Katche, Cynthia
SCIENTIFIC REPORTS, 2020, 10 (01)
[39] Conversion of short-term to long-term memory in the novel object recognition paradigm
Moore, Shannon J.
Deshpande, Kaivalya
Stinnett, Gwen S.
Seasholtz, Audrey F.
Murphy, Geoffrey G.
NEUROBIOLOGY OF LEARNING AND MEMORY, 2013, 105 : 174 - 185
[40] Handwritten character recognition using skewed line segmentation method and long short term memory network
Asha Kathigi
Krishnappa Honnamachanahalli Kariputtaiah
International Journal of System Assurance Engineering and Management, 2022, 13 : 1733 - 1745

← 1 2 3 4 5 →