Efficient Semisupervised Object Segmentation for Long-Term Videos Using Adaptive Memory Network

被引：0

作者：

Zhong, Shan ^{[1
,2
,3
]}

Li, Guoqiang ^{[2
]}

Ying, Wenhao ^{[1
]}

Zhao, Fuzhou ^{[4
]}

Xie, Gengsheng ^{[5
]}

Gong, Shengrong ^{[1
,2
,3
]}

机构：

[1] Changshu Inst Technol, Sch Comp Sci & Engn, Changshu 215500, Jiangsu, Peoples R China

[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215000, Jiangsu, Peoples R China

[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130000, Peoples R China

[4] Changshu Inst Technol, Sch Automot Engn, Suzhou 215000, Jiangsu, Peoples R China

[5] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Jiangxi, Peoples R China

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2024年 / 16卷 / 05期

基金：

中国博士后科学基金; 中国国家自然科学基金;

关键词：

Feature extraction; Videos; Object recognition; Data mining; Adaptation models; Adaptive systems; Video sequences; Long-term videos; memory network; object segmentation; semisupervised learning;

D O I：

10.1109/TCDS.2024.3385849

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video object segmentation (VOS) uses the first annotated video mask to achieve consistent and precise segmentation in subsequent frames. Recently, memory-based methods have received significant attention owing to their substantial performance enhancements. However, these approaches rely on a fixed global memory strategy, which poses a challenge to segmentation accuracy and speed in the context of longer videos. To alleviate this limitation, we propose a novel semisupervised VOS model, founded on the principles of the adaptive memory network. Our proposed model adaptively extracts object features by focusing on the object area while effectively filtering out extraneous background noise. An identification mechanism is also thoughtfully applied to discern each object in multiobject scenarios. To further reduce storage consumption without compromising the saliency of object information, the outdated features residing in the memory pool are compressed into salient features through the employment of a self-attention mechanism. Furthermore, we introduce a local matching module, specifically devised to refine object features by fusing the contextual information from historical frames. We demonstrate the efficiency of our approach through experiments, substantially augmenting both the speed and precision of segmentation for long-term videos, while maintaining comparable performance for short videos.

引用

页码：1789 / 1802

页数：14

共 50 条

[41] Handwritten character recognition using skewed line segmentation method and long short term memory network
Kathigi, Asha
Kariputtaiah, Krishnappa Honnamachanahalli
INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022, 13 (04) : 1733 - 1745
[42] Immune Memory: the Basics and How to Trigger an Efficient Long-Term Immune Memory
Beverley, P. C. L.
JOURNAL OF COMPARATIVE PATHOLOGY, 2010, 141 : S91 - S95
[43] Learning the Long-term Memory Effect of Power Amplifiers Using Temporal Convolutional Network
Akram, Iqra
Ma, Yi
He, Ziming
Tong, Fei
2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
[44] PLAYFIELD SEGMENTATION FOR BASEBALL VIDEOS USING ADAPTIVE GMMS
Kuo, Chung-Ming
Hung, Mao-Hsiung
Liu, Chih-Shan
Chang, Yukon
Hsieh, Chaur-Heh
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2787 - 2801
[45] Human Brain Tissue Segmentation in fMRI using Deep Long-Term Recurrent Convolutional Network
Ang, Sui Paul
Phung, Son Lam
Schira, Mark Matthias
Bouzerdoum, Abdesselam
Soan Thi Minh Duong
2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 630 - 636
[46] Energy Efficient Ultra-Dense Network Using Long Short-Term Memory
Son, Junwon
Kim, Seungnyun
Shim, Byonghyo
2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
[47] Efficient Long-Short Temporal Attention network for unsupervised Video Object Segmentation
Li, Ping
Zhang, Yu
Yuan, Li
Xiao, Huaxin
Lin, Binbin
Xu, Xianghua
PATTERN RECOGNITION, 2024, 146
[48] A Memory Model Based on the Siamese Network for Long-Term Tracking
Lee, Hankyeol
Choi, Seokeon
Kim, Changick
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 100 - 115
[49] Hierarchical Memory Matching Network for Video Object Segmentation
Seong, Hongje
Oh, Seoung Wug
Lee, Joon-Young
Lee, Seongwon
Lee, Suhyeon
Kim, Euntai
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12869 - 12878
[50] EFFICIENT SEMANTIC-BASED VEHICLE RETRIEVAL IN LONG-TERM CAR PARK VIDEOS
Cheong, Clarence Weihan
Lim, Ryan Woei-Sheng
See, John
Wong, Lai-Kuan
Tan, Ian K. T.
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 138 - 143

← 1 2 3 4 5 →