Efficient Semisupervised Object Segmentation for Long-Term Videos Using Adaptive Memory Network

被引:0
|
作者
Zhong, Shan [1 ,2 ,3 ]
Li, Guoqiang [2 ]
Ying, Wenhao [1 ]
Zhao, Fuzhou [4 ]
Xie, Gengsheng [5 ]
Gong, Shengrong [1 ,2 ,3 ]
机构
[1] Changshu Inst Technol, Sch Comp Sci & Engn, Changshu 215500, Jiangsu, Peoples R China
[2] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215000, Jiangsu, Peoples R China
[3] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130000, Peoples R China
[4] Changshu Inst Technol, Sch Automot Engn, Suzhou 215000, Jiangsu, Peoples R China
[5] Jiangxi Normal Univ, Sch Software, Nanchang 330022, Jiangxi, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Feature extraction; Videos; Object recognition; Data mining; Adaptation models; Adaptive systems; Video sequences; Long-term videos; memory network; object segmentation; semisupervised learning;
D O I
10.1109/TCDS.2024.3385849
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video object segmentation (VOS) uses the first annotated video mask to achieve consistent and precise segmentation in subsequent frames. Recently, memory-based methods have received significant attention owing to their substantial performance enhancements. However, these approaches rely on a fixed global memory strategy, which poses a challenge to segmentation accuracy and speed in the context of longer videos. To alleviate this limitation, we propose a novel semisupervised VOS model, founded on the principles of the adaptive memory network. Our proposed model adaptively extracts object features by focusing on the object area while effectively filtering out extraneous background noise. An identification mechanism is also thoughtfully applied to discern each object in multiobject scenarios. To further reduce storage consumption without compromising the saliency of object information, the outdated features residing in the memory pool are compressed into salient features through the employment of a self-attention mechanism. Furthermore, we introduce a local matching module, specifically devised to refine object features by fusing the contextual information from historical frames. We demonstrate the efficiency of our approach through experiments, substantially augmenting both the speed and precision of segmentation for long-term videos, while maintaining comparable performance for short videos.
引用
收藏
页码:1789 / 1802
页数:14
相关论文
共 50 条
  • [41] Handwritten character recognition using skewed line segmentation method and long short term memory network
    Kathigi, Asha
    Kariputtaiah, Krishnappa Honnamachanahalli
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2022, 13 (04) : 1733 - 1745
  • [42] Immune Memory: the Basics and How to Trigger an Efficient Long-Term Immune Memory
    Beverley, P. C. L.
    JOURNAL OF COMPARATIVE PATHOLOGY, 2010, 141 : S91 - S95
  • [43] Learning the Long-term Memory Effect of Power Amplifiers Using Temporal Convolutional Network
    Akram, Iqra
    Ma, Yi
    He, Ziming
    Tong, Fei
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [44] PLAYFIELD SEGMENTATION FOR BASEBALL VIDEOS USING ADAPTIVE GMMS
    Kuo, Chung-Ming
    Hung, Mao-Hsiung
    Liu, Chih-Shan
    Chang, Yukon
    Hsieh, Chaur-Heh
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (06): : 2787 - 2801
  • [45] Human Brain Tissue Segmentation in fMRI using Deep Long-Term Recurrent Convolutional Network
    Ang, Sui Paul
    Phung, Son Lam
    Schira, Mark Matthias
    Bouzerdoum, Abdesselam
    Soan Thi Minh Duong
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 630 - 636
  • [46] Energy Efficient Ultra-Dense Network Using Long Short-Term Memory
    Son, Junwon
    Kim, Seungnyun
    Shim, Byonghyo
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,
  • [47] Efficient Long-Short Temporal Attention network for unsupervised Video Object Segmentation
    Li, Ping
    Zhang, Yu
    Yuan, Li
    Xiao, Huaxin
    Lin, Binbin
    Xu, Xianghua
    PATTERN RECOGNITION, 2024, 146
  • [48] A Memory Model Based on the Siamese Network for Long-Term Tracking
    Lee, Hankyeol
    Choi, Seokeon
    Kim, Changick
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 100 - 115
  • [49] Hierarchical Memory Matching Network for Video Object Segmentation
    Seong, Hongje
    Oh, Seoung Wug
    Lee, Joon-Young
    Lee, Seongwon
    Lee, Suhyeon
    Kim, Euntai
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 12869 - 12878
  • [50] EFFICIENT SEMANTIC-BASED VEHICLE RETRIEVAL IN LONG-TERM CAR PARK VIDEOS
    Cheong, Clarence Weihan
    Lim, Ryan Woei-Sheng
    See, John
    Wong, Lai-Kuan
    Tan, Ian K. T.
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 138 - 143