Hierarchical Memory Matching Network for Video Object Segmentation

被引:55
|
作者
Seong, Hongje [1 ]
Oh, Seoung Wug [2 ]
Lee, Joon-Young [2 ]
Lee, Seongwon [1 ]
Lee, Suhyeon [1 ]
Kim, Euntai [1 ]
机构
[1] Yonsei Univ, Seoul, South Korea
[2] Adobe Res, San Jose, CA USA
关键词
D O I
10.1109/ICCV48922.2021.01265
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present Hierarchical Memory Matching Network (HMMN) for semi-supervised video object segmentation. Based on a recent memory-based method [33], we propose two advanced memory read modules that enable us to perform memory reading in multiple scales while exploiting temporal smoothness. We first propose a kernel guided memory matching module that replaces the non-local dense memory read, commonly adopted in previous memory-based methods. The module imposes the temporal smoothness constraint in the memory read, leading to accurate memory retrieval. More importantly, we introduce a hierarchical memory matching scheme and propose a top-k guided memory matching module in which memory read on a fine-scale is guided by that on a coarse-scale. With the module, we perform memory read in multiple scales efficiently and leverage both high-level semantic and low-level fine-grained memory features to predict detailed object masks. Our network achieves state-of-the-art performance on the validation sets of DAVIS 2016/2017 (90.8% and 84.7%) and YouTube-VOS 2018/2019 (82.6% and 82.5%), and test-dev set of DAVIS 2017 (78.6%). The source code and model are available online: https://github.com/Hongje/HMMN.
引用
收藏
页码:12869 / 12878
页数:10
相关论文
共 50 条
  • [1] Modulated Memory Network for Video Object Segmentation
    Lu, Hannan
    Guo, Zixian
    Zuo, Wangmeng
    [J]. MATHEMATICS, 2024, 12 (06)
  • [2] Hierarchical Video Object Segmentation
    Xing, Junliang
    Ai, Haizhou
    Lao, Shihong
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 67 - 71
  • [3] Reliability-Guided Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation
    Zhou, Zikun
    Mao, Kaige
    Pei, Wenjie
    Wang, Hongpeng
    Wang, Yaowei
    He, Zhenyu
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [4] Robust and Efficient Memory Network for Video Object Segmentation
    Chen, Yadang
    Zhang, Dingwei
    Yang, Zhi-Xin
    Wu, Enhua
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1769 - 1774
  • [5] Efficient Regional Memory Network for Video Object Segmentation
    Xie, Haozhe
    Yao, Hongxun
    Zhou, Shangchen
    Zhang, Shengping
    Sun, Wenxiu
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1286 - 1295
  • [6] Kernel based local matching network for video object segmentation
    Wang, Guoqiang
    Li, Lan
    Zhu, Min
    Zhao, Rui
    Zhang, Xiang
    [J]. MACHINE VISION AND APPLICATIONS, 2024, 35 (03)
  • [7] Kernel based local matching network for video object segmentation
    Guoqiang Wang
    Lan Li
    Min Zhu
    Rui Zhao
    Xiang Zhang
    [J]. Machine Vision and Applications, 2024, 35
  • [8] Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
    Pei, Gensheng
    Shen, Fumin
    Yao, Yazhou
    Xie, Guo-Sen
    Tang, Zhenmin
    Tang, Jinhui
    [J]. COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 596 - 613
  • [9] Global Spectral Filter Memory Network for Video Object Segmentation
    Liu, Yong
    Yu, Ran
    Wang, Jiahao
    Zhao, Xinyuan
    Wang, Yitong
    Tang, Yansong
    Yang, Yujiu
    [J]. COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 648 - 665
  • [10] Dual Temporal Memory Network for Efficient Video Object Segmentation
    Zhang, Kaihua
    Wang, Long
    Liu, Dong
    Liu, Bo
    Liu, Qingshan
    Li, Zhu
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1515 - 1523