XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Cited by: 118
Authors
Cheng, Ho Kei [1]
Schwing, Alexander G. [1]
Affiliations
[1] Univ Illinois, Champaign, IL 61820 USA
DOI
10.1007/978-3-031-19815-1_37
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present XMem, a video object segmentation architecture for long videos with unified feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video object segmentation typically only uses one type of feature memory. For videos longer than a minute, a single feature memory model tightly links memory consumption and accuracy. In contrast, following the Atkinson-Shiffrin model, we develop an architecture that incorporates multiple independent yet deeply-connected feature memory stores: a rapidly updated sensory memory, a high-resolution working memory, and a compact thus sustained long-term memory. Crucially, we develop a memory potentiation algorithm that routinely consolidates actively used working memory elements into the long-term memory, which avoids memory explosion and minimizes performance decay for long-term prediction. Combined with a new memory reading mechanism, XMem greatly exceeds state-of-the-art performance on long-video datasets while being on par with state-of-the-art methods (that do not work on long videos) on short-video datasets.
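The consolidation idea in the abstract can be illustrated with a toy sketch. This is not XMem's implementation: the names `Entry`, `ToyMemory`, `working_capacity`, and `keep_top` are hypothetical, features are single floats rather than feature maps, the sensory memory is omitted, and "actively used" is approximated by a simple read counter. It only shows how promoting frequently read working-memory entries into a compact long-term store keeps the working memory bounded.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    frame: int        # index of the frame this entry came from
    feature: float    # stand-in for a feature vector (assumption)
    usage: int = 0    # how many reads matched this entry


@dataclass
class ToyMemory:
    working_capacity: int = 4   # hypothetical size limit of working memory
    keep_top: int = 2           # hypothetical number of entries promoted
    working: list = field(default_factory=list)
    long_term: list = field(default_factory=list)

    def write(self, frame, feature):
        """Add an entry to working memory; consolidate when it overflows."""
        self.working.append(Entry(frame, feature))
        if len(self.working) > self.working_capacity:
            self.consolidate()

    def read(self, query):
        """Return the closest stored entry and record that it was used.

        Assumes at least one entry has been written.
        """
        candidates = self.working + self.long_term
        best = min(candidates, key=lambda e: abs(e.feature - query))
        best.usage += 1
        return best

    def consolidate(self):
        """Promote the most-used older entries, drop the rest, keep the newest."""
        newest = self.working[-1]
        older = self.working[:-1]
        ranked = sorted(older, key=lambda e: e.usage, reverse=True)
        self.long_term.extend(ranked[: self.keep_top])
        self.working = [newest]


memory = ToyMemory()
for t in range(4):
    memory.write(t, float(t))
memory.read(1.1)       # closest to frame 1
memory.read(0.9)       # closest to frame 1 again
memory.read(2.9)       # closest to frame 3
memory.write(4, 4.0)   # exceeds capacity and triggers consolidation
promoted = sorted(e.frame for e in memory.long_term)
print(promoted)
```

In this toy run, frames 1 and 3 were the only entries that reads matched, so they are the ones moved to the long-term store, while working memory shrinks back to the newest frame.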
Pages: 640-658
Page count: 19