RANet: Ranking Attention Network for Fast Video Object Segmentation

被引:132
|
作者
Wang, Ziqin [1 ,3 ]
Xu, Jun [2 ,4 ]
Liu, Li [2 ]
Zhu, Fan [2 ]
Shao, Ling [2 ]
机构
[1] Univ Sydney, Sydney, NSW, Australia
[2] Incept Inst Artificial Intelligence IIAI, Abu Dhabi, U Arab Emirates
[3] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian, Peoples R China
[4] Nankai Univ, Coll Comp Sci, Media Comp Lab, Tianjin, Peoples R China
关键词
D O I
10.1109/ICCV.2019.00408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite online learning (OL) techniques have boosted the performance of semi-supervised video object segmentation (VOS) methods, the huge time costs of OL greatly restricts their practicality. Matching based and propagation based methods run at a faster speed by avoiding OL techniques. However, they are limited by sub-optimal accuracy, due to mismatching and drifting problems. In this paper, we develop a real-time yet very accurate Ranking Attention Network (RANet) for VOS. Specifically, to integrate the insights of matching based and propagation based methods, we employ an encoder-decoder framework to learn pixel-level similarity and segmentation in an end-to-end manner. To better utilize the similarity maps, we propose a novel ranking attention module, which automatically ranks and selects these maps for fine-grained VOS performance. Experiments on DAVIS(16) and DAVIS(17) datasets show that our RANet achieves the best speed-accuracy trade-off, e.g., with 33 milliseconds per frame and J&F=85:5% on DAVIS(16). With OL, our RANet reaches J&F=87:1% on DAVIS(16), exceeding state-of-the-art VOS methods. The code can be found at https://github.com/Storife/RANet.
引用
收藏
页码:3977 / 3986
页数:10
相关论文
共 50 条
  • [1] RANet: Region Attention Network for Semantic Segmentation
    Shen, Dingguo
    Ji, Yuanfeng
    Li, Ping
    Wang, Yi
    Lin, Di
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [2] Adaptive Guidance and Attention-Refined Network for Fast Video Object Segmentation
    Li, Yaqian
    Li, Moran
    Xiao, Cunjun
    Li, Haibin
    [J]. NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7211 - 7225
  • [3] Adaptive Guidance and Attention-Refined Network for Fast Video Object Segmentation
    Yaqian Li
    Moran Li
    Cunjun Xiao
    Haibin Li
    [J]. Neural Processing Letters, 2023, 55 : 7211 - 7225
  • [4] Symmetry Encoder-Decoder Network with Attention Mechanism for Fast Video Object Segmentation
    Guo, Mingyue
    Zhang, Dejun
    Sun, Jun
    Wu, Yiqi
    [J]. SYMMETRY-BASEL, 2019, 11 (08):
  • [5] Guided Co-Segmentation Network for Fast Video Object Segmentation
    Liu, Weide
    Lin, Guosheng
    Zhang, Tianyi
    Liu, Zichuan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
  • [6] Multi-Attention Network for Unsupervised Video Object Segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    Lo, Sio-Long
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 71 - 75
  • [7] Fast Video Object Segmentation via Dynamic Targeting Network
    Zhang, Lu
    Lin, Zhe
    Zhang, Jianming
    Lu, Huchuan
    He, You
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5581 - 5590
  • [8] COMatchNet: Co-Attention Matching Network for Video Object Segmentation
    Huang, Lufei
    Sun, Fengming
    Yuan, Xia
    [J]. PATTERN RECOGNITION, ACPR 2021, PT I, 2022, 13188 : 271 - 284
  • [9] Multi-Attention Network for Compressed Video Referring Object Segmentation
    Chen, Weidong
    Hong, Dexiang
    Qi, Yuankai
    Han, Zhenjun
    Wang, Shuhui
    Qing, Laiyun
    Huang, Qingming
    Li, Guorong
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4416 - 4425
  • [10] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
    Zhao, Zongji
    Zhao, Sanyuan
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335