AGUnet: Annotation-guided U-net for fast one-shot video object segmentation

被引:18
|
作者
Yin, Yingjie [1 ,2 ,3 ]
Xu, De [1 ,3 ]
Wang, Xingang [1 ,3 ]
Zhang, Lei [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hung Hom, Kowloon, Hong Kong, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Fully-convolutional Siamese network; U-net; Interactive image segmentation; Video object segmentation;
D O I
10.1016/j.patcog.2020.107580
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of semi-supervised video object segmentation has been popularly tackled by fine-tuning a general-purpose segmentation deep network on the annotated frame using hundreds of iterations of gra-dient descent. The time-consuming fine-tuning process, however, makes these methods difficult to use in practical applications. We propose a novel architecture called Annotation Guided U-net (AGUnet) for fast one-shot video object segmentation (VOS). AGUnet can quickly adapt a model trained on static images to segmenting the given target in a video by only several iterations of gradient descent. Our AGUnet is inspired by interactive image segmentation, where the interested target is segmented by using user annotated foreground. However, in AGUnet we use a fully-convolutional Siamese network to automatically annotate the foreground and background regions and fuse such annotation information into the skip connection of a U-net for VOS. Our AGUnet can be trained end-to-end effectively on static images instead of video sequences as required by many previous methods. The experiments show that AGUnet runs much faster than current state-of-the-art one-shot VOS algorithms while achieving competitive accuracy, and it has high generalization capability. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [11] One-Shot Learning-Based Animal Video Segmentation
    Xue, Tengfei
    Qiao, Yongliang
    Kong, He
    Su, Daobilige
    Pan, Shirui
    Rafique, Khalid
    Sukkarieh, Salah
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (06) : 3799 - 3807
  • [12] One-Shot Scale and Angle Estimation for Fast Visual Object Tracking
    Lee, Dong-Hyun
    IEEE ACCESS, 2019, 7 : 55477 - 55484
  • [13] Quadruplet Network With One-Shot Learning for Fast Visual Object Tracking
    Dong, Xingping
    Shen, Jianbing
    Wu, Dongming
    Guo, Kan
    Jin, Xiaogang
    Porikli, Fatih
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3516 - 3527
  • [14] Shape-intensity-guided U-net for medical image segmentation
    Dong, Wenhui
    Du, Bo
    Xu, Yongchao
    NEUROCOMPUTING, 2024, 610
  • [15] CAM-GUIDED U-NET WITH ADVERSARIAL REGULARIZATION FOR DEFECT SEGMENTATION
    Lin, Dongyun
    Li, Yiqun
    Prasad, Shitala
    Nwe, Tin Lay
    Dong, Sheng
    Oo, Zaw Min
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1054 - 1058
  • [16] A Few-Shot Attention Recurrent Residual U-Net for Crack Segmentation
    Katsamenis, Iason
    Protopapadakis, Eftychios
    Bakalos, Nikolaos
    Varvarigos, Andreas
    Doulamis, Anastasios
    Doulamis, Nikolaos
    Voulodimos, Athanasios
    ADVANCES IN VISUAL COMPUTING, ISVC 2023, PT I, 2023, 14361 : 199 - 209
  • [17] Guided Co-Segmentation Network for Fast Video Object Segmentation
    Liu, Weide
    Lin, Guosheng
    Zhang, Tianyi
    Liu, Zichuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
  • [18] Fast and Accurate U-Net Model for Fetal Ultrasound Image Segmentation
    Chenarlogh, Vahid Ashkani
    Oghli, Mostafa Ghelich
    Shabanzadeh, Ali
    Sirjani, Nasim
    Akhavan, Ardavan
    Shiri, Isaac
    Arabi, Hossein
    Shabanzadeh, Zahra
    Taheri, Morteza Sanei
    Tarzamni, Mohammad Kazem
    ULTRASONIC IMAGING, 2022, 44 (01) : 25 - 38
  • [19] Object Classification by Effective Segmentation of Tree Canopy Using U-Net Model
    Vasavi, S.
    Likhitha, Atluri Lakshmi
    Premchand, Veeranki Sai
    Yasaswini, Jampa
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2024, 15 (03) : 422 - 434
  • [20] Coordinate-Guided U-Net for Automated Breast Segmentation on MRI Images
    Zheng, Xinpeng
    Liu, Zhuangsheng
    Chang, Lin
    Long, Wansheng
    Lu, Yao
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069