AGUnet: Annotation-guided U-net for fast one-shot video object segmentation

被引:18
|
作者
Yin, Yingjie [1 ,2 ,3 ]
Xu, De [1 ,3 ]
Wang, Xingang [1 ,3 ]
Zhang, Lei [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hung Hom, Kowloon, Hong Kong, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Fully-convolutional Siamese network; U-net; Interactive image segmentation; Video object segmentation;
D O I
10.1016/j.patcog.2020.107580
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of semi-supervised video object segmentation has been popularly tackled by fine-tuning a general-purpose segmentation deep network on the annotated frame using hundreds of iterations of gra-dient descent. The time-consuming fine-tuning process, however, makes these methods difficult to use in practical applications. We propose a novel architecture called Annotation Guided U-net (AGUnet) for fast one-shot video object segmentation (VOS). AGUnet can quickly adapt a model trained on static images to segmenting the given target in a video by only several iterations of gradient descent. Our AGUnet is inspired by interactive image segmentation, where the interested target is segmented by using user annotated foreground. However, in AGUnet we use a fully-convolutional Siamese network to automatically annotate the foreground and background regions and fuse such annotation information into the skip connection of a U-net for VOS. Our AGUnet can be trained end-to-end effectively on static images instead of video sequences as required by many previous methods. The experiments show that AGUnet runs much faster than current state-of-the-art one-shot VOS algorithms while achieving competitive accuracy, and it has high generalization capability. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] One-Shot Video Object Segmentation
    Caelles, S.
    Maninis, K. -K.
    Pont-Tuset, J.
    Leal-Taixe, L.
    Cremers, D.
    Van Gool, L.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5320 - 5329
  • [2] Make One-Shot Video Object Segmentation Efficient Again
    Meinhardt, Tim
    Leal-Taixe, Laura
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] One-Shot Video Object Segmentation Using Attention Transfer
    Chanda, Omit
    Wang, Yang
    2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [4] A Spatiotemporal Mask Autoencoder for One-shot Video Object Segmentation
    Chen, Baiyu
    Zhao, Li
    Chan, Sixian
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, FAIML 2024, 2024, : 6 - 12
  • [5] Annotation-Free and One-Shot Learning for Instance Segmentation of Homogeneous Object Clusters
    Wu, Zheng
    Chang, Ruiheng
    Ma, Jiaxu
    Lu, Cewu
    Tang, Chi Keung
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1036 - 1042
  • [6] VQVC plus : One-Shot Voice Conversion by Vector Quantization and U-Net architecture
    Wu, Da-Yi
    Chen, Yen-Hao
    Lee, Hung-Yi
    INTERSPEECH 2020, 2020, : 4691 - 4695
  • [7] Semi-supervised one-shot learning for video object segmentation in dynamic environments
    Dinesh Elayaperumal
    Sachin Sakthi K S
    Jae Hoon Jeong
    Young Hoon Joo
    Multimedia Tools and Applications, 2025, 84 (6) : 3095 - 3115
  • [8] Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks
    Jiang, Kaixun
    Hong, Lingyi
    Chen, Zhaoyu
    Guo, Pinxue
    Tao, Zeng
    Wang, Yan
    Zhang, Wenqiang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 8598 - 8607
  • [9] Attention guided U-Net for accurate iris segmentation
    Lian, Sheng
    Luo, Zhiming
    Zhong, Zhun
    Lin, Xiang
    Su, Songzhi
    Li, Shaozi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 296 - 304
  • [10] Fully Convolutional One-Shot Object Segmentation for Industrial Robotics
    Schnieders, Benjamin
    Luo, Shan
    Palmer, Gregory
    Tuyls, Karl
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1161 - 1169