AGUnet: Annotation-guided U-net for fast one-shot video object segmentation

被引:18
|
作者
Yin, Yingjie [1 ,2 ,3 ]
Xu, De [1 ,3 ]
Wang, Xingang [1 ,3 ]
Zhang, Lei [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hung Hom, Kowloon, Hong Kong, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Fully-convolutional Siamese network; U-net; Interactive image segmentation; Video object segmentation;
D O I
10.1016/j.patcog.2020.107580
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of semi-supervised video object segmentation has been popularly tackled by fine-tuning a general-purpose segmentation deep network on the annotated frame using hundreds of iterations of gra-dient descent. The time-consuming fine-tuning process, however, makes these methods difficult to use in practical applications. We propose a novel architecture called Annotation Guided U-net (AGUnet) for fast one-shot video object segmentation (VOS). AGUnet can quickly adapt a model trained on static images to segmenting the given target in a video by only several iterations of gradient descent. Our AGUnet is inspired by interactive image segmentation, where the interested target is segmented by using user annotated foreground. However, in AGUnet we use a fully-convolutional Siamese network to automatically annotate the foreground and background regions and fuse such annotation information into the skip connection of a U-net for VOS. Our AGUnet can be trained end-to-end effectively on static images instead of video sequences as required by many previous methods. The experiments show that AGUnet runs much faster than current state-of-the-art one-shot VOS algorithms while achieving competitive accuracy, and it has high generalization capability. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Self-supervised spatial-temporal feature enhancement for one-shot video object detection
    Yao, Xudong
    Yang, Xiaoshan
    NEUROCOMPUTING, 2024, 601
  • [42] Enhancing Building Facade Image Segmentation via Object-Wise Processing and Cascade U-Net
    Jung, Haemin
    Park, Heesung
    Jung, Hae Sun
    Lee, Kwang yon
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (02): : 2261 - 2279
  • [43] Fast Instance and Semantic Segmentation Exploiting Local Connectivity, Metric Learning, and One-Shot Detection for Robotics
    Milioto, Andres
    Mandtler, Leonard
    Stachniss, Cyrill
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5481 - 5487
  • [44] Performance Analysis of Dilated One-to-Many U-Net Model for Medical Image Segmentation
    Chenarlogh, Vahid Ashkani
    Hassanpour, Arman
    Grolinger, Katarina
    Parsa, Vijay
    IEEE ACCESS, 2024, 12 : 197259 - 197274
  • [45] TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation
    Chen, Bingzhi
    Liu, Yishu
    Zhang, Zheng
    Lu, Guangming
    Kong, Adams Wai Kin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 55 - 68
  • [46] Enhancing prostate cancer segmentation in bpMRI: Integrating zonal awareness into attention-guided U-Net
    Wei, Chao
    Liu, Zheng
    Zhang, Yibo
    Fan, Lianhui
    DIGITAL HEALTH, 2025, 11
  • [47] Microstructural segmentation using a union of attention guided U-Net models with different color transformed images
    Momojit Biswas
    Rishav Pramanik
    Shibaprasad Sen
    Aleksandr Sinitca
    Dmitry Kaplun
    Ram Sarkar
    Scientific Reports, 13
  • [48] Attention-guided duplex adversarial U-net for pancreatic segmentation from computed tomography images
    Li, Meiyu
    Lian, Fenghui
    Li, Yang
    Guo, Shuxu
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2022, 23 (04):
  • [49] Attention-guided hierarchical fusion U-Net for uncertainty-driven medical image segmentation
    Munia, Afsana Ahmed
    Abdar, Moloud
    Hasan, Mehedi
    Jalali, Mohammad S.
    Banerjee, Biplab
    Khosravi, Abbas
    Hossain, Ibrahim
    Fu, Huazhu
    Frangi, Alejandro F.
    INFORMATION FUSION, 2025, 115
  • [50] Microstructural segmentation using a union of attention guided U-Net models with different color transformed images
    Biswas, Momojit
    Pramanik, Rishav
    Sen, Shibaprasad
    Sinitca, Aleksandr
    Kaplun, Dmitry
    Sarkar, Ram
    SCIENTIFIC REPORTS, 2023, 13 (01)