AGUnet: Annotation-guided U-net for fast one-shot video object segmentation

被引：18

作者：

Yin, Yingjie ^{[1
,2
,3
]}

Xu, De ^{[1
,3
]}

Wang, Xingang ^{[1
,3
]}

Zhang, Lei ^{[2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Res Ctr Precis Sensing & Control, Beijing 100190, Peoples R China

[2] Hong Kong Polytech Univ, Dept Comp, Hung Hom, Kowloon, Hong Kong, Peoples R China

[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China

来源：

PATTERN RECOGNITION | 2021年 / 110卷

基金：

中国国家自然科学基金;

关键词：

Fully-convolutional Siamese network; U-net; Interactive image segmentation; Video object segmentation;

D O I：

10.1016/j.patcog.2020.107580

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The problem of semi-supervised video object segmentation has been popularly tackled by fine-tuning a general-purpose segmentation deep network on the annotated frame using hundreds of iterations of gra-dient descent. The time-consuming fine-tuning process, however, makes these methods difficult to use in practical applications. We propose a novel architecture called Annotation Guided U-net (AGUnet) for fast one-shot video object segmentation (VOS). AGUnet can quickly adapt a model trained on static images to segmenting the given target in a video by only several iterations of gradient descent. Our AGUnet is inspired by interactive image segmentation, where the interested target is segmented by using user annotated foreground. However, in AGUnet we use a fully-convolutional Siamese network to automatically annotate the foreground and background regions and fuse such annotation information into the skip connection of a U-net for VOS. Our AGUnet can be trained end-to-end effectively on static images instead of video sequences as required by many previous methods. The experiments show that AGUnet runs much faster than current state-of-the-art one-shot VOS algorithms while achieving competitive accuracy, and it has high generalization capability. (c) 2020 Elsevier Ltd. All rights reserved.

引用

页数：10

共 50 条

[41] Self-supervised spatial-temporal feature enhancement for one-shot video object detection
Yao, Xudong
Yang, Xiaoshan
NEUROCOMPUTING, 2024, 601
[42] Enhancing Building Facade Image Segmentation via Object-Wise Processing and Cascade U-Net
Jung, Haemin
Park, Heesung
Jung, Hae Sun
Lee, Kwang yon
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (02): : 2261 - 2279
[43] Fast Instance and Semantic Segmentation Exploiting Local Connectivity, Metric Learning, and One-Shot Detection for Robotics
Milioto, Andres
Mandtler, Leonard
Stachniss, Cyrill
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 5481 - 5487
[44] Performance Analysis of Dilated One-to-Many U-Net Model for Medical Image Segmentation
Chenarlogh, Vahid Ashkani
Hassanpour, Arman
Grolinger, Katarina
Parsa, Vijay
IEEE ACCESS, 2024, 12 : 197259 - 197274
[45] TransAttUnet: Multi-Level Attention-Guided U-Net With Transformer for Medical Image Segmentation
Chen, Bingzhi
Liu, Yishu
Zhang, Zheng
Lu, Guangming
Kong, Adams Wai Kin
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 55 - 68
[46] Enhancing prostate cancer segmentation in bpMRI: Integrating zonal awareness into attention-guided U-Net
Wei, Chao
Liu, Zheng
Zhang, Yibo
Fan, Lianhui
DIGITAL HEALTH, 2025, 11
[47] Microstructural segmentation using a union of attention guided U-Net models with different color transformed images
Momojit Biswas
Rishav Pramanik
Shibaprasad Sen
Aleksandr Sinitca
Dmitry Kaplun
Ram Sarkar
Scientific Reports, 13
[48] Attention-guided duplex adversarial U-net for pancreatic segmentation from computed tomography images
Li, Meiyu
Lian, Fenghui
Li, Yang
Guo, Shuxu
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2022, 23 (04):
[49] Attention-guided hierarchical fusion U-Net for uncertainty-driven medical image segmentation
Munia, Afsana Ahmed
Abdar, Moloud
Hasan, Mehedi
Jalali, Mohammad S.
Banerjee, Biplab
Khosravi, Abbas
Hossain, Ibrahim
Fu, Huazhu
Frangi, Alejandro F.
INFORMATION FUSION, 2025, 115
[50] Microstructural segmentation using a union of attention guided U-Net models with different color transformed images
Biswas, Momojit
Pramanik, Rishav
Sen, Shibaprasad
Sinitca, Aleksandr
Kaplun, Dmitry
Sarkar, Ram
SCIENTIFIC REPORTS, 2023, 13 (01)

← 1 2 3 4 5 →