Triple attention network for video segmentation

被引：31

作者：

Tian, Yan ^{[1
,2
]}

Zhang, Yujie ^{[1
]}

Zhou, Di ^{[3
]}

Cheng, Guohua ^{[4
]}

Chen, Wei-Gang ^{[1
]}

Wang, Ruili ^{[1
,5
]}

机构：

[1] Zhejiang Gongshang Univ, Sch Comp & Informat Engn, Hangzhou 310018, Peoples R China

[2] Shining3D Tech Co Ltd, Shining3D Res, Hangzhou 310018, Peoples R China

[3] Zhejiang Univ Technol Co Ltd, Hangzhou 310051, Peoples R China

[4] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Minist Educ, Key Lab Computat Neurosci & Brain Inspired Intlli, Shanghai 200433, Peoples R China

[5] Massey Univ, Auckland 0632, New Zealand

来源：

NEUROCOMPUTING | 2020年 / 417卷 / 417期

基金：

中国国家自然科学基金;

关键词：

Video segmentation; Computer vision; Deep learning; Convolution neural network;

D O I：

10.1016/j.neucom.2020.07.078

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video segmentation automatically segments a target object throughout a video and has recently achieved good progress due to the development of deep convolutional neural networks (DCNNs). However, how to simultaneously capture long-range dependencies in multiple spaces remains an important issue in video segmentation. In this paper, we propose a novel triple attention network (TriANet) that simultaneously exploits temporal, spatial, and channel context knowledge by using the self-attention mechanism to enhance the discriminant ability of feature representations. We verify our method on the Shining3D dental, DAVIS16, and DAVIS17 datasets, and the results show our method to be competitive when compared with other state-of-the-art video segmentation methods. (C) 2020 Published by Elsevier B.V.

引用

页码：202 / 211

页数：10

共 50 条

[1] TANet: Triple Attention Network for medical image segmentation
Wei, Xin
Ye, Fanghua
Wan, Huan
Xu, Jianfeng
Min, Weidong
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 82
[2] Attention-Guided Network for Semantic Video Segmentation
Li, Jiangyun
Zhao, Yikai
Fu, Jun
Wu, Jiajia
Liu, Jing
[J]. IEEE ACCESS, 2019, 7 : 140680 - 140689
[3] PTANet: Triple Attention Network for point cloud semantic segmentation
Cheng, Haozhe
Lu, Jian
Luo, Maoxin
Liu, Wei
Zhang, Kaibing
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 102
[4] Multi-Attention Network for Unsupervised Video Object Segmentation
Zhang, Guifang
Wong, Hon-Cheng
Lo, Sio-Long
[J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 71 - 75
[5] FUSION TARGET ATTENTION MASK GENERATION NETWORK FOR VIDEO SEGMENTATION
Li, Yunyi
Chen, Fangping
Yang, Fan
Li, Yuan
Jia, Huizhu
Xie, Xiaodong
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2276 - 2280
[6] MAIN: Multi-Attention Instance Network for video segmentation
Alcazar, Juan Leon
Bravo, Maria A.
Jeanneret, Guillaume
Thabet, Ali K.
Brox, Thomas
Arbelaez, Pablo
Ghanem, Bernard
[J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 210
[7] Spatio-temporal Attention Network for Video Instance Segmentation
Liu, Xiaoyu
Ren, Haibing
Ye, Tingmeng
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 725 - 727
[8] RANet: Ranking Attention Network for Fast Video Object Segmentation
Wang, Ziqin
Xu, Jun
Liu, Li
Zhu, Fan
Shao, Ling
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3977 - 3986
[9] TA-Net: Triple attention network for medical image segmentation
Li, Yang
Yang, Jun
Ni, Jiajia
Elazab, Ahmed
Wu, Jianhuang
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 137
[10] Using attention for video segmentation
Boccignone, G
Chianese, A
Moscato, V
Picariello, A
[J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 838 - 841

← 1 2 3 4 5 →