TBiSeg: A transformer-based network with bi-level routing attention for inland waterway segmentation

Cited by: 2

Authors:

Fu, Chuanmao [1]

Li, Meng [1]

Zhang, Bo [2]

Affiliations:

[1] Jilin Univ, Coll Elect Sci & Engn, State Key Lab Integrated Optoelect, Changchun 130000, Jilin, Peoples R China

[2] China Ship Sci Res Ctr, Taihu Lab Deepsea, Wuxi 214082, Jiangsu, Peoples R China

Source:

OCEAN ENGINEERING | 2024 / Vol. 311

Keywords:

Inland waterway segmentation; Vision transformer; Deep learning; Attention mechanism; Unmanned surface vehicles; USV

DOI:

10.1016/j.oceaneng.2024.119011

Chinese Library Classification:

U6 [Waterway transportation]; P75 [Ocean engineering];

Discipline classification codes:

0814 ; 081505 ; 0824 ; 082401 ;

Abstract:

Unmanned surface vehicles (USVs) for inland waterways have recently attracted increasing attention in various fields. Accurate detection of navigable regions is crucial for ensuring USV safety in autonomous navigation. However, the complex and variable environment of inland waterways, with its confusable textures and irregular edge details, continues to pose problems for existing methods. Therefore, to acquire navigable regions, this study proposed TBiSeg, an efficient Vision Transformer-based inland waterway segmentation network that produces pixel-level results. Bi-level routing attention is used to improve the Transformer block, which enhances the understanding of inland water textures. Additionally, this study combined global and local attention through a hierarchical encoder-decoder architecture. To simulate inland waterway scenes as accurately as possible, this study used two representative public datasets for data integration and data augmentation, and conducted testing and cross-validation using multiple inland waterway datasets. Results demonstrated that the model performed better than current state-of-the-art models in segmentation accuracy and robustness in complex inland waterway environments while showing impressive generalization. The datasets and code used in this paper are available at https://github.com/dawnnazzz/TBiSeg.
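The abstract does not spell out how bi-level routing attention works, so the following is only a minimal sketch of the general idea as it is commonly described: tokens are grouped into regions, each region is first routed to its top-k most related regions using coarse region-level descriptors, and token-level attention is then computed only over tokens gathered from those routed regions. It is not the TBiSeg implementation. The function names, the toy data, and the simplification of using the raw tokens as queries, keys, and values (no learned projections) are all assumptions made for illustration.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def mean_vector(vectors):
    """Element-wise mean of a list of vectors (a coarse region descriptor)."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_regions(regions, top_k):
    """Level 1 (region level): pick the top_k most related regions per region."""
    descriptors = [mean_vector(r) for r in regions]
    routes = []
    for q in descriptors:
        affinity = [dot(q, k) for k in descriptors]
        ranked = sorted(range(len(regions)), key=lambda j: affinity[j], reverse=True)
        routes.append(ranked[:top_k])
    return routes


def bi_level_routing_attention(regions, top_k):
    """Level 2 (token level): attend only to tokens in the routed regions.

    Assumption: queries, keys, and values are the raw tokens themselves.
    """
    routes = route_regions(regions, top_k)
    dim = len(regions[0][0])
    scale = 1.0 / math.sqrt(dim)
    output = []
    for i, region in enumerate(regions):
        # Gather key/value tokens from the routed regions only.
        gathered = [tok for j in routes[i] for tok in regions[j]]
        region_out = []
        for q in region:
            weights = softmax([dot(q, k) * scale for k in gathered])
            region_out.append(
                [sum(w * v[d] for w, v in zip(weights, gathered)) for d in range(dim)]
            )
        output.append(region_out)
    return output


# Hypothetical toy input: 3 regions, 2 tokens each, 2-dimensional features.
regions = [
    [[1.0, 0.0], [0.9, 0.1]],     # region 0
    [[0.8, 0.2], [1.0, 0.1]],     # region 1, similar to region 0
    [[-1.0, 0.0], [-0.9, -0.1]],  # region 2, dissimilar to the others
]
routes = route_regions(regions, top_k=2)
out = bi_level_routing_attention(regions, top_k=2)
```

In this toy setup, region 0 is routed to itself and region 1 and never attends to the dissimilar region 2. The routing step is what keeps the token-level attention sparse while still letting each token reach beyond its own region.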

Pages: 15
