TBiSeg: A transformer-based network with bi-level routing attention for inland waterway segmentation

被引:2
|
作者
Fu, Chuanmao [1 ]
Li, Meng [1 ]
Zhang, Bo [2 ]
机构
[1] Jilin Univ, Coll Elect Sci & Engn, State Key Lab Integrated Optoelect, Changchun 130000, Jilin, Peoples R China
[2] China Ship Sci Res Ctr, Taihu Lab Deepsea, Wuxi 214082, Jiangsu, Peoples R China
关键词
Inland waterway segmentation; Vision transformer; Deep learning; Attention mechanism; UNMANNED SURFACE VEHICLES; USV;
D O I
10.1016/j.oceaneng.2024.119011
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Unmanned surface vehicles (USVs) for inland waterways have recently attracted increasing attention in various fields. Accurate detection in navigable regions is crucial for ensuring USV safety in autonomous navigation. However, the complex and variable environment of inland waterways, such as confusable textures and irregular edge details, continues to pose some problems in existing methods. Therefore, to acquire navigable regions, this study proposed TBiSeg, a Vision Transformer-based efficient inland waterway segmentation network, for obtaining pixel-level results. Bi-level routing attention is used to improve the Transformer block, which enhances the understanding of inland water textures. Additionly, this study combined global and local attention through a hierarchical encoder-decoder architecture. To simulate inland waterway scenes as accurately as possible, this study used two representative public datasets for data integration and data augmentation, and conducted testing and cross-validating using multiple inland waterway datasets. Results demonstrated that the model performed better than current state-of-the-art models in segmentation accuracy and robustness in complex inland waterway environments while showing impressive generalization. The datasets and code used in this paper is available at https://github.com/dawnnazzz/TBiSeg.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] New bi-level programming model for routing and spectrum assignment in elastic optical network
    Hejun Xuan
    Yuping Wang
    Zhanqi Xu
    Shanshan Hao
    Xiaoli Wang
    Optical and Quantum Electronics, 2017, 49
  • [32] TransVPR: Transformer-Based Place Recognition with Multi-Level Attention Aggregation
    Wang, Ruotong
    Shen, Yanqing
    Zuo, Weiliang
    Zhou, Sanping
    Zheng, Nanning
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13638 - 13647
  • [33] A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining
    Shi, Bowen
    Jiang, Dongsheng
    Zhang, Xiaopeng
    Li, Han
    Dai, Wenrui
    Zou, Junni
    Xiong, Hongkai
    Tian, Qi
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 624 - 639
  • [34] Transformer-Based Cross-Modal Information Fusion Network for Semantic Segmentation
    Duan, Zaipeng
    Huang, Xiao
    Ma, Jie
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6361 - 6375
  • [35] A Transformer-based Cascade Network with Boundary Enhancement Loss for Retinal Vessel Segmentation
    Cai, Binke
    Ma, Liyan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4292 - 4298
  • [36] An Efficient and Light Transformer-Based Segmentation Network for Remote Sensing Images of Landscapes
    Chen, Lijia
    Chen, Honghui
    Xie, Yanqiu
    He, Tianyou
    Ye, Jing
    Zheng, Yushan
    FORESTS, 2023, 14 (11):
  • [37] Transformer-based semantic segmentation and CNN network for detection of histopathological lung cancer
    Talib, Lareib Fatima
    Amin, Javaria
    Sharif, Muhammad
    Raza, Mudassar
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [38] Dead Broiler Detection and Segmentation Using Transformer-Based Dual Stream Network
    Ham, Gyu-Sung
    Oh, Kanghan
    AGRICULTURE-BASEL, 2024, 14 (11):
  • [39] Carbon pricing initiatives-based bi-level pollution routing problem
    Qiu, Rui
    Xu, Jiuping
    Ke, Ruimin
    Zeng, Ziqiang
    Wang, Yinhai
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2020, 286 (01) : 203 - 217
  • [40] Semi-Supervised Convolutional Vision Transformer with Bi-Level Uncertainty Estimation for Medical Image Segmentation
    Huang, Huimin
    Huang, Yawen
    Xie, Shiao
    Lin, Lanfen
    Tong Ruofeng
    Chen, Yen-Wei
    Li, Yuexiang
    Zheng, Yefeng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5214 - 5222