Semantic segmentation of underwater images based on the improved SegFormer

被引:0
|
作者
Chen, Bowei [1 ,2 ]
Zhao, Wei [1 ,2 ]
Zhang, Qiusheng [3 ]
Li, Mingliang [3 ]
Qi, Mingyang [3 ]
Tang, You [3 ,4 ,5 ]
机构
[1] Qingdao Innovat & Dev Base, Harbin, Peoples R China
[2] Harbin Engn Univ, Lab Underwater Intelligence, Qingdao, Peoples R China
[3] Jilin Agr Sci & Technol Univ, Elect & Informat Engn Coll, Jilin, Peoples R China
[4] Jilin Agr Univ, Coll Informat Technol, Changchun, Peoples R China
[5] Yanbian Univ, Coll Agr, Yanji, Peoples R China
关键词
underwater images; semantic segmentation; attention mechanism; feature fusion; SegFormer;
D O I
10.3389/fmars.2025.1522160
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Underwater images segmentation is essential for tasks such as underwater exploration, marine environmental monitoring, and resource development. Nevertheless, given the complexity and variability of the underwater environment, improving model accuracy remains a key challenge in underwater image segmentation tasks. To address these issues, this study presents a high-performance semantic segmentation approach for underwater images based on the standard SegFormer model. First, the Mix Transformer backbone in SegFormer is replaced with a Swin Transformer to enhance feature extraction and facilitate efficient acquisition of global context information. Next, the Efficient Multi-scale Attention (EMA) mechanism is introduced in the backbone's downsampling stages and the decoder to better capture multi-scale features, further improving segmentation accuracy. Furthermore, a Feature Pyramid Network (FPN) structure is incorporated into the decoder to combine feature maps at multiple resolutions, allowing the model to integrate contextual information effectively, enhancing robustness in complex underwater environments. Testing on the SUIM underwater image dataset shows that the proposed model achieves high performance across multiple metrics: mean Intersection over Union (MIoU) of 77.00%, mean Recall (mRecall) of 85.04%, mean Precision (mPrecision) of 89.03%, and mean F1score (mF1score) of 86.63%. Compared to the standard SegFormer, it demonstrates improvements of 3.73% in MIoU, 1.98% in mRecall, 3.38% in mPrecision, and 2.44% in mF1score, with an increase of 9.89M parameters. The results demonstrate that the proposed method achieves superior segmentation accuracy with minimal additional computation, showcasing high performance in underwater image segmentation.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Object measurement in real underwater environments using improved stereo matching with semantic segmentation
    Zhang, Jiawei
    Han, Fenglei
    Han, Duanfeng
    Su, Zhihao
    Li, Hansheng
    Zhao, Wangyuan
    Yang, Jianfeng
    MEASUREMENT, 2023, 218
  • [42] Semantic segmentation of angiographic images
    Menegaz, G
    Lancini, R
    PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 18, PTS 1-5, 1997, 18 : 670 - 671
  • [43] A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet
    Wang, Xiaolei
    Hu, Zirong
    Shi, Shouhai
    Hou, Mei
    Xu, Lei
    Zhang, Xiang
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [44] A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet
    Xiaolei Wang
    Zirong Hu
    Shouhai Shi
    Mei Hou
    Lei Xu
    Xiang Zhang
    Scientific Reports, 13
  • [45] Multimodal Semantic Segmentation Based On Improved Vision Transformers
    Qi, Weimin
    Chen, Hangong
    Wang, Zhiming
    Wang, Meng
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 565 - 569
  • [46] A Semantic Segmentation Algorithm Based on Improved Attention Mechanism
    Chen, Chunyu
    Wu, Xinsheng
    Chen, An
    2020 INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS), 2020, : 244 - 248
  • [47] A NOVEL SEGMENTATION METHOD BASED ON GRAYSCALE WAVE FOR UNDERWATER IMAGES
    Yan, Mingzhong
    Huang, Bingyi
    Zhu, Daqi
    Yang, Simon X.
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2018, 33 (04): : 386 - 393
  • [48] An Improved Algorithm for Defogging Based on Fused Underwater Images
    Cao, Xinli
    Xiong, Junqiao
    Shang, Yuxin
    Liu, Changrui
    Zou, Lianying
    2022 International Conference on Artificial Intelligence and Computer Information Technology, AICIT 2022, 2022,
  • [49] An Improved UNet Lightweight Network for Semantic Segmentation of Weed Images in Corn Fields
    Zuo, Yu
    Li, Wenwen
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4413 - 4431
  • [50] Improved U-net++ Semantic Segmentation Method for Remote Sensing Images
    He, Jiajia
    Xu, Yang
    Zhang, Yongdan
    Computer Engineering and Applications, 60 (13): : 255 - 265