A convolutional vision transformer for semantic segmentation of side-scan sonar data

Cited by: 2
Authors
Rajani, Hayat [1]
Gracias, Nuno [1]
Garcia, Rafael [1]
Affiliations
[1] Univ Girona, Comp Vis & Robot Res Inst ViCOROB, Campus Montilivi,Edifici P4, Girona 17003, Catalonia, Spain
Funding
EU Horizon 2020;
Keywords
Seafloor segmentation; Side-scan sonar; Vision transformer; Convolutional transformer; Real-time;
DOI
10.1016/j.oceaneng.2023.115647
Chinese Library Classification (CLC) number
U6 [Waterway Transportation]; P75 [Ocean Engineering];
Discipline classification code
0814 ; 081505 ; 0824 ; 082401 ;
Abstract
Distinguishing among different marine benthic habitat characteristics is of key importance in a wide set of seabed operations ranging from installations of oil rigs to laying networks of cables and monitoring the impact of humans on marine ecosystems. The Side-Scan Sonar (SSS) is a widely used imaging sensor in this regard. It produces high-resolution seafloor maps by logging the intensities of sound waves reflected back from the seafloor. In this work, we leverage these acoustic intensity maps to produce pixel-wise categorization of different seafloor types. We propose a novel architecture adapted from the Vision Transformer (ViT) in an encoder-decoder framework. Further, in doing so, the applicability of ViTs is evaluated on smaller datasets. To overcome the lack of CNN-like inductive biases, thereby making ViTs more conducive to applications in low data regimes, we propose a novel feature extraction module to replace the Multi-layer Perceptron (MLP) block within transformer layers and a novel module to extract multiscale patch embeddings. A lightweight decoder is also proposed to complement this design in order to further enhance multiscale feature extraction. With the modified architecture, we achieve state-of-the-art results and also meet real-time computational requirements. We make our code available at https://github.com/hayatrajani/s3seg-vit.
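The abstract mentions extracting multiscale patch embeddings from the acoustic intensity maps but does not describe the module itself. As a rough illustration of the general idea only, the sketch below tokenizes a toy intensity map into non-overlapping patches at two patch sizes. The function names, the patch sizes (2 and 4), and the toy 4x4 map are all assumptions made for this example, and the learned projection that would turn each flattened patch into an embedding is omitted. It is not the authors' implementation, which is available at the repository linked above.

```python
from typing import Dict, List, Sequence

Image = List[List[float]]  # rows of acoustic intensities (hypothetical toy format)


def extract_patches(image: Image, patch: int) -> List[List[float]]:
    """Split an H x W map into non-overlapping patch x patch tiles.

    Each tile is flattened row-major into one token vector. A real ViT
    would then apply a learned linear projection, which is omitted here.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image size must be divisible by the patch size")
    tokens = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            tokens.append([
                image[r][c]
                for r in range(top, top + patch)
                for c in range(left, left + patch)
            ])
    return tokens


def multiscale_tokens(
    image: Image, patch_sizes: Sequence[int] = (2, 4)
) -> Dict[int, List[List[float]]]:
    """Tokenize the same map at several patch sizes (assumed scales)."""
    return {p: extract_patches(image, p) for p in patch_sizes}


# Toy 4x4 "sonar" map whose pixel values are simply 0..15 in row-major order.
toy_map = [[float(4 * r + c) for c in range(4)] for r in range(4)]
tokens_by_scale = multiscale_tokens(toy_map)
```

With these assumed settings, the smaller patch size yields more, shorter tokens (fine detail) and the larger one yields fewer, longer tokens (coarse context). That trade-off is what a multiscale embedding tries to combine.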
Pages: 12