A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

Cited by: 31
Authors
Cheng, Zhen [1]
Huo, Guanying [1]
Li, Haisen [2]
Affiliations
[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China
[2] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin 150001, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
side-scan sonar image classification; multi-domain collaborative transfer learning; multi-scale repeated attention mechanism; multi-domain datasets; feature representation; SEDIMENT CLASSIFICATION; SVM; CNN;
DOI
10.3390/rs14020355
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Recognizing and classifying underwater targets in side-scan sonar (SSS) images is challenging because strong speckle noise caused by seabed reverberation makes it difficult to extract discriminative, noise-free features of a target. Moreover, unlike optical image classification, which can train the classifier on a large dataset, SSS image classification usually has to rely on a very small training dataset, which may cause the classifier to overfit. Compared with traditional feature extraction methods that use descriptors such as Haar, SIFT, and LBP, deep learning-based methods are more powerful in capturing discriminative features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning improves sonar image classification with a small SSS image dataset. However, because optical and sonar images have different statistical characteristics, transfer learning methods such as fine-tuning lack cross-domain adaptability and therefore cannot achieve very satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with a multi-scale repeated attention mechanism (MSRAM) is proposed to improve the accuracy of underwater sonar image classification. In the MDCTL method, the low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images and the high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. By using the different characteristics of multi-domain data to efficiently capture features useful for sonar image classification, MDCTL offers a new approach to transfer learning. MSRAM is used to effectively combine multi-scale features so that the proposed model pays more attention to the shape details of the target while excluding the noise. Experimental classification results show that, when using multi-domain datasets, the proposed method is more stable, with an overall accuracy of 99.21%, an improvement of 4.54% over the fine-tuned VGG19. Results from diverse visualization methods also demonstrate that the method is more powerful in feature representation thanks to MDCTL and MSRAM.
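The abstract says that MSRAM combines multi-scale features so the model attends to target shape rather than noise, but it does not give the architecture. The sketch below is therefore not the authors' MSRAM; it only illustrates the generic idea of attention-weighted fusion across scales. The function names, the per-scale descriptors, and the relevance scores are all hypothetical, and a real model would learn the scores rather than take them as input.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scales(features, scores):
    """Attention-weighted sum of same-length feature vectors, one per scale.

    features: list of feature vectors (hypothetical pooled descriptors).
    scores:   one relevance score per scale (hypothetical; assumed given here,
              whereas a trained network would predict them).
    """
    if len(features) != len(scores):
        raise ValueError("need exactly one score per scale")
    weights = softmax(scores)
    dim = len(features[0])
    fused = [sum(w * f[i] for w, f in zip(weights, features)) for i in range(dim)]
    return fused, weights


# Three hypothetical scales, each summarized by a 4-dimensional descriptor.
features = [
    [0.9, 0.1, 0.0, 0.2],  # fine scale
    [0.5, 0.4, 0.1, 0.3],  # middle scale
    [0.2, 0.2, 0.6, 0.1],  # coarse scale
]
scores = [2.0, 1.0, 0.1]  # made-up scores favouring the fine scale

fused, weights = fuse_scales(features, scores)
print([round(w, 3) for w in weights])
print([round(x, 3) for x in fused])
```

Because the weights sum to one, the fused descriptor stays on the same scale as its inputs, and scales with low scores (for example, ones dominated by speckle) contribute proportionally less.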
Pages: 25