A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

被引:31
|
作者
Cheng, Zhen [1 ]
Huo, Guanying [1 ]
Li, Haisen [2 ]
机构
[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China
[2] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
side-scan sonar image classification; multi-domain collaborative transfer learning; multi-scale repeated attention mechanism; multi-domain datasets; feature representation; SEDIMENT CLASSIFICATION; SVM; CNN;
D O I
10.3390/rs14020355
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Due to the strong speckle noise caused by the seabed reverberation which makes it difficult to extract discriminating and noiseless features of a target, recognition and classification of underwater targets using side-scan sonar (SSS) images is a big challenge. Moreover, unlike classification of optical images which can use a large dataset to train the classifier, classification of SSS images usually has to exploit a very small dataset for training, which may cause classifier overfitting. Compared with traditional feature extraction methods using descriptors-such as Haar, SIFT, and LBP-deep learning-based methods are more powerful in capturing discriminating features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning method brings improvement to the sonar image classification using a small-size SSS image dataset. However, due to the different statistical characteristics between optical images and sonar images, transfer learning methods-e.g., fine-tuning-lack cross-domain adaptability, and therefore cannot achieve very satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with multi-scale repeated attention mechanism (MSRAM) is proposed for improving the accuracy of underwater sonar image classification. In the MDCTL method, low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images, and high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. Using different characteristics of multi-domain data to efficiently capture useful features for the sonar image classification, MDCTL offers a new way for transfer learning. MSRAM is used to effectively combine multi-scale features to make the proposed model pay more attention to the shape details of the target excluding the noise. Experimental results of classification show that, in using multi-domain data sets, the proposed method is more stable with an overall accuracy of 99.21%, bringing an improvement of 4.54% compared with the fine-tuned VGG19. Results given by diverse visualization methods also demonstrate that the method is more powerful in feature representation by using the MDCTL and MSRAM.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] A Study on a Remote Sensing Image Classification Method Incorporating Multi-scale Feature Extraction and Channel Attention Mechanism
    Hu, Juan
    Du, Xinran
    Wang, Hanyang
    Jin, Ting
    Yang, Houqun
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 524 - 536
  • [22] Domain adversarial transfer fault diagnosis method of an axial piston pump based on a multi-scale attention mechanism
    Dong, Zhikui
    An, Huijiang
    Liu, Siyuan
    Ma, Shihao
    Guo, Yuxuan
    Pan, Hongxin
    Ai, Chao
    MEASUREMENT, 2025, 239
  • [23] Rice Ears Detection Method Based on Multi-Scale Image Recognition and Attention Mechanism
    Qiu, Fen
    Shen, Xiaojun
    Zhou, Cheng
    He, Wuming
    Yao, Lili
    IEEE ACCESS, 2024, 12 : 68637 - 68647
  • [24] Bi-directional LSTM with multi-scale dense attention mechanism for hyperspectral image classification
    Jinxiong Gao
    Xiumei Gao
    Nan Wu
    Hongye Yang
    Multimedia Tools and Applications, 2022, 81 : 24003 - 24020
  • [25] Bi-directional LSTM with multi-scale dense attention mechanism for hyperspectral image classification
    Gao, Jinxiong
    Gao, Xiumei
    Wu, Nan
    Yang, Hongye
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24003 - 24020
  • [26] Remote sensing scene image classification model based on multi-scale features and attention mechanism
    Wang, Guowei
    Xu, Haixia
    Wang, Xinyu
    Yuan, Liming
    Wen, Xianbin
    JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (04)
  • [27] Optimizing underwater image enhancement: integrating semi-supervised learning and multi-scale aggregated attention
    Xu, Sunhan
    Wang, Jinhua
    He, Ning
    Xu, Guangmei
    Zhang, Geng
    VISUAL COMPUTER, 2024, : 3437 - 3455
  • [28] Image classification method for tomato leaf deficient nutrient elements based on attention mechanism and multi-scale feature fusion
    Han X.
    Zhao C.
    Wu H.
    Zhu H.
    Zhang Y.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2021, 37 (17): : 177 - 188
  • [29] A multi-scale multi-channel CNN introducing a channel-spatial attention mechanism hyperspectral remote sensing image classification method
    Zhao, Ru
    Zhang, Chaozhu
    Xue, Dan
    EUROPEAN JOURNAL OF REMOTE SENSING, 2024, 57 (01)
  • [30] Remote Sensing Image Scene Classification Method Based on Multi-Scale Cyclic Attention Network
    Ma X.
    Wang L.
    Qi K.
    Zheng G.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2021, 46 (10): : 3740 - 3752