A Multi-Domain Collaborative Transfer Learning Method with Multi-Scale Repeated Attention Mechanism for Underwater Side-Scan Sonar Image Classification

被引:31
|
作者
Cheng, Zhen [1 ]
Huo, Guanying [1 ]
Li, Haisen [2 ]
机构
[1] Hohai Univ, Coll Internet Things Engn, Changzhou 213022, Jiangsu, Peoples R China
[2] Harbin Engn Univ, Coll Underwater Acoust Engn, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
side-scan sonar image classification; multi-domain collaborative transfer learning; multi-scale repeated attention mechanism; multi-domain datasets; feature representation; SEDIMENT CLASSIFICATION; SVM; CNN;
D O I
10.3390/rs14020355
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Due to the strong speckle noise caused by the seabed reverberation which makes it difficult to extract discriminating and noiseless features of a target, recognition and classification of underwater targets using side-scan sonar (SSS) images is a big challenge. Moreover, unlike classification of optical images which can use a large dataset to train the classifier, classification of SSS images usually has to exploit a very small dataset for training, which may cause classifier overfitting. Compared with traditional feature extraction methods using descriptors-such as Haar, SIFT, and LBP-deep learning-based methods are more powerful in capturing discriminating features. After training on a large optical dataset, e.g., ImageNet, direct fine-tuning method brings improvement to the sonar image classification using a small-size SSS image dataset. However, due to the different statistical characteristics between optical images and sonar images, transfer learning methods-e.g., fine-tuning-lack cross-domain adaptability, and therefore cannot achieve very satisfactory results. In this paper, a multi-domain collaborative transfer learning (MDCTL) method with multi-scale repeated attention mechanism (MSRAM) is proposed for improving the accuracy of underwater sonar image classification. In the MDCTL method, low-level characteristic similarity between SSS images and synthetic aperture radar (SAR) images, and high-level representation similarity between SSS images and optical images are used together to enhance the feature extraction ability of the deep learning model. Using different characteristics of multi-domain data to efficiently capture useful features for the sonar image classification, MDCTL offers a new way for transfer learning. MSRAM is used to effectively combine multi-scale features to make the proposed model pay more attention to the shape details of the target excluding the noise. Experimental results of classification show that, in using multi-domain data sets, the proposed method is more stable with an overall accuracy of 99.21%, bringing an improvement of 4.54% compared with the fine-tuned VGG19. Results given by diverse visualization methods also demonstrate that the method is more powerful in feature representation by using the MDCTL and MSRAM.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A Local Region-Based Level Set Method With Markov Random Field for Side-Scan Sonar Image Multi-Level Segmentation
    Li, Junwei
    Jiang, Peng
    Zhu, He
    IEEE SENSORS JOURNAL, 2021, 21 (01) : 510 - 519
  • [32] MsF-AT: A Study on Ship SAR Image Classification Based on Multi-Scale Feature and Attention Mechanism
    Zheng, Jianli
    Cao, Jianjun
    Hu, Xin
    IEEE ACCESS, 2025, 13 : 55467 - 55475
  • [33] Ceramic Microscope Image Classification Based on Multi-Scale Fusion Bottleneck Structure and Chunking Attention Mechanism
    Zhuang, Zhihuang
    Xu, Xing
    Xia, Xuewen
    Li, Yuanxiang
    Zhang, Yinglong
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 1120 - 1136
  • [34] A new multi-focus image fusion method based on multi-classification focus learning and multi-scale decomposition
    Lifeng Ma
    Yanxiang Hu
    Bo Zhang
    Jiaqi Li
    Zhijie Chen
    Wenhao Sun
    Applied Intelligence, 2023, 53 : 1452 - 1468
  • [35] A new multi-focus image fusion method based on multi-classification focus learning and multi-scale decomposition
    Ma, Lifeng
    Hu, Yanxiang
    Zhang, Bo
    Li, Jiaqi
    Chen, Zhijie
    Sun, Wenhao
    APPLIED INTELLIGENCE, 2023, 53 (02) : 1452 - 1468
  • [36] Image classification method on class imbalance datasets using multi-scale CNN and two-stage transfer learning
    Jiahuan Liu
    Fei Guo
    Huang Gao
    Zhigao Huang
    Yun Zhang
    Huamin Zhou
    Neural Computing and Applications, 2021, 33 : 14179 - 14197
  • [37] Image classification method on class imbalance datasets using multi-scale CNN and two-stage transfer learning
    Liu, Jiahuan
    Guo, Fei
    Gao, Huang
    Huang, Zhigao
    Zhang, Yun
    Zhou, Huamin
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (21): : 14179 - 14197
  • [38] Collaborative representation-based classification method using weighted multi-scale LBP for image recognition
    Song, Xiaoning
    Chen, Yao
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 682 - 687
  • [39] A multi-scale and multi-domain heart sound feature-based machine learning model for ACC/AHA heart failure stage classification
    Zheng, Yineng
    Guo, Xingming
    Wang, Yingying
    Qin, Jian
    Lv, Fajin
    PHYSIOLOGICAL MEASUREMENT, 2022, 43 (06)
  • [40] HAMNet: hyperspectral image classification based on hybrid neural network with attention mechanism and multi-scale feature fusion
    Shen, Jinyue
    Zheng, Zhouzhou
    Sun, Yingwei
    Zhao, Mengmeng
    Chang, Yankang
    Shao, Yuyi
    Zhang, Yan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (11) : 4233 - 4258