Mobile_ViT: Underwater Acoustic Target Recognition Method Based on Local-Global Feature Fusion

被引:2
|
作者
Yao, Haiyang [1 ,2 ]
Gao, Tian [1 ,2 ]
Wang, Yong [3 ]
Wang, Haiyan [1 ,4 ]
Chen, Xiao [1 ,2 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710021, Peoples R China
[2] Shaanxi Univ Sci & Technol, Shaanxi Joint Lab Artificial Intelligence, Xian 710021, Peoples R China
[3] Xian Microelect Technol Inst, Xian 710021, Peoples R China
[4] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
underwater acoustic target recognition; attention mechanism; feature fusion; MobileNet; Transformer; CLASSIFICATION;
D O I
10.3390/jmse12040589
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
To overcome the challenges of inadequate representation and ineffective information exchange stemming from feature homogenization in underwater acoustic target recognition, we introduce a hybrid network named Mobile_ViT, which synergizes MobileNet and Transformer architectures. The network begins with a convolutional backbone incorporating an embedded coordinate attention mechanism to enhance the local details of inputs. This mechanism captures the long-term temporal dependencies and precise frequency-domain relationships of signals, focusing the features on the time-frequency positions. Subsequently, the Transformer's Encoder is integrated at the end of the backbone to facilitate global characterization, thus effectively overcoming the convolutional neural network's shortcomings in capturing long-range feature dependencies. Evaluation on the Shipsear and DeepShip datasets yields accuracies of 98.50% and 94.57%, respectively, marking a substantial improvement over the baseline. Notably, the proposed method also demonstrates obvious separation coefficients, signifying enhanced clustering effectiveness, and is lighter than other Transformers.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Underwater Acoustic Target Recognition Based on Multi-timeslice Demodulation Line Spectrum Feature
    Shi, Guangzhi
    Hu, Junchuan
    Han, Mei
    Li, Yuyang
    2008 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-4, 2008, : 835 - 839
  • [42] Underwater Target Noise Recognition and Classification Technology based on Multi-Classes Feature Fusion
    Zhang S.
    Wang C.
    Sun Q.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2020, 38 (02): : 366 - 376
  • [43] Local-Global Cross-Fusion Transformer Network for Facial Expression Recognition
    Liu, Yicheng
    Li, Zecheng
    Zhang, Yanbo
    Wen, Jie
    WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 254 - 269
  • [44] A Local-Global Feature Fusing Method for Point Clouds Semantic Segmentation
    Bi, Yuanwei
    Zhang, Lujian
    Liu, Yaowen
    Huang, Yansen
    Liu, Hao
    IEEE ACCESS, 2023, 11 : 68776 - 68790
  • [45] Underwater Acoustic Multi-target Recognition Algorithm Based on Hierarchical Information Fusion Structure
    Yu, Liang
    Cheng, Yong-mei
    Song, Lin
    Liu, Zhun-ga
    Chen, Ke-zhe
    2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
  • [46] KPCA and AE Based Local-Global Feature Extraction Method for Vibration Signals of Rotating Machinery
    Hu, Xiao
    Xiao, Zhihuai
    Liu, Dong
    Tang, Yongjun
    Malik, O. P.
    Xia, Xiangchen
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [47] Local-Global Feature Fusion Network for Efficient Hyperspectral Image Super-Resolution
    Xu, Jingran
    Zhao, Jiankang
    Cui, Chao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [48] Multi-scale spectral feature extraction for underwater acoustic target recognition
    Jiang, Junjun
    Shi, Tuo
    Huang, Min
    Xiao, Zhongzhe
    MEASUREMENT, 2020, 166
  • [49] Underwater acoustic target recognition using RCRNN and wavelet-auditory feature
    Qi, Pengyuan
    Yin, Guisheng
    Zhang, Liguo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 47295 - 47317
  • [50] Underwater acoustic target recognition using RCRNN and wavelet-auditory feature
    Pengyuan Qi
    Guisheng Yin
    Liguo Zhang
    Multimedia Tools and Applications, 2024, 83 : 47295 - 47317