Underwater target recognition using convolutional recurrent neural networks with 3-D Mel-spectrogram and data augmentation

被引:78
|
作者
Liu, Feng [1 ]
Shen, Tongsheng [1 ]
Luo, Zailei [1 ]
Zhao, Dexin [1 ]
Guo, Shaojun [1 ]
机构
[1] Chinese Acad Mil Sci, Natl Innovat Inst Def Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Underwater acoustic target recognition; Feature extraction; Mel-spectrogram; Data augmentation; Convolutional Recurrent Neural Networks; CLASSIFICATION; FEATURES;
D O I
10.1016/j.apacoust.2021.107989
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Passive recognition of underwater acoustic targets is a hot research issue in acoustic signal processing. The long-term interference of irregular noise in the marine environment caused the relevance of the passive recognition method of underwater targets based on the traditional technical framework to gradually decrease. Due to the interference of irregular noise in the ocean, the passive recognition method used for underwater targets based on the traditional technical framework is gradually becoming less relevant. The feature extraction method that combines deep learning and time-frequency spectrogram can better describe the differences of different targets. In this paper, the proposed model contains three steps to deal with the recognition of underwater targets: feature extraction, data augmentation and deep neural network. For the feature extraction, we use a Mel-spectrogram, as well as the delta and delta-delta features in order to construct 3-D features. In the data augmentation part, we expand the dataset with SpecAugment in the time domain and frequency domain. In deep neural network prediction part, we use the convolutional recurrent neural network (CRNN) for acoustic target recognition. Through a comparison with the ablation test, it is clear that the pipeline in our method is effective in acquiring the recognition result. After evaluating our system through the carrying out of three tasks on the ShipsEar dataset, and the recognition accuracy are 94.6%, 87.5% and 72.6% in task 1, task 2 and task 3 respectively. (C) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Cough Recognition Based on Mel-Spectrogram and Convolutional Neural Network
    Zhou, Quan
    Shan, Jianhua
    Ding, Wenlong
    Wang, Chengyin
    Yuan, Shi
    Sun, Fuchun
    Li, Haiyuan
    Fang, Bin
    [J]. FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [2] Convolutional Neural Networks Using Log Mel-Spectrogram Separation for Audio Event Classification with Unknown Devices
    Seo, Soonshin
    Kim, Changmin
    Kim, Ji-Hwan
    [J]. JOURNAL OF WEB ENGINEERING, 2022, 21 (02): : 497 - 521
  • [3] 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition
    Chen, Mingyi
    He, Xuanji
    Yang, Jing
    Zhang, Han
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (10) : 1440 - 1444
  • [4] Underwater Image Classification Using Deep Convolutional Neural Networks and Data Augmentation
    Xu, Yifeng
    Zhang, Yang
    Wang, Huigang
    Liu, Xing
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2017,
  • [5] DATA AUGMENTATION WITH GABOR FILTER IN DEEP CONVOLUTIONAL NEURAL NETWORKS FOR SAR TARGET RECOGNITION
    Jiang, Ting
    Cui, Zongyong
    Zhou, Zhi
    Cao, Zongjie
    [J]. IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 689 - 692
  • [6] EEG driving fatigue detection based on log-Mel spectrogram and convolutional recurrent neural networks
    Gao, Dongrui
    Tang, Xue
    Wan, Manqing
    Huang, Guo
    Zhang, Yongqing
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [7] Convolutional Neural Network With Data Augmentation for SAR Target Recognition
    Ding, Jun
    Chen, Bo
    Liu, Hongwei
    Huang, Mengyuan
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (03) : 364 - 368
  • [8] Binary Volumetric Convolutional Neural Networks for 3-D Object Recognition
    Ma, Chao
    Guo, Yulan
    Lei, Yinjie
    An, Wei
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2019, 68 (01) : 38 - 48
  • [9] Radar HRRP Target Recognition with Recurrent Convolutional Neural Networks
    Shen, Mengqi
    Chen, Bo
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING, 2018, 11266 : 243 - 251
  • [10] Automatic Recognition of fMRI-Derived Functional Networks Using 3-D Convolutional Neural Networks
    Zhao, Yu
    Dong, Qinglin
    Zhang, Shu
    Zhang, Wei
    Chen, Hanbo
    Jiang, Xi
    Guo, Lei
    Hu, Xintao
    Han, Junwei
    Liu, Tianming
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2018, 65 (09) : 1975 - 1984