Deep Convolutional Neural Network Combined with Concatenated Spectrogram for Environmental Sound Classification

被引:0
|
作者
Chi, Zhejian [1 ]
Li, Ying [1 ]
Chen, Cheng [1 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
关键词
Concatenated spectrogram; Deep convolutional neural network; Environmental sound classification;
D O I
10.1109/iccsnt47585.2019.8962462
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Environmental sound classification (ESC) is an important but challenging issue. In this paper, we propose a new deep convolutional neural network, which uses concatenated spectrogram as input features, for ESC task. This concatenated spectrogram feature we adopt can increase the richness of features compared with single spectrogram. It is generated by concatenating two regular spectrograms, the Log-Mel spectrogram and the Log-Gammatone spectrogram. The network we propose uses convolutional blocks to extract and derive high-level feature images from concatenated spectrogram, and each block is composed of three convolutional layers and a pooling layer. In order to keep depth of the network and reduce numbers of parameters, we use filter with a small receptive field in each convolutional layer. Besides, we use the average pooling to keep more information. Our method was tested on ESC-50 and UrbanSound8K and achieved classification accuracy of 83.8% and 80.3%, respectively. The experimental results show that the proposed method is suitable for ESC task.
引用
收藏
页码:251 / 254
页数:4
相关论文
共 50 条
  • [1] Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
    Zhang, Zhichao
    Xu, Shugong
    Cao, Shan
    Zhang, Shunqing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 356 - 367
  • [2] Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification
    Lu, Jianrui
    Ma, Ruofei
    Liu, Gongliang
    Qin, Zhiliang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 242 - 245
  • [3] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [4] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [5] Improved convolutional neural network and spectrogram image feature for traffic sound event classification
    Xu, Ke
    Yao, Jingyi
    Yao, Lingyun
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2023, 238 (13) : 4230 - 4244
  • [6] Sub-spectrogram Segmentation for Environmental Sound Classification via Convolutional Recurrent Neural Network and Score Level Fusion
    Qiao, Tianhao
    Zhang, Shunqing
    Zhang, Zhichao
    Cao, Shan
    Xu, Shugong
    [J]. PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019, : 318 - 323
  • [7] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [8] Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network
    Khamparia, Aditya
    Gupta, Deepak
    Nhu Gia Nguyen
    Khanna, Ashish
    Pandey, Babita
    Tiwari, Prayag
    [J]. IEEE ACCESS, 2019, 7 : 7717 - 7727
  • [9] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    [J]. NEUROCOMPUTING, 2021, 453 (453) : 896 - 903
  • [10] A Deep Convolutional Network for Multitype Signal Detection and Classification in Spectrogram
    Li, Weihao
    Wang, Keren
    You, Ling
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020 (2020)