Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

被引:71
|
作者
Zhang, Zhichao [1 ]
Xu, Shugong [1 ]
Cao, Shan [1 ]
Zhang, Shunqing [1 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai 200444, Peoples R China
关键词
Environmental sound classification; Convolutional neural network; Mixup;
D O I
10.1007/978-3-030-03335-4_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Environmental sound classification (ESC) is an important and challenging problem. In contrast to speech, sound events have noise like nature and may be produced by a wide variety of sources. In this paper, we propose to use a novel deep convolutional neural network for ESC tasks. Our network architecture uses stacked convolutional and pooling layers to extract high-level feature representations from spectrogram-like features. Furthermore, we apply mixup to ESC tasks and explore its impacts on classification performance and feature distribution. Experiments were conducted on UrbanSound8K, ESC-50 and ESC-10 datasets. Our experimental results demonstrated that our ESC system has achieved the state-of-the-art performance (83.7%) on UrbanSound8K and competitive performance on ESC-50 and ESC-10.
引用
收藏
页码:356 / 367
页数:12
相关论文
共 50 条
  • [1] Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification
    Lu, Jianrui
    Ma, Ruofei
    Liu, Gongliang
    Qin, Zhiliang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 242 - 245
  • [2] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [3] Deep Convolutional Neural Network Combined with Concatenated Spectrogram for Environmental Sound Classification
    Chi, Zhejian
    Li, Ying
    Chen, Cheng
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 251 - 254
  • [4] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [5] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [6] Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network
    Khamparia, Aditya
    Gupta, Deepak
    Nhu Gia Nguyen
    Khanna, Ashish
    Pandey, Babita
    Tiwari, Prayag
    [J]. IEEE ACCESS, 2019, 7 : 7717 - 7727
  • [7] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    [J]. NEUROCOMPUTING, 2021, 453 : 896 - 903
  • [8] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    [J]. 2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [9] Environmental Sound Classification using Deep Convolutional Neural Networks and Data Augmentation
    Davis, Nithya
    Suresh, K.
    [J]. 2018 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2018, : 41 - 45
  • [10] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    [J]. Multimedia Tools and Applications, 2024, 83 : 54755 - 54772