A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

被引:0
|
作者
Zeng, Jinfang [1 ]
Li, Youming [1 ]
Zhang, Yu [1 ]
Chen, Da [1 ]
机构
[1] Xiang Tan Univ, Sch Phys & Optoelect, Xiangtan 411105, Hunan, Peoples R China
关键词
Environmental sound classification; residual networks; data augmentation;
D O I
10.1142/S1469026821500188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Environmental sound classication (ESC) is a challenging problem due to the complexity of sounds. To date, a variety of signal processing and machine learning techniques have been applied to ESC task, including matrix factorization, dictionary learning, waveletlterbanks and deep neural networks. It is observed that features extracted from deeper networks tend to achieve higher performance than those extracted from shallow networks. However, in ESC task, only the deep convolutional neural networks (CNNs) which contain several layers are used and the residual networks are ignored, which lead to degradation in the performance. Meanwhile, a possible explanation for the limited exploration of CNNs and the diffculty to improve on simpler models is the relative scarcity of labeled data for ESC. In this paper, a residual network called EnvResNet for the ESC task is proposed. In addition, we propose to use audio data augmentation to overcome the problem of data scarcity. The experiments will be performed on the ESC-50 database. Combined with data augmentation, the proposed model outperforms baseline implementations relying on mel-frequency cepstral coeffcients and achieves results comparable to other state-of-the-art approaches in terms of classifcation accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A Pattern Recognition System for Environmental Sound Classification based on MFCCs and Neural Networks
    Beritelli, F.
    Grasso, R.
    ICSPCS: 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, PROCEEDINGS, 2008, : 453 - 456
  • [32] A DOMAIN TRANSFER BASED DATA AUGMENTATION METHOD FOR AUTOMATED RESPIRATORY CLASSIFICATION
    Wang, Zijie
    Wang, Zhao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 9017 - 9021
  • [33] HRTF-Based Data Augmentation Method for Acoustic Scene Classification
    Liu, Yingzi
    Yang, Haocong
    Shi, Chuang
    Liang, Jiangnan
    2021 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2021,
  • [34] Hybrid Computerized Method for Environmental Sound Classification
    Ullo, Silvia Liberata
    Khare, Smith K.
    Bajaj, Varun
    Sinha, G. R.
    IEEE ACCESS, 2020, 8 (08): : 124055 - 124065
  • [35] A DATA AUGMENTATION APPROACH BASED ON GENERATIVE ADVERSARIAL NETWORKS FOR DATE FRUIT CLASSIFICATION
    Ufuah, Donald
    Thomas, Gabriel
    Balocco, Simone
    Manickavasagan, Annamalai
    APPLIED ENGINEERING IN AGRICULTURE, 2022, 38 (06) : 975 - 982
  • [36] Environmental sound sources classification using neural networks
    Stoeckle, S
    Pah, N
    Kumar, DK
    McLachlan, N
    ANZIIS 2001: PROCEEDINGS OF THE SEVENTH AUSTRALIAN AND NEW ZEALAND INTELLIGENT INFORMATION SYSTEMS CONFERENCE, 2001, : 399 - 403
  • [37] Masked Conditional Neural Networks for Environmental Sound Classification
    Medhat, Fady
    Chesmore, David
    Robinson, John
    ARTIFICIAL INTELLIGENCE XXXIV, AI 2017, 2017, 10630 : 21 - 33
  • [38] EASY DATA AUGMENTATION METHOD FOR CLASSIFICATION TASKS
    Liu Guohang
    Zhang Shibin
    Tang Haozhe
    Yang Lu
    Lu Jiazhong
    Huang Yuanyuan
    2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, : 166 - 169
  • [39] CNN-RNN and Data Augmentation Using Deep Convolutional Generative Adversarial Network for Environmental Sound Classification
    Bahmei, Behnaz
    Birmingham, Elina
    Arzanpour, Siamak
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 682 - 686
  • [40] Data Augmentation for Environmental Sound Classification Using Diffusion Probabilistic Model with Top-K Selection Discriminator
    Chen, Yunhao
    Yan, Zihui
    Zhu, Yunjie
    Ren, Zhen
    Shen, Jianlu
    Huang, Yifan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 283 - 295