A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

被引:0
|
作者
Zeng, Jinfang [1 ]
Li, Youming [1 ]
Zhang, Yu [1 ]
Chen, Da [1 ]
机构
[1] Xiang Tan Univ, Sch Phys & Optoelect, Xiangtan 411105, Hunan, Peoples R China
关键词
Environmental sound classification; residual networks; data augmentation;
D O I
10.1142/S1469026821500188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Environmental sound classication (ESC) is a challenging problem due to the complexity of sounds. To date, a variety of signal processing and machine learning techniques have been applied to ESC task, including matrix factorization, dictionary learning, waveletlterbanks and deep neural networks. It is observed that features extracted from deeper networks tend to achieve higher performance than those extracted from shallow networks. However, in ESC task, only the deep convolutional neural networks (CNNs) which contain several layers are used and the residual networks are ignored, which lead to degradation in the performance. Meanwhile, a possible explanation for the limited exploration of CNNs and the diffculty to improve on simpler models is the relative scarcity of labeled data for ESC. In this paper, a residual network called EnvResNet for the ESC task is proposed. In addition, we propose to use audio data augmentation to overcome the problem of data scarcity. The experiments will be performed on the ESC-50 database. Combined with data augmentation, the proposed model outperforms baseline implementations relying on mel-frequency cepstral coeffcients and achieves results comparable to other state-of-the-art approaches in terms of classifcation accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Temporal Self-Attention-Based Residual Network for An Environmental Sound Classification
    Tripathi, Achyut Mani
    Paul, Konark
    INTERSPEECH 2022, 2022, : 1516 - 1520
  • [22] A Heart Sound Classification Method Based on Residual Block and Attention Mechanism
    Chen, Yujie
    Zhu, Wenliang
    Xu, Jinke
    Zhang, Junwei
    Zhu, Zhanpeng
    Wang, Lirong
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1060 - 1065
  • [23] A CNN Sound Classification Mechanism Using Data Augmentation
    Chu, Hung-Chi
    Zhang, Young-Lin
    Chiang, Hao-Chu
    SENSORS, 2023, 23 (15)
  • [24] Investigation of Data Augmentation Techniques in Environmental Sound Recognition
    Sarris, Anastasios Loukas
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Dimoulas, Charalampos
    ELECTRONICS, 2024, 13 (23):
  • [25] Optimization Method of Residual Networks of Residual Networks for Image Classification
    Zhang, Ke
    Guo, Liru
    Gao, Ce
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 321 - 325
  • [26] Optimization Method of Residual Networks of Residual Networks for Image Classification
    Lin, Long
    Yuan, Hao
    Guo, Liru
    Kuang, Yingqun
    Zhang, Ke
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 212 - 222
  • [27] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [28] Data Augmentation Based on Generative Adversarial Networks for Endoscopic Image Classification
    Park, Hyun-Cheol
    Hong, In-Pyo
    Poudel, Sahadev
    Choi, Chang
    IEEE ACCESS, 2023, 11 : 49216 - 49225
  • [29] Research on Data Augmentation for Image Classification Based on Convolution Neural Networks
    Jia Shijie
    Wang Ping
    Jia Peiyi
    Hu Siping
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4165 - 4170
  • [30] Environmental Sound Classification Method Based on Compact Bilinear Attention Network
    Dong S.
    Xia Z.
    Cai W.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (06): : 102 - 107