A Method of Environmental Sound Classification Based on Residual Networks and Data Augmentation

被引:0
|
作者
Zeng, Jinfang [1 ]
Li, Youming [1 ]
Zhang, Yu [1 ]
Chen, Da [1 ]
机构
[1] Xiang Tan Univ, Sch Phys & Optoelect, Xiangtan 411105, Hunan, Peoples R China
关键词
Environmental sound classification; residual networks; data augmentation;
D O I
10.1142/S1469026821500188
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Environmental sound classication (ESC) is a challenging problem due to the complexity of sounds. To date, a variety of signal processing and machine learning techniques have been applied to ESC task, including matrix factorization, dictionary learning, waveletlterbanks and deep neural networks. It is observed that features extracted from deeper networks tend to achieve higher performance than those extracted from shallow networks. However, in ESC task, only the deep convolutional neural networks (CNNs) which contain several layers are used and the residual networks are ignored, which lead to degradation in the performance. Meanwhile, a possible explanation for the limited exploration of CNNs and the diffculty to improve on simpler models is the relative scarcity of labeled data for ESC. In this paper, a residual network called EnvResNet for the ESC task is proposed. In addition, we propose to use audio data augmentation to overcome the problem of data scarcity. The experiments will be performed on the ESC-50 database. Combined with data augmentation, the proposed model outperforms baseline implementations relying on mel-frequency cepstral coeffcients and achieves results comparable to other state-of-the-art approaches in terms of classifcation accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] INTERMIX: AN INTERFERENCE-BASED DATA AUGMENTATION AND REGULARIZATION TECHNIQUE FOR AUTOMATIC DEEP SOUND CLASSIFICATION
    Sawhney, Ramit
    Neerkaje, Atula Tejaswi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3443 - 3447
  • [42] FILTERAUGMENT: AN ACOUSTIC ENVIRONMENTAL DATA AUGMENTATION METHOD
    Nam, Hyeonuk
    Kim, Seong-Hu
    Park, Yong-Hwa
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4308 - 4312
  • [43] Data Augmentation for Intrusion Detection and Classification in Cloud Networks
    Chkirbene, Zina
    Ben Abdallah, Habib
    Hassine, Kawther
    Hamila, Ridha
    Erbad, Aiman
    IWCMC 2021: 2021 17TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2021, : 831 - 836
  • [44] Environmental sound classification method based on WVD and the improved ResNet50
    Sun, Wei
    Ma, Junjie
    Wang, Yu
    Shi, Weihao
    Xing, Lu
    Zhou, Zhiwei
    Ye, Hong
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 344 - 349
  • [45] Data Augmentation and Deep Learning Methods in Sound Classification: A Systematic Review
    Abayomi-Alli, Olusola O.
    Damasevicius, Robertas
    Qazi, Atika
    Adedoyin-Olowe, Mariam
    Misra, Sanjay
    ELECTRONICS, 2022, 11 (22)
  • [46] Dynamic Data Augmentation Method for Hyperspectral Image Classification Based on Siamese Structure
    Gao, Hongmin
    Zhang, Junpeng
    Cao, Xueying
    Chen, Zhonghao
    Zhang, Yiyan
    Li, Chenming
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 8063 - 8076
  • [47] TimeGAN data augmentation-based fault classification method for complex processes
    Yang, Lei
    He, Pengju
    Chou, Xingxing
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (09): : 1768 - 1780
  • [48] Iterative Translation-Based Data Augmentation Method for Text Classification Tasks
    Lee, Sangwon
    Liu, Ling
    Choi, Wonik
    IEEE ACCESS, 2021, 9 : 160437 - 160445
  • [49] DropMask: A data augmentation method for convolutional networks
    Gong, Diancheng
    Wang, Zhiling
    Wang, Hanqi
    Liang, Huawei
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1718 - 1722
  • [50] Environmental Sound Classification Based on Knowledge Distillation
    Cui, Qianjin
    Zhao, Kun
    Wang, Li
    Gao, Kai
    Cao, Fang
    Wang, Xiaoman
    2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1, 2022, : 245 - 249