Environmental Sound Classification using Deep Convolutional Neural Networks and Data Augmentation

被引:0
|
作者
Davis, Nithya [1 ]
Suresh, K. [1 ]
机构
[1] Coll Engn, Dept Elect & Commun Engn, Trivandrum, Kerala, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work is about environmental sound classification by deep convolutional neural networks and data augmentation. Data augmentation is applied to increase the labeled training dataset. Data augmentation process improves the performance of audio classification. In this paper, first we present a strategy for generating a deep convolutional neural network (CNN) framework for environmental sound analysis with Urbansound8K audio dataset. Secondly we analyze the performance of data augmentation methods on Urbansound8K audio dataset and compare the performance of CNN with different data augmentation methodologies. Data augmentation is basically a deformation technique. By this approach we can increase the number of dataset elements into its multiples. Here, compare the performance of different augmentation method to identify which one is the best augmentation technique for environmental sound analysis. Different types of data augmentations were applied to the dataset in the previous works. We introduce a new data augmentation method using LPCC feature.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [1] Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification
    Salamon, Justin
    Bello, Juan Pablo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (03) : 279 - 283
  • [2] Environmental sound classification using a regularized deep convolutional neural network with data augmentation
    Mushtaq, Zohaib
    Su, Shun-Feng
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [3] Underwater Image Classification Using Deep Convolutional Neural Networks and Data Augmentation
    Xu, Yifeng
    Zhang, Yang
    Wang, Huigang
    Liu, Xing
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2017,
  • [4] Skin melanoma classification using ROI and data augmentation with deep convolutional neural networks
    Hosny, Khalid M.
    Kassem, Mohamed A.
    Foaud, Mohamed M.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 24029 - 24055
  • [5] Skin melanoma classification using ROI and data augmentation with deep convolutional neural networks
    Khalid M. Hosny
    Mohamed A. Kassem
    Mohamed M. Foaud
    [J]. Multimedia Tools and Applications, 2020, 79 : 24029 - 24055
  • [6] Data Augmentation and the Improvement of the Performance of Convolutional Neural Networks for Heart Sound Classification
    Takezaki, Shumpei
    Kishida, Kazuya
    [J]. IAENG International Journal of Computer Science, 2022, 49 (04)
  • [7] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    [J]. 2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [8] CNN-RNN and Data Augmentation Using Deep Convolutional Generative Adversarial Network for Environmental Sound Classification
    Bahmei, Behnaz
    Birmingham, Elina
    Arzanpour, Siamak
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 682 - 686
  • [9] Sound Classification Using Convolutional Neural Networks
    Jaiswal, Kaustumbh
    Patel, Dhairya Kalpeshbhai
    [J]. 2018 SEVENTH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2018, : 81 - 84
  • [10] Hyperspectral Data Classification using Deep Convolutional Neural Networks
    Salman, Mesut
    Yuksel, Seniha Esen
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 2129 - 2132