Noise robust sound event classification with convolutional neural network

Cited by: 61
Authors
Ozer, Ilyas [1]
Ozer, Zeynep [1]
Findik, Oguz [1]
Affiliations
[1] Karabuk Univ, Comp Engn Dept, Karabuk, Turkey
Keywords
Sound event classification; Convolutional neural networks; Spectrogram; RECOGNITION; RETRIEVAL; DEEP;
DOI
10.1016/j.neucom.2017.07.021
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic sound recognition (ASR) has been a notable field of research in recent years. The ability of computers to automatically recognize sound events in a complex audio environment is very useful for machine hearing, acoustic surveillance and multimedia retrieval applications. However, the ASR task becomes highly difficult as ambient noise levels increase, and many traditional methods perform very poorly under noise. Recent studies have shown that spectrogram image features (SIF) perform well under noise, while their success rates in clean conditions are relatively lower than those of state-of-the-art approaches. In this study, highly overlapped spectrograms are converted into linearly quantized images and their dimensions are reduced by applying various image resizing methods; feature extraction and classification are then performed with convolutional neural networks (CNN), which have very high performance in image classification. In the mismatched case, the proposed method achieves a performance improvement of 4.5%, which is equivalent to a relative error reduction of 63.4%, with a classification success of 97.4%, while the multicondition training method achieves an average success rate of 98.63%. (C) 2017 Elsevier B.V. All rights reserved.
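The preprocessing steps named in the abstract (overlapped spectrogram, linear quantization, image resizing) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the frame length, hop size, number of gray levels, output size and the nearest-neighbour resizing are all assumptions chosen for illustration, the function names are hypothetical, and the CNN classifier itself is omitted.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=32):
    """Magnitude spectrogram from heavily overlapping frames (hop << frame_len).

    frame_len and hop are illustrative assumptions, not values from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    # rfft over each frame, then transpose to (freq_bins, n_frames)
    return np.abs(np.fft.rfft(frames, axis=1)).T

def quantize_linear(spec, levels=256):
    """Linearly map spectrogram values to integer gray levels 0..levels-1.

    levels must not exceed 256 because the result is stored as uint8.
    """
    lo, hi = spec.min(), spec.max()
    scaled = (spec - lo) / (hi - lo + 1e-12)  # values in [0, 1)
    return np.minimum((scaled * levels).astype(np.int64), levels - 1).astype(np.uint8)

def resize_nearest(image, out_shape):
    """Nearest-neighbour resize, standing in for the 'various image resizing methods'."""
    rows = (np.arange(out_shape[0]) * image.shape[0]) // out_shape[0]
    cols = (np.arange(out_shape[1]) * image.shape[1]) // out_shape[1]
    return image[np.ix_(rows, cols)]

def sound_to_image(signal, out_shape=(32, 32)):
    """Full preprocessing chain: spectrogram -> quantized image -> resized image."""
    return resize_nearest(quantize_linear(spectrogram(signal)), out_shape)

# Synthetic demo input: a 440 Hz tone with a little noise, 1 s at 8 kHz (assumed values).
rng = np.random.default_rng(0)
t = np.arange(8000) / 8000.0
demo = np.sin(2 * np.pi * 440 * t) + 0.1 * rng.standard_normal(t.size)
image = sound_to_image(demo)
print(image.shape, image.dtype)
```

The resulting fixed-size 8-bit image is the kind of input a CNN image classifier would consume; which resizing method and image size work best is an empirical question the sketch does not address.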
Pages: 505 - 512
Number of pages: 8
Related papers
50 in total
  • [21] Optimized Convolutional Neural Network for Robust Crop/Weed Classification
    Panda, Bikramaditya
    Mishra, Manoj Kumar
    Mishra, Bhabani Shankar Prasad
    Tiwari, Abhinandan Kumar
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (04)
  • [22] Sound Classification Using Convolutional Neural Network and Tensor Deep Stacking Network
    Khamparia, Aditya
    Gupta, Deepak
    Nhu Gia Nguyen
    Khanna, Ashish
    Pandey, Babita
    Tiwari, Prayag
    IEEE ACCESS, 2019, 7 : 7717 - 7727
  • [23] Noise Masking Recurrent Neural Network for Respiratory Sound Classification
    Kochetov, Kirill
    Putin, Evgeny
    Balashov, Maksim
    Filchenkov, Andrey
    Shalyto, Anatoly
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 208 - 217
  • [24] Noise-Robust Sound-Event Classification System with Texture Analysis
    Choi, Yongju
    Atif, Othmane
    Lee, Jonguk
    Park, Daihee
    Chung, Yongwha
    SYMMETRY-BASEL, 2018, 10 (09):
  • [25] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    NEUROCOMPUTING, 2021, 453 (453) : 896 - 903
  • [26] Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification
    Lu, Jianrui
    Ma, Ruofei
    Liu, Gongliang
    Qin, Zhiliang
    2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 242 - 245
  • [27] Parralel Recurrent Convolutional Neural Network for Abnormal Heart Sound Classification
    Gharehbaghi, Arash
    Partovi, Elaheh
    Babic, Ankica
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 526 - 530
  • [28] Urban sound event classification with the N-order dense convolutional network
    Cao Y.
    Huang Z.
    Zhang W.
    Liu C.
    Li W.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (06) : 9 - 16 and 94
  • [29] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [30] Event Detection and Classification Using Deep Compressed Convolutional Neural Network
    Swapnika, K.
    Vasumathi, D.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 312 - 322