Noise robust sound event classification with convolutional neural network

Cited by: 61
Authors
Ozer, Ilyas [1]
Ozer, Zeynep [1]
Findik, Oguz [1]
Affiliations
[1] Karabuk Univ, Comp Engn Dept, Karabuk, Turkey
Keywords
Sound event classification; Convolutional neural networks; Spectrogram; RECOGNITION; RETRIEVAL; DEEP;
DOI
10.1016/j.neucom.2017.07.021
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Automatic sound recognition (ASR) has been a notable field of research in recent years. The ability to automatically recognize sound events by computer in a complex audio environment is very useful for machine hearing, acoustic surveillance and multimedia retrieval applications. On the other hand, the ASR task becomes highly difficult as ambient noise levels increase, and many traditional methods perform very poorly under noise. Recent studies have shown that spectrogram image features (SIF) perform well under noise, while their success rates in clean conditions are relatively lower than those of state-of-the-art approaches. In this study, after converting highly overlapped spectrograms into linear quantized images and reducing dimensions by applying various image resizing methods, feature extraction and classification are performed with convolutional neural networks (CNN), which have very high performance in image classification. In the mismatched case, the proposed method achieves a performance improvement of 4.5%, which is equivalent to a relative error reduction of 63.4%, with a classification success of 97.4%, while the multicondition training method achieves an average success rate of 98.63%. (C) 2017 Elsevier B.V. All rights reserved.
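The front end described in the abstract (overlapped spectrogram, linear quantization to an image, resizing) and the reported error arithmetic can be illustrated with a minimal Python sketch. This is not the authors' implementation: the frame length, hop size, log compression, 256 grey levels, 32 x 32 output size and nearest-neighbour resizing are all assumptions chosen for illustration, and the function names are hypothetical. The CNN classifier itself is not shown. The arithmetic check assumes the 4.5% improvement is an absolute gain in accuracy, which implies a baseline of 92.9%.

```python
import numpy as np

def spectrogram_image(signal, frame_len=256, hop=32, out_shape=(32, 32), levels=256):
    """Turn a 1-D signal into a small quantized spectrogram image.

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    # Highly overlapped frames: the hop is much smaller than the frame length.
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # Magnitude spectrogram with shape (frequency bins, time frames).
    spec = np.abs(np.fft.rfft(frames, axis=1)).T
    log_spec = np.log1p(spec)  # log compression (an assumption)

    # Linear quantization: map the value range onto integer grey levels.
    lo, hi = log_spec.min(), log_spec.max()
    norm = (log_spec - lo) / (hi - lo + 1e-12)
    image = np.round(norm * (levels - 1)).astype(np.uint8)

    # Nearest-neighbour resize by index selection (one of many resizing methods).
    rows = np.linspace(0, image.shape[0] - 1, out_shape[0]).round().astype(int)
    cols = np.linspace(0, image.shape[1] - 1, out_shape[1]).round().astype(int)
    return image[np.ix_(rows, cols)]

def relative_error_reduction(baseline_acc, proposed_acc):
    """Relative error reduction in percent, given accuracies in percent."""
    baseline_err = 100.0 - baseline_acc
    proposed_err = 100.0 - proposed_acc
    return 100.0 * (baseline_err - proposed_err) / baseline_err

# Synthetic one-second noise signal at 16 kHz (hypothetical input).
rng = np.random.default_rng(0)
img = spectrogram_image(rng.standard_normal(16000))
print(img.shape, img.dtype)

# Baseline 92.9% is inferred as 97.4% minus the 4.5-point improvement.
print(round(relative_error_reduction(92.9, 97.4), 1))
```

With an implied baseline error of 7.1% and a proposed error of 2.6%, the reduction of 4.5 points divided by 7.1 gives about 63.4%, which is consistent with the figures quoted in the abstract.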
Pages: 505-512
Number of pages: 8