Noise robust sound event classification with convolutional neural network

被引:61
|
作者
Ozer, Ilyas [1 ]
Ozer, Zeynep [1 ]
Findik, Oguz [1 ]
机构
[1] Karabuk Univ, Comp Engn Dept, Karabuk, Turkey
关键词
Sound event classification; Convolutional neural networks; Spectrogram; RECOGNITION; RETRIEVAL; DEEP;
D O I
10.1016/j.neucom.2017.07.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic sound recognition (ASR) is a remarkable field of research in recent years. The ability to automatically recognize sound events through computers in a complex audio environment is very useful for machine hearing, acoustic surveillance and multimedia retrieval applications. On the other hand, ASR task become highly difficult as the ambient noise levels increase and many traditional methods show very weak performance under noise. Recent studies has shown that spectrogram image features (SIF) have high performance under noise, while success rates in clean conditions are relatively lower than in the state-of-the-art approaches. In this study, after converting highly overlapped spectrograms into linear quantized images and reducing dimensions by applying various image resizing methods, feature extraction and classification are performed with convolutional neural networks (CNN), which have very high performance in image classification. In the mismatched case, the proposed method achieves a performance improvement of 4.5%, which is equivalent to a relative error reduction of 63.4%, with a classification success of 97.4%, while the multicondition training method achieves an average of 98.63% success rate. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:505 / 512
页数:8
相关论文
共 50 条
  • [1] Convolutional neural network based traffic sound classification robust to environmental noise
    Lee, Jaejun
    Kim, Wansoo
    Lee, Kyogu
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2018, 37 (06): : 469 - 474
  • [2] A noise robust convolutional neural network for image classification
    Momeny, Mohammad
    Latif, Ali Mohammad
    Sarram, Mehdi Agha
    Sheikhpour, Razieh
    Zhang, Yu Dong
    RESULTS IN ENGINEERING, 2021, 10
  • [3] Room Acoustic Adversarial Neural Network for Robust Sound Event Classification
    Upadhyaya, Sreenivasa
    Buyens, Wim
    Desmet, Wim
    Karsmakers, Peter
    AES: Journal of the Audio Engineering Society, 2024, 72 (11): : 754 - 766
  • [4] Environment Sound Event Classification With a Two-Stream Convolutional Neural Network
    Dong, Xifeng
    Yin, Bo
    Cong, Yanping
    Du, Zehua
    Huang, Xianqing
    IEEE ACCESS, 2020, 8 : 125714 - 125721
  • [5] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [6] Robust technique for environmental sound classification using convolutional recurrent neural network
    Bansal, Anam
    Garg, Naresh Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54755 - 54772
  • [7] Improved convolutional neural network and spectrogram image feature for traffic sound event classification
    Xu, Ke
    Yao, Jingyi
    Yao, Lingyun
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (13) : 4230 - 4244
  • [8] Robust Sound Event Classification with Local Time-Frequency Information and Convolutional Neural Networks
    Yao, Yanli
    Yu, Qiang
    Wang, Longbiao
    Dang, Jianwu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 351 - 361
  • [9] ROBUST SOUND EVENT RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Haomin
    McLoughlin, Ian
    Song, Yan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 559 - 563
  • [10] Animal Sound Classification Using A Convolutional Neural Network
    Sasmaz, Emre
    Tek, F. Boray
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 625 - 629