An Ensemble One Dimensional Convolutional Neural Network with Bayesian Optimization for Environmental Sound Classification

被引:18
|
作者
Ragab, Mohammed Gamal [1 ]
Abdulkadir, Said Jadid [1 ,2 ]
Aziz, Norshakirah [1 ,2 ]
Alhussian, Hitham [1 ,2 ]
Bala, Abubakar [3 ,4 ]
Alqushaibi, Alawi [1 ]
机构
[1] Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
[2] Univ Teknol PETRONAS, Ctr Res Data Sci CERDAS, Seri Iskandar 32610, Perak, Malaysia
[3] Univ Teknol PETRONAS, Elect & Elect Engn Dept, Seri Iskandar 32610, Perak, Malaysia
[4] Bayero Univ Kano, Elect Engn Dept, Kano 700241, Nigeria
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 10期
关键词
Bayesian optimization; convolutional neural networks; deep learning; ensemble learning; environmental sound classification; optimization; UrbanSound8k; PARTICLE SWARM OPTIMIZATION; FAULT-DIAGNOSIS; HYBRID MODEL; RECOGNITION; CNN; 1D; ARCHITECTURES;
D O I
10.3390/app11104660
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
With the growth of deep learning in various classification problems, many researchers have used deep learning methods in environmental sound classification tasks. This paper introduces an end-to-end method for environmental sound classification based on a one-dimensional convolution neural network with Bayesian optimization and ensemble learning, which directly learns features representation from the audio signal. Several convolutional layers were used to capture the signal and learn various filters relevant to the classification problem. Our proposed method can deal with any audio signal length, as a sliding window divides the signal into overlapped frames. Bayesian optimization accomplished hyperparameter selection and model evaluation with cross-validation. Multiple models with different settings have been developed based on Bayesian optimization to ensure network convergence in both convex and non-convex optimization. An UrbanSound8K dataset was evaluated for the performance of the proposed end-to-end model. The experimental results achieved a classification accuracy of 94.46%, which is 5% higher than existing end-to-end approaches with fewer trainable parameters. Four measurement indices, namely: sensitivity, specificity, accuracy, precision, recall, F-measure, area under ROC curve, and the area under the precision-recall curve were used to measure the model performance. The proposed approach outperformed state-of-the-art end-to-end approaches that use hand-crafted features as input in selected measurement indices and time complexity.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Deep Convolutional Neural Network with Mixup for Environmental Sound Classification
    Zhang, Zhichao
    Xu, Shugong
    Cao, Shan
    Zhang, Shunqing
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 356 - 367
  • [2] An Ensemble Stacked Convolutional Neural Network Model for Environmental Event Sound Recognition
    Li, Shaobo
    Yao, Yong
    Hu, Jie
    Liu, Guokai
    Yao, Xuemei
    Hu, Jianjun
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (07):
  • [3] Attention based convolutional recurrent neural network for environmental sound classification
    Zhang, Zhichao
    Xu, Shugong
    Zhang, Shunqing
    Qiao, Tianhao
    Cao, Shan
    [J]. NEUROCOMPUTING, 2021, 453 (453) : 896 - 903
  • [4] Deep Convolutional Neural Network with Transfer Learning for Environmental Sound Classification
    Lu, Jianrui
    Ma, Ruofei
    Liu, Gongliang
    Qin, Zhiliang
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL AND ROBOTICS (ICCCR 2021), 2021, : 242 - 245
  • [5] Deep convolutional neural network for environmental sound classification via dilation
    Roy, Sanjiban Sekhar
    Mihalache, Sanda Florentina
    Pricop, Emil
    Rodrigues, Nishant
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 1827 - 1833
  • [6] ENVIRONMENTAL SOUND CLASSIFICATION WITH CONVOLUTIONAL NEURAL NETWORKS
    Piczak, Karol J.
    [J]. 2015 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2015,
  • [7] Robust technique for environmental sound classification using convolutional recurrent neural network
    Anam Bansal
    Naresh Kumar Garg
    [J]. Multimedia Tools and Applications, 2024, 83 : 54755 - 54772
  • [8] Fast environmental sound classification based on resource adaptive convolutional neural network
    Zheng Fang
    Bo Yin
    Zehua Du
    Xianqing Huang
    [J]. Scientific Reports, 12
  • [9] Fast environmental sound classification based on resource adaptive convolutional neural network
    Fang, Zheng
    Yin, Bo
    Du, Zehua
    Huang, Xianqing
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [10] Deep Convolutional Neural Network Combined with Concatenated Spectrogram for Environmental Sound Classification
    Chi, Zhejian
    Li, Ying
    Chen, Cheng
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 251 - 254