Acoustic Scene Recognition Based on Convolutional Neural Networks

被引:0
|
作者
Sun, Fengjiao [1 ]
Wang, Mingjiang [1 ]
Xu, Qihang [1 ]
Xuan, Xiaogung [1 ]
Zhang, Xin [1 ]
机构
[1] Harbin Inst Technol, Elect & Informat Engn Coll, Shenzhen, Peoples R China
关键词
Audio scene recognition; Log-mel spectrum; Convolutional neural network; Softmax;
D O I
10.1109/siprocess.2019.8868402
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Audio scene recognition is a process of automatically determining the scene around the device by extracting the features of scene audio signals. It is more about the perception and understanding of non-speech signals, and has a profound guiding significance for the machine to make more intelligent choices. To solve this problem, this paper proposes an audio scene recognition method based on convolutional neural network. Firstly, short-time Fourier transform and Mel filter hank are used to transform the audio signal into log-mel spectrum. Then, log-mel fragments are trained by using CNN neural network, and the features are extracted. Finally, softmax was used to identify and classify CNN features. This method is used to test the data set of IEEE DCASE 2018. Experimental results show that this method has a high recognition rate.
引用
收藏
页码:122 / 126
页数:5
相关论文
共 50 条
  • [1] Multispectral scene recognition based on dual convolutional neural networks
    Sevo, Igor
    Avramovic, Aleksej
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2017, : 126 - 130
  • [2] Shallow Convolutional Neural Networks for Acoustic Scene Classification
    LU Lu
    YANG Yuhong
    JIANG Yuzhi
    AI Haojun
    TU Weiping
    [J]. Wuhan University Journal of Natural Sciences, 2018, 23 (02) : 178 - 184
  • [3] SCENE TEXT RECOGNITION WITH DEEPER CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Yuqi
    Wang, Wei
    Wang, Liang
    Wang, Liuan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2384 - 2388
  • [4] Natural Scene Recognition Based on Convolutional Neural Networks and Deep Boltzmannn Machines
    Gao, Jingyu
    Yang, Jinfu
    Zhang, Jizhao
    Li, Mingai
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2015, : 2369 - 2374
  • [5] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    [J]. IEEE ACCESS, 2024, 12 : 8123 - 8134
  • [6] Indoor Scene Recognition Mechanism Based on Direction-Driven Convolutional Neural Networks
    Daou, Andrea
    Pothin, Jean-Baptiste
    Honeine, Paul
    Bensrhair, Abdelaziz
    [J]. SENSORS, 2023, 23 (12)
  • [7] Hybrid convolutional neural networks for articulatory and acoustic information based speech recognition
    Mitra, Vikramjit
    Sivaraman, Ganesh
    Nam, Hosung
    Espy-Wilson, Carol
    Saltzman, Elliot
    Tiede, Mark
    [J]. SPEECH COMMUNICATION, 2017, 89 : 103 - 112
  • [8] SUBSPECTRALNET - USING SUB-SPECTROGRAM BASED CONVOLUTIONAL NEURAL NETWORKS FOR ACOUSTIC SCENE CLASSIFICATION
    Phaye, Sai Samarth R.
    Benetos, Emmanouil
    Wang, Ye
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 825 - 829
  • [9] Underwater Acoustic Target Recognition Based on Dual Attention Networks and Multiresolution Convolutional Neural Networks
    Liu, Chengwei
    Hong, Feng
    Feng, Haihong
    Hu, Menglu
    [J]. OCEANS 2021: SAN DIEGO - PORTO, 2021,
  • [10] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    [J]. 2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34