Acoustic Scene Recognition Based on Convolutional Neural Networks

Cited by: 0
Authors
Sun, Fengjiao [1]
Wang, Mingjiang [1]
Xu, Qihang [1]
Xuan, Xiaogung [1]
Zhang, Xin [1]
Affiliations
[1] Harbin Inst Technol, Elect & Informat Engn Coll, Shenzhen, Peoples R China
Keywords
Audio scene recognition; Log-mel spectrum; Convolutional neural network; Softmax
DOI
10.1109/siprocess.2019.8868402
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Audio scene recognition is the process of automatically determining the scene around a device by extracting features from the scene's audio signal. It is chiefly concerned with the perception and understanding of non-speech signals, and it helps machines make more intelligent choices. To address this problem, this paper proposes an audio scene recognition method based on a convolutional neural network (CNN). First, the short-time Fourier transform and a Mel filter bank are used to transform the audio signal into a log-mel spectrum. Then, a CNN is trained on log-mel fragments and used to extract features. Finally, softmax is used to classify the CNN features. The method is evaluated on the IEEE DCASE 2018 data set. Experimental results show that the method achieves a high recognition rate.
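The front end and the final classification step named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the sample rate, FFT size, hop length, number of Mel bands, and all function names are assumptions chosen for illustration, and the CNN between the two steps is omitted.

```python
import numpy as np

def hz_to_mel(f):
    # Common HTK-style Hz-to-mel mapping (an assumption; the paper does not specify one).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filter_bank(sr, n_fft, n_mels):
    """Triangular filters evenly spaced on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    hz_points = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    bank = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        left, center, right = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        bank[i] = np.maximum(0.0, np.minimum(rising, falling))
    return bank

def log_mel_spectrogram(signal, sr=16000, n_fft=1024, hop=512, n_mels=40):
    """STFT power spectrum -> Mel filter bank -> log. All defaults are hypothetical."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (n_frames, n_fft // 2 + 1)
    mel = power @ mel_filter_bank(sr, n_fft, n_mels).T    # (n_frames, n_mels)
    return np.log(mel + 1e-10)                            # small offset avoids log(0)

def softmax(logits):
    """Numerically stable softmax over the last axis (turns class scores into probabilities)."""
    shifted = logits - np.max(logits, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)

if __name__ == "__main__":
    t = np.arange(16000) / 16000.0
    features = log_mel_spectrogram(np.sin(2 * np.pi * 440.0 * t))
    print(features.shape)                        # frames x Mel bands
    print(softmax(np.array([[1.0, 2.0, 3.0]])))  # probabilities over three dummy classes
```

In a full pipeline, fixed-length fragments of the log-mel matrix would be fed to the CNN, and softmax would be applied to the network's output scores, one per scene class.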
Pages: 122-126
Page count: 5