Acoustic Scene Recognition Based on Convolutional Neural Networks

被引:0
|
作者
Sun, Fengjiao [1 ]
Wang, Mingjiang [1 ]
Xu, Qihang [1 ]
Xuan, Xiaogung [1 ]
Zhang, Xin [1 ]
机构
[1] Harbin Inst Technol, Elect & Informat Engn Coll, Shenzhen, Peoples R China
关键词
Audio scene recognition; Log-mel spectrum; Convolutional neural network; Softmax;
D O I
10.1109/siprocess.2019.8868402
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Audio scene recognition is a process of automatically determining the scene around the device by extracting the features of scene audio signals. It is more about the perception and understanding of non-speech signals, and has a profound guiding significance for the machine to make more intelligent choices. To solve this problem, this paper proposes an audio scene recognition method based on convolutional neural network. Firstly, short-time Fourier transform and Mel filter hank are used to transform the audio signal into log-mel spectrum. Then, log-mel fragments are trained by using CNN neural network, and the features are extracted. Finally, softmax was used to identify and classify CNN features. This method is used to test the data set of IEEE DCASE 2018. Experimental results show that this method has a high recognition rate.
引用
收藏
页码:122 / 126
页数:5
相关论文
共 50 条
  • [41] Convolutional recurrent neural networks with hidden Markov model bootstrap for scene text recognition
    Wang, Fenglei
    Guo, Qiang
    Lei, Jun
    Zhang, Jun
    IET COMPUTER VISION, 2017, 11 (06) : 497 - 504
  • [42] Classification for SAR Scene Matching Areas Based on Convolutional Neural Networks
    Zhong, Chengliang
    Mu, Xiaodong
    He, Xiangchen
    Zhan, Bichao
    Niu, Ben
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (09) : 1377 - 1381
  • [43] Remote Sensing Image Scene Classification Based on Convolutional Neural Networks
    Liu, Yumei
    Informatica (Slovenia), 2025, 49 (09): : 45 - 54
  • [44] Road Scene Depth Estimation Based on Deep Convolutional Neural Networks
    Yuan Jianzhong
    Zhou Wujie
    Pan Ting
    Gu Pengli
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (08)
  • [45] A Semantic-based Scene segmentation using convolutional neural networks
    Shaaban, Aya M.
    Salem, Nancy M.
    Al-atabany, Walid, I
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2020, 125
  • [46] SCENE SEMANTIC CLASSIFICATION BASED ON SCALE INVARIANCE CONVOLUTIONAL NEURAL NETWORKS
    Liu, Yanfei
    Zhong, Yanfei
    Zhao, Ji
    Ma, Ailong
    Qin, Qianqing
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 4754 - 4757
  • [47] An Innovative Acoustic Rain Gauge Based on Convolutional Neural Networks
    Avanzato, Roberta
    Beritelli, Francesco
    INFORMATION, 2020, 11 (04)
  • [48] An Improved Convolutional Neural Network-Based Scene Image Recognition Method
    Wang, Pinhe
    Qiao, Jianzhong
    Liu, Nannan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [49] Gabor Feature based Convolutional Neural Network for Object Recognition in Natural Scene
    Hu Yao
    Hu Dan
    Li Chuyi
    Yu Weiyu
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016, : 386 - 390
  • [50] An Improved Convolutional Neural Network-Based Scene Image Recognition Method
    Wang, Pinhe
    Qiao, Jianzhong
    Liu, Nannan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022