Acoustic Scene Recognition Based on Convolutional Neural Networks

Cited by: 0
Authors
Sun, Fengjiao [1 ]
Wang, Mingjiang [1 ]
Xu, Qihang [1 ]
Xuan, Xiaogung [1 ]
Zhang, Xin [1 ]
Affiliations
[1] Harbin Inst Technol, Elect & Informat Engn Coll, Shenzhen, Peoples R China
Keywords
Audio scene recognition; Log-mel spectrum; Convolutional neural network; Softmax;
DOI
10.1109/siprocess.2019.8868402
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Audio scene recognition is the process of automatically determining the scene around a device by extracting features from scene audio signals. It is chiefly concerned with the perception and understanding of non-speech signals, and it can guide machines toward more intelligent decisions. To address this task, this paper proposes an audio scene recognition method based on a convolutional neural network (CNN). First, the short-time Fourier transform and a Mel filter bank are used to transform the audio signal into a log-mel spectrum. Then, a CNN is trained on log-mel fragments to extract features. Finally, softmax is used to classify the CNN features. The method is tested on the IEEE DCASE 2018 data set. Experimental results show that the method achieves a high recognition rate.
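The front end and the final classification step described in the abstract can be sketched roughly as follows. This is only an illustrative NumPy sketch, not the authors' implementation: the sampling rate, FFT size, hop length, number of Mel bands, the Hann window, and all function names are assumptions chosen for the example, and the CNN itself is omitted (the softmax is applied to arbitrary scores standing in for its output).

```python
import numpy as np

def hz_to_mel(f):
    # Common HTK-style Mel scale formula (an assumption; the paper does not state which it uses).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filter_bank(n_mels, n_fft, sr):
    # Triangular filters whose edges are equally spaced on the Mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        left, center, right = hz_points[i:i + 3]
        rising = (fft_freqs - left) / (center - left)
        falling = (right - fft_freqs) / (right - center)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(signal, sr=16000, n_fft=512, hop=256, n_mels=40):
    # Short-time Fourier transform: windowed frames followed by a real FFT.
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[t * hop:t * hop + n_fft] * window
                       for t in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Project the power spectrum onto the Mel filters, then take the log.
    mel_energy = power @ mel_filter_bank(n_mels, n_fft, sr).T
    return np.log(mel_energy + 1e-10)  # small offset avoids log(0)

def softmax(scores):
    # Subtract the row maximum for numerical stability before exponentiating.
    shifted = scores - np.max(scores, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)
```

With these assumed settings, calling `log_mel_spectrogram` on a one-second signal sampled at 16 kHz yields a matrix with one row per frame and 40 Mel bands per row; fragments of such a matrix would be the CNN input, and `softmax` turns the network's per-class scores into probabilities that sum to one.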
Pages: 122-126
Number of pages: 5
Related papers
50 in total
  • [21] Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition
    Takahashi, Naoya
    Gygli, Michael
    Pfister, Beat
    Van Gool, Luc
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2982 - 2986
  • [22] Acoustic Pornography Recognition Using Convolutional Neural Networks and Bag of Refinements
    Zhou, Lifeng
    Wei, Kaifeng
    Li, Yuke
    Hao, Yiya
    Yang, Weiqiang
    Zhu, Haoqi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 840 - 845
  • [23] Acoustic event recognition using cochleagram image and convolutional neural networks
    Sharan, Roneel V.
    Moir, Tom J.
    APPLIED ACOUSTICS, 2019, 148 : 62 - 66
  • [24] Image Object and Scene Recognition Based on Improved Convolutional Neural Network
    Li, Guoyan
    Wang, Fei
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2024, 21 (05) : 925 - 937
  • [25] Convolutional Attention Networks for Scene Text Recognition
    Xie, Hongtao
    Fang, Shancheng
    Zha, Zheng-Jun
    Yang, Yating
    Li, Yan
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [26] Recognition of a Plant Leaf Based on Convolutional Neural Networks
    Guo, Yingjiu
    Wang, Dayu
    Zhu, Hongwei
    Li, Ailan
    TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
  • [27] Face Recognition Based on Lightweight Convolutional Neural Networks
    Liu, Wenting
    Zhou, Li
    Chen, Jie
    INFORMATION, 2021, 12 (05)
  • [28] EEG Based Emotion Recognition with Convolutional Neural Networks
    Ozcan, Caner
    Cizmeci, Huseyin
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [29] Convolutional Neural Networks in Hand Based Recognition System
    Prihodova, Katerina
    VISION 2025: EDUCATION EXCELLENCE AND MANAGEMENT OF INNOVATIONS THROUGH SUSTAINABLE ECONOMIC COMPETITIVE ADVANTAGE, 2019, : 4744 - 4750
  • [30] Gesture recognition system based on Convolutional neural networks
    Chistyakov, I. S.
    Chepin, E. V.
    2ND INTERNATIONAL TELECOMMUNICATION CONFERENCE ADVANCED MICRO- AND NANOELECTRONIC SYSTEMS AND TECHNOLOGIES, 2019, 498