Deep Neural Decision Forest for Acoustic Scene Classification

被引:0
|
作者
Sun, Jianyuan [1 ,3 ]
Liu, Xubo [1 ]
Mei, Xinhao [1 ]
Zhao, Jinzheng [1 ]
Plumbley, Mark D. [1 ]
Kilic, Volkan [2 ]
Wang, Wenwu [1 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Surrey, England
[2] Izmir Katip Celebi Univ, Dept Elect & Elect Engn, Izmir, Turkey
[3] Qingdao Univ, Coll Comp Sci & Technol, Qingdao, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
acoustic scene classification; random forest; convolution neural networks; deep learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic scene classification (ASC) aims to classify an audio clip based on the characteristic of the recording environment. In this regard, deep learning based approaches have emerged as a useful tool for ASC problems. Conventional approaches to improving the classification accuracy include integrating auxiliary methods such as attention mechanism, pre-trained models and ensemble multiple sub-networks. However, due to the complexity of audio clips captured from different environments, it is difficult to distinguish their categories without using any auxiliary methods for existing deep learning models using only a single classifier. In this paper, we propose a novel approach for ASC using deep neural decision forest (DNDF). DNDF combines a fixed number of convolutional layers and a decision forest as the final classifier. The decision forest consists of a fixed number of decision tree classifiers, which have been shown to offer better classification performance than a single classifier in some datasets. In particular, the decision forest differs substantially from traditional random forests as it is stochastic, differentiable, and capable of using the back-propagation to update and learn feature representations in neural network. Experimental results on the DCASE2019 and ESC-50 datasets demonstrate that our proposed DNDF method improves the ASC performance in terms of classification accuracy and shows competitive performance as compared with state-of-the-art baselines.
引用
收藏
页码:772 / 776
页数:5
相关论文
共 50 条
  • [1] Analysis of Deep Neural Network Models for Acoustic Scene Classification
    Basbug, Ahmet Melih
    Sert, Mustafa
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [2] Deep Scattering Spectra with Deep Neural Networks for Acoustic Scene Classification Tasks
    ZHANG Pengyuan
    CHEN Hangting
    BAI Haichuan
    YUAN Qingsheng
    [J]. Chinese Journal of Electronics, 2019, 28 (06) : 1177 - 1183
  • [3] Deep Scattering Spectra with Deep Neural Networks for Acoustic Scene Classification Tasks
    Zhang, Pengyuan
    Chen, Hangting
    Bai, Haichuan
    Yuan, Qingsheng
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2019, 28 (06) : 1177 - 1183
  • [4] The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene Classification
    Koutini, Khaled
    Eghbal-zadeh, Hamid
    Dorfer, Matthias
    Widmer, Gerhard
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [5] Lightweight deep neural networks for acoustic scene classification and an effective visualization for presenting sound scene contexts
    Pham, Lam
    Ngo, Dat
    Salovic, Dusan
    Jalali, Anahid
    Schindler, Alexander
    Nguyen, Phu X.
    Tran, Khoa
    Vu, Hai Canh
    [J]. APPLIED ACOUSTICS, 2023, 211
  • [6] Deep semantic learning for acoustic scene classification
    Yun-Fei Shao
    Xin-Xin Ma
    Yong Ma
    Wei-Qiang Zhang
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2024
  • [7] Deep Scalogram Representations for Acoustic Scene Classification
    Zhao Ren
    Kun Qian
    Zixing Zhang
    Vedhas Pandit
    Alice Baird
    Bjrn Schuller
    [J]. IEEE/CAA Journal of Automatica Sinica, 2018, 5 (03) : 662 - 669
  • [8] Deep Segment Model for Acoustic Scene Classification
    Wang, Yajian
    Du, Jun
    Chen, Hang
    Wang, Qing
    Lee, Chin-Hui
    [J]. INTERSPEECH 2022, 2022, : 4177 - 4181
  • [9] Deep semantic learning for acoustic scene classification
    Shao, Yun-Fei
    Ma, Xin-Xin
    Ma, Yong
    Zhang, Wei-Qiang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01)
  • [10] Deep Scalogram Representations for Acoustic Scene Classification
    Ren, Zhao
    Qian, Kun
    Zhang, Zixing
    Pandit, Vedhas
    Baird, Alice
    Schuller, Bjoern
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2018, 5 (03) : 662 - 669