A Robust Framework For Acoustic Scene Classification

被引:19
|
作者
Lam Pham [1 ]
McLoughlin, Ian [1 ]
Huy Phan [1 ]
Palaniappan, Ramaswamy [1 ]
机构
[1] Univ Kent, Sch Comp, Medway, Kent, England
来源
关键词
Machine hearing; acoustic scene classification; convolutional neural network; deep neural network; spectrogram; log-Mel; Gammatone filter; constant Q transform;
D O I
10.21437/Interspeech.2019-1841
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Acoustic scene classification (ASC) using front-end time-frequency features and back-end neural network classifiers has demonstrated good performance in recent years. However a profusion of systems has arisen to suit different tasks and datasets, utilising different feature and classifier types. This paper aims at a robust framework that can explore and utilise a range of different time-frequency features and neural networks, either singly or merged, to achieve good classification performance. In particular, we exploit three different types of front-end time-frequency feature; log energy Mel filter, Gammatone filter and constant Q transform. At the back-end we evaluate effective a two-stage model that exploits a Convolutional Neural Network for pre-trained feature extraction, followed by Deep Neural Network classifiers as a post-trained feature adaptation model and classifier. We also explore the use of a data augmentation technique for these features that effectively generates a variety of intermediate data, reinforcing model learning abilities, particularly for marginal cases. We assess performance on the DCASE2016 dataset, demonstrating good classification accuracies exceeding 90%, significantly outperforming the DCASE2016 baseline and highly competitive compared to state-of-the-art systems.
引用
收藏
页码:3634 / 3638
页数:5
相关论文
共 50 条
  • [1] Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework
    Pham, Lam
    Phan, Huy
    Nguyen, Truc
    Palaniappan, Ramaswamy
    Mertins, Alfred
    McLoughlin, Ian
    DIGITAL SIGNAL PROCESSING, 2021, 110
  • [2] Feature Alignment for Robust Acoustic Scene Classification Across Devices
    Zhao, Jingqiao
    Kong, Qiuqiang
    Song, Xiaoning
    Feng, Zhenhua
    Wu, Xiaojun
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 578 - 582
  • [3] Robust Acoustic Scene Classification in the Presence of Active Foreground Speech
    Song, Siyuan
    Desplanques, Brecht
    De Moor, Celest
    Demuynck, Kris
    Madhu, Nilesh
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 995 - 999
  • [4] Novel Augmentation Schemes for Device Robust Acoustic Scene Classification
    Sonowal, Sukanya
    Tamse, Anish
    INTERSPEECH 2022, 2022, : 4182 - 4186
  • [5] Acoustic Scene Classification
    Barchiesi, Daniele
    Giannoulis, Dimitrios
    Stowell, Dan
    Plumbley, Mark D.
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (03) : 16 - 34
  • [6] A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION
    Hu, Hu
    Yang, Chao-Han Huck
    Xia, Xianjun
    Bai, Xue
    Tang, Xin
    Wang, Yajian
    Niu, Shutong
    Chai, Li
    Li, Juanjuan
    Zhu, Hongning
    Bao, Feng
    Zhao, Yuanjun
    Siniscalchi, Sabato Marco
    Wang, Yannan
    Du, Jun
    Lee, Chin-Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 845 - 849
  • [7] DOMAIN MISMATCH ROBUST ACOUSTIC SCENE CLASSIFICATION USING CHANNEL INFORMATION CONVERSION
    Mun, Seongkyu
    Shon, Suwon
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 845 - 849
  • [8] A Layer-wise Score Level Ensemble Framework for Acoustic Scene Classification
    Singh, Arshdeep
    Thakur, Anshul
    Rajan, Padmanabhan
    Bhavsar, Arnav
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 837 - 841
  • [9] ACOUSTIC SCENE CLASSIFICATION: A COMPETITION REVIEW
    Gharib, Shayan
    Derrar, Honain
    Niizumi, Daisuke
    Senttula, Tuukka
    Tommola, Janne
    Heittola, Toni
    Virtanen, Tuomas
    Huttunen, Heikki
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [10] Acoustic Event and Scene Classification: A Review
    Manjunath Mulimani
    Spoorthy Venkatesh
    Shashidhar G. Koolagudi
    SN Computer Science, 6 (1)