Deep Scattering Spectra with Deep Neural Networks for Acoustic Scene Classification Tasks

Authors
ZHANG Pengyuan [1, 2]
CHEN Hangting [1, 2]
BAI Haichuan [1, 2]
YUAN Qingsheng [3]
Affiliations
[1] Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
[3] National Computer Network Emergency Response Technical Team/Coordination Center of China
Keywords
Acoustic scene classification; Time-delay neural network; Deep scattering spectrum; Detection and classification of acoustic scenes and events (DCASE)
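DOI
Not available
Chinese Library Classification (CLC) numbers
TB52 [Acoustic measurement]; O657.3 [Photochemical analysis (spectral analysis)]; TP183 [Artificial neural networks and computation]
Discipline classification codes
070302; 0804; 081104; 0812; 081704; 0835; 1405
Abstract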
As one of the most commonly used features, Mel-frequency cepstral coefficients (MFCCs) are less discriminative at high frequencies. A novel technique, known as the deep scattering spectrum (DSS), addresses this issue and aims to preserve more detail. DSS features have shown promise in both classification and recognition tasks. In this paper, we extend the use of DSS features to the acoustic scene classification task. Results on Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 and 2017 show that DSS provided 4.8% and 17.4% relative improvements in accuracy over MFCC features within a state-of-the-art time-delay neural network framework.
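The abstract reports relative, not absolute, accuracy improvements. As a minimal sketch of that distinction, the snippet below assumes the standard definition (gain divided by the baseline accuracy); the abstract does not state the formula, and the function name and accuracy values are hypothetical illustrations, not results from the paper.

```python
def relative_improvement(baseline_acc: float, new_acc: float) -> float:
    """Relative accuracy improvement of new_acc over baseline_acc, in percent.

    Assumes the standard definition: (new - baseline) / baseline.
    """
    if baseline_acc <= 0:
        raise ValueError("baseline accuracy must be positive")
    return 100.0 * (new_acc - baseline_acc) / baseline_acc


# Hypothetical accuracies in percent; NOT taken from the paper.
mfcc_acc = 80.0
dss_acc = 84.0

absolute_gain = dss_acc - mfcc_acc                        # percentage points
relative_gain = relative_improvement(mfcc_acc, dss_acc)   # percent of baseline

print(f"absolute gain: {absolute_gain:.1f} points")
print(f"relative gain: {relative_gain:.1f}%")
```

Under this definition, a 4-point absolute gain on an 80% baseline is a 5% relative gain, so the same relative figure corresponds to different absolute gains depending on the baseline.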
Pages: 1177-1183
Page count: 7
Source: Chinese Journal of Electronics, 2019, 28 (06): 1177-1183