Deep Scattering Spectra with Deep Neural Networks for Acoustic Scene Classification Tasks

Authors
ZHANG Pengyuan [1, 2]
CHEN Hangting [1, 2]
BAI Haichuan [1, 2]
YUAN Qingsheng [3]
Affiliations
[1] Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences
[2] University of Chinese Academy of Sciences
[3] National Computer Network Emergency Response Technical Team/Coordination Center of China
Keywords
Acoustic scene classification; Time-delay neural network; Deep scattering spectrum; Detection and classification of acoustic scenes and events (DCASE)
DOI
Not available
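Chinese Library Classification (CLC) number
TB52 [Acoustic measurement]; O657.3 [Photochemical analysis (spectral analysis)]; TP183 [Artificial neural networks and computation]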
Subject classification codes
070302; 0804; 081104; 0812; 081704; 0835; 1405
Abstract
Although Mel-frequency cepstral coefficients (MFCCs) are among the most commonly used features, they are less discriminative at high frequencies. A novel technique, known as the deep scattering spectrum (DSS), addresses this issue and aims to preserve more detail. DSS features have shown promise on both classification and recognition tasks. In this paper, we extend the use of DSS features to the acoustic scene classification task. Results on Detection and Classification of Acoustic Scenes and Events (DCASE) 2016 and 2017 show that DSS provides 4.8% and 17.4% relative improvements in accuracy over MFCC features, respectively, within a state-of-the-art time-delay neural network framework.
Pages: 1177-1183
Number of pages: 7