Sparse Representation Frameworks for Acoustic Scene Classification

被引:0
|
作者
Tyagi, Akansha [1 ]
Rajan, Padmanabhan [1 ]
机构
[1] Indian Inst Technol, Sch Comp & Elect Engn, Mandi, Himachal Prades, India
来源
关键词
Acoustic Scene Classification; Sparse Representation Classification; Sparse Auto-Encoder; Sparse Representation;
D O I
10.1007/978-3-031-48309-7_15
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work addresses the task of acoustic scene classification (ASC) by using sparse representation frameworks, motivated by the inherent sparseness of audio data. We explore three different sparse representation classification (SRC) frameworks, generating sparse acoustic scene representations. The first two frameworks focus on producing linear and non-linear features respectively. On the other hand, the third framework presents a novel approach-a two-branch deep sparse auto-encoder (DSAE) representation framework that generates non-linear and discriminative features. In the proposed framework, the first branch induces sparsity, while the second focuses on enforcing discrimination within the learned sparse acoustic scene representations. These representations are later used to classify the acoustic scene data into different acoustic scene classes. We also compare the performance of the three sparse frameworks by evaluating them on three ASC datasets. Our results indicate that acoustic scene representations based on DSAE outperform the sparse representations obtained from the other two frameworks. This results in an average performance gain of approximately 8% across all the ASC datasets.
引用
收藏
页码:177 / 188
页数:12
相关论文
共 50 条
  • [11] Sparse representation for waveforms classification
    Xiao, Shanzhu
    Zhao, Bendong
    Lu, Huanzhang
    Wu, Dongya
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [12] LEARNING THE SPARSE REPRESENTATION FOR CLASSIFICATION
    Yang, Jianchao
    Wang, Jiangping
    Huang, Thomas
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [13] Sparse neighbor representation for classification
    Hui, Kang-hua
    Li, Chun-li
    Zhang, Lei
    PATTERN RECOGNITION LETTERS, 2012, 33 (05) : 661 - 669
  • [14] Mid-Level Feature Representation via Sparse Autoencoder for Remotely Sensed Scene Classification
    Li, Erzhu
    Du, Peijun
    Samat, Alim
    Meng, Yaping
    Che, Meiqin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2017, 10 (03) : 1068 - 1081
  • [15] Object clique representation for scene classification
    Chen, Jingjing
    Cao, Xiaochun
    Zhang, Bao
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2829 - 2832
  • [16] ACOUSTIC FEATURE EXTRACTION BY TENSOR-BASED SPARSE REPRESENTATION FOR SOUND EFFECTS CLASSIFICATION
    Zhang, Xueyuan
    He, Qianhua
    Feng, Xiaohui
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 166 - 170
  • [17] Traffic Scene Classification on a Representation Budget
    Sikiric, Ivan
    Brkic, Karla
    Bevandic, Petra
    Kreso, Ivan
    Krapac, Josip
    Segvic, Sinisa
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (01) : 336 - 345
  • [18] Scene Classification with a Sparse Set of Salient Regions
    Borji, Ali
    Itti, Laurent
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011, : 1902 - 1908
  • [19] ACOUSTIC SCENE CLASSIFICATION: A COMPETITION REVIEW
    Gharib, Shayan
    Derrar, Honain
    Niizumi, Daisuke
    Senttula, Tuukka
    Tommola, Janne
    Heittola, Toni
    Virtanen, Tuomas
    Huttunen, Heikki
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [20] Acoustic Event and Scene Classification: A Review
    Manjunath Mulimani
    Spoorthy Venkatesh
    Shashidhar G. Koolagudi
    SN Computer Science, 6 (1)