Sparse Representation Frameworks for Acoustic Scene Classification

被引:0
|
作者
Tyagi, Akansha [1 ]
Rajan, Padmanabhan [1 ]
机构
[1] Indian Inst Technol, Sch Comp & Elect Engn, Mandi, Himachal Prades, India
来源
关键词
Acoustic Scene Classification; Sparse Representation Classification; Sparse Auto-Encoder; Sparse Representation;
D O I
10.1007/978-3-031-48309-7_15
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work addresses the task of acoustic scene classification (ASC) by using sparse representation frameworks, motivated by the inherent sparseness of audio data. We explore three different sparse representation classification (SRC) frameworks, generating sparse acoustic scene representations. The first two frameworks focus on producing linear and non-linear features respectively. On the other hand, the third framework presents a novel approach-a two-branch deep sparse auto-encoder (DSAE) representation framework that generates non-linear and discriminative features. In the proposed framework, the first branch induces sparsity, while the second focuses on enforcing discrimination within the learned sparse acoustic scene representations. These representations are later used to classify the acoustic scene data into different acoustic scene classes. We also compare the performance of the three sparse frameworks by evaluating them on three ASC datasets. Our results indicate that acoustic scene representations based on DSAE outperform the sparse representations obtained from the other two frameworks. This results in an average performance gain of approximately 8% across all the ASC datasets.
引用
收藏
页码:177 / 188
页数:12
相关论文
共 50 条
  • [1] Acoustic Scene Classification using Binaural Representation and Classifier Combination
    Arabnezhad, Fatemeh
    Nasersharif, Babak
    2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 351 - 355
  • [2] TRANSIENT ACOUSTIC SIGNAL CLASSIFICATION USING JOINT SPARSE REPRESENTATION
    Zhang, Haichao
    Nasrabadi, Nasser M.
    Huang, Thomas S.
    Zhang, Yanning
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2220 - 2223
  • [3] Feature Extraction and Scene Classification for Remote Sensing Image Based on Sparse Representation
    Guo, Youliang
    Zhang, Junping
    Zhong, Shengwei
    ALGORITHMS, TECHNOLOGIES, AND APPLICATIONS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY XXV, 2019, 10986
  • [4] SCENE IMAGE CLASSIFICATION USING REDUCED VIRTUAL FEATURE REPRESENTATION IN SPARSE FRAMEWORK
    Sharma, Krishan
    Gupta, Shikha
    Dileep, A. D.
    Rameshan, Renu
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2701 - 2705
  • [5] Acoustic Scene Classification
    Barchiesi, Daniele
    Giannoulis, Dimitrios
    Stowell, Dan
    Plumbley, Mark D.
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (03) : 16 - 34
  • [6] A Deep Scene Representation for Aerial Scene Classification
    Zheng, Xiangtao
    Yuan, Yuan
    Lu, Xiaoqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07): : 4799 - 4809
  • [7] Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification
    Hou, Yuanbo
    Song, Siyang
    Yu, Chuang
    Wang, Wenwu
    Botteldooren, Dick
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1382 - 1386
  • [8] Scene Classification with the Discriminative Representation
    Sun, Hao
    Chen, Yaxiong
    Chen, Wenjing
    Huang, Zhangcan
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 267 - 271
  • [9] ACOUSTIC SCENE CLASSIFICATION USING SPARSE FEATURE LEARNING AND EVENT-BASED POOLING
    Lee, Kyogu
    Hyung, Ziwon
    Nam, Juhan
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [10] Hapto-acoustic Scene Representation
    Ritterbusch, Sebastian
    Constantinescu, Angela
    Koch, Volker
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT II, 2012, 7383 : 644 - 650