Sparse Representation Frameworks for Acoustic Scene Classification

Cited by: 0

Authors:

Tyagi, Akansha [1]

Rajan, Padmanabhan [1]

Affiliations:

[1] Indian Inst Technol, Sch Comp & Elect Engn, Mandi, Himachal Prades, India

Source:

SPEECH AND COMPUTER, SPECOM 2023, PT I | 2023 / Vol. 14338

Keywords:

Acoustic Scene Classification; Sparse Representation Classification; Sparse Auto-Encoder; Sparse Representation

DOI:

10.1007/978-3-031-48309-7_15

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

This work addresses the task of acoustic scene classification (ASC) by using sparse representation frameworks, motivated by the inherent sparseness of audio data. We explore three different sparse representation classification (SRC) frameworks, generating sparse acoustic scene representations. The first two frameworks focus on producing linear and non-linear features respectively. On the other hand, the third framework presents a novel approach: a two-branch deep sparse auto-encoder (DSAE) representation framework that generates non-linear and discriminative features. In the proposed framework, the first branch induces sparsity, while the second focuses on enforcing discrimination within the learned sparse acoustic scene representations. These representations are later used to classify the acoustic scene data into different acoustic scene classes. We also compare the performance of the three sparse frameworks by evaluating them on three ASC datasets. Our results indicate that acoustic scene representations based on DSAE outperform the sparse representations obtained from the other two frameworks. This results in an average performance gain of approximately 8% across all the ASC datasets.
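Illustrative note: the abstract does not give the details of the three frameworks, so the following is only a minimal sketch of the generic, textbook form of sparse representation classification (code a sample sparsely over a dictionary of labelled training atoms, then pick the class whose atoms reconstruct it with the smallest residual). It is not the authors' implementation. The names `omp` and `src_classify`, the greedy orthogonal matching pursuit solver, the sparsity level, and the toy 4-dimensional "features" are all assumptions made for illustration.

```python
import numpy as np


def omp(D, x, n_nonzero):
    """Greedy orthogonal matching pursuit (assumed solver, for illustration).

    D: (dim, n_atoms) dictionary with unit-norm columns.
    x: (dim,) sample to encode.
    Returns a coefficient vector with at most n_nonzero non-zero entries.
    """
    residual = x.astype(float).copy()
    support = []
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        correlations = np.abs(D.T @ residual)
        if support:
            correlations[support] = 0.0  # never pick the same atom twice
        k = int(np.argmax(correlations))
        if correlations[k] < 1e-12:
            break  # residual is already (numerically) explained
        support.append(k)
        # Re-fit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    if support:
        coef[support] = sol
    return coef


def src_classify(D, labels, x, n_nonzero=2):
    """Assign x to the class whose atoms give the smallest reconstruction error."""
    coef = omp(D, x, n_nonzero)
    classes = np.unique(labels)
    residuals = []
    for c in classes:
        mask = labels == c
        # Reconstruct x using only the coefficients belonging to class c.
        residuals.append(np.linalg.norm(x - D[:, mask] @ coef[mask]))
    return classes[int(np.argmin(residuals))]


# Toy dictionary (hypothetical data): two atoms per scene class.
D = np.eye(4)
labels = np.array([0, 0, 1, 1])
sample = np.array([0.05, 0.0, 0.9, 0.4])  # mostly spanned by class-1 atoms
predicted = src_classify(D, labels, sample, n_nonzero=2)
```

In a real ASC pipeline the dictionary columns would be feature vectors extracted from training recordings rather than identity columns, and the sparse solver and sparsity level would need to be tuned per dataset.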


Pages: 177-188

Page count: 12

