Feature Alignment for Robust Acoustic Scene Classification Across Devices

Cited by: 4

Authors:

Zhao, Jingqiao [1]

Kong, Qiuqiang [2]

Song, Xiaoning [1]

Feng, Zhenhua [2]

Wu, Xiaojun [1]

Affiliations:

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England

Source:

IEEE SIGNAL PROCESSING LETTERS | 2022 / Vol. 29

Funding:

National Natural Science Foundation of China;

Keywords:

Training; Acoustics; Performance evaluation; Task analysis; Kernel; Hidden Markov models; Feature extraction; Acoustic scene classification; domain adaption; feature alignment;

DOI:

10.1109/LSP.2022.3145336

Chinese Library Classification (CLC):

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This letter presents a feature alignment method for domain adaptive Acoustic Scene Classification (ASC) across recording devices. First, we design a two-stream network, in which each stream processes two features, i.e., Log-Mel spectrogram and delta-deltas, using two sub-networks. Second, we investigate different loss functions for feature alignment between the feature maps obtained by the source and target domains. Last, we present an alternate training strategy to deal with the data imbalance problem between paired and unpaired samples. The experimental results obtained on the DCASE benchmarks demonstrate the effectiveness and superiority of the proposed method. The source code of the proposed method is available at https://github.com/Jingqiao-Zhao/FAASC.
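The abstract names three ingredients: delta-delta features derived from a Log-Mel spectrogram, a loss that aligns source-device and target-device feature maps, and a training schedule that alternates between paired and unpaired samples. The sketch below illustrates each in plain Python. It is not the authors' implementation (that lives in the FAASC repository linked above). The central-difference delta, the mean-squared-error loss, the every-other-step schedule, and all function names are assumptions chosen for illustration; the letter itself compares several loss functions.

```python
# Illustrative sketch only; names and choices here are assumptions,
# not taken from the paper or its source code.

def delta(frames):
    """Temporal difference of a feature matrix with edge padding.

    `frames` is a list of time frames, each a list of per-band values.
    Uses the central difference (x[t+1] - x[t-1]) / 2; audio toolkits
    usually fit a wider regression window instead.
    """
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]       # repeat first frame at the start
        nxt = frames[min(t + 1, n - 1)]    # repeat last frame at the end
        out.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return out


def delta_delta(frames):
    """Second-order difference: the delta of the delta."""
    return delta(delta(frames))


def alignment_loss(source_feats, target_feats):
    """Mean squared error between paired source and target feature maps."""
    total, count = 0.0, 0
    for s_row, t_row in zip(source_feats, target_feats):
        for s, t in zip(s_row, t_row):
            total += (s - t) ** 2
            count += 1
    return total / count


def batch_kind(step, paired_every=2):
    """Hypothetical schedule: every `paired_every`-th step uses paired data."""
    return "paired" if step % paired_every == 0 else "unpaired"


# Toy 3-frame, 2-band "Log-Mel" input for demonstration.
log_mel = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
features = (log_mel, delta_delta(log_mel))  # the two inputs of one stream
```

In a real system the two feature matrices would feed separate sub-networks, and the alignment loss would be computed on intermediate feature maps from recordings of the same scene captured by different devices.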


Pages: 578-582

Number of pages: 5
