DOMAIN MISMATCH ROBUST ACOUSTIC SCENE CLASSIFICATION USING CHANNEL INFORMATION CONVERSION

被引：0

作者：

Mun, Seongkyu ^{[1
]}

Shon, Suwon ^{[2
]}

机构：

[1] Naver Corp, Clova AI Res, Seongnam, South Korea

[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年

关键词：

acoustic scene classification; factorized hierarchical variational autoencoder; domain adaptation; REPRESENTATIONS;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In recent acoustic scene classification (ASC) research field, training and test device channel mismatch have become an issue for the real world implementation. To address the issue, this paper proposes a channel domain conversion using factorized hierarchical variational autoencoder. Proposed method adapts both the source and target domain to a pre-defined specific domain. Unlike the conventional approach, the relationship between the target and source domain and information of each domain are not required in the adaptation process. Based on the experimental results using the IEEE Detection and Classification of Acoustic Scenes and Event 2018 task 1-B dataset and the baseline system, it is shown that the proposed approach can mitigate the channel mismatching issue of different recording devices.

引用

页码：845 / 849

页数：5

共 50 条

[1] A Robust Framework For Acoustic Scene Classification
Lam Pham
McLoughlin, Ian
Huy Phan
Palaniappan, Ramaswamy
INTERSPEECH 2019, 2019, : 3634 - 3638
[2] Capturing Discriminative Information Using a Deep Architecture in Acoustic Scene Classification
Shim, Hye-jin
Jung, Jee-weon
Kim, Ju-ho
Yu, Ha-jin
APPLIED SCIENCES-BASEL, 2021, 11 (18):
[3] PROTOTYPICAL NETWORKS FOR DOMAIN ADAPTATION IN ACOUSTIC SCENE CLASSIFICATION
Singh, Shubhr
Bear, Helen L.
Benetos, Emmanouil
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 346 - 350
[4] Acoustic Scene Classification Using Spectrograms
Felipe, Gustavo Zanoni
da Costa, Yandre Maldonado e Gomes
Helal, Lucas Georges
2017 36TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2017,
[5] Feature Alignment for Robust Acoustic Scene Classification Across Devices
Zhao, Jingqiao
Kong, Qiuqiang
Song, Xiaoning
Feng, Zhenhua
Wu, Xiaojun
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 578 - 582
[6] Robust Acoustic Scene Classification in the Presence of Active Foreground Speech
Song, Siyuan
Desplanques, Brecht
De Moor, Celest
Demuynck, Kris
Madhu, Nilesh
29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 995 - 999
[7] Novel Augmentation Schemes for Device Robust Acoustic Scene Classification
Sonowal, Sukanya
Tamse, Anish
INTERSPEECH 2022, 2022, : 4182 - 4186
[8] Missing data recovery using autoencoder for multi-channel acoustic scene classification
Shiroma, Yuki
Kinoshita, Yuma
Imoto, Keisuke
Shiota, Sayaka
Ono, Nobutaka
Kiya, Hitoshi
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 767 - 771
[9] Acoustic Scene Classification using Audio Tagging
Jung, Jee-weon
Shim, Hye-jin
Kim, Ju-ho
Kim, Seung-bin
Yu, Ha-Jin
INTERSPEECH 2020, 2020, : 1176 - 1180
[10] Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework
Pham, Lam
Phan, Huy
Nguyen, Truc
Palaniappan, Ramaswamy
Mertins, Alfred
McLoughlin, Ian
DIGITAL SIGNAL PROCESSING, 2021, 110

← 1 2 3 4 5 →