DOMAIN MISMATCH ROBUST ACOUSTIC SCENE CLASSIFICATION USING CHANNEL INFORMATION CONVERSION

被引:0
|
作者
Mun, Seongkyu [1 ]
Shon, Suwon [2 ]
机构
[1] Naver Corp, Clova AI Res, Seongnam, South Korea
[2] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
acoustic scene classification; factorized hierarchical variational autoencoder; domain adaptation; REPRESENTATIONS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent acoustic scene classification (ASC) research field, training and test device channel mismatch have become an issue for the real world implementation. To address the issue, this paper proposes a channel domain conversion using factorized hierarchical variational autoencoder. Proposed method adapts both the source and target domain to a pre-defined specific domain. Unlike the conventional approach, the relationship between the target and source domain and information of each domain are not required in the adaptation process. Based on the experimental results using the IEEE Detection and Classification of Acoustic Scenes and Event 2018 task 1-B dataset and the baseline system, it is shown that the proposed approach can mitigate the channel mismatching issue of different recording devices.
引用
收藏
页码:845 / 849
页数:5
相关论文
共 50 条
  • [1] A Robust Framework For Acoustic Scene Classification
    Lam Pham
    McLoughlin, Ian
    Huy Phan
    Palaniappan, Ramaswamy
    INTERSPEECH 2019, 2019, : 3634 - 3638
  • [2] Capturing Discriminative Information Using a Deep Architecture in Acoustic Scene Classification
    Shim, Hye-jin
    Jung, Jee-weon
    Kim, Ju-ho
    Yu, Ha-jin
    APPLIED SCIENCES-BASEL, 2021, 11 (18):
  • [3] PROTOTYPICAL NETWORKS FOR DOMAIN ADAPTATION IN ACOUSTIC SCENE CLASSIFICATION
    Singh, Shubhr
    Bear, Helen L.
    Benetos, Emmanouil
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 346 - 350
  • [4] Acoustic Scene Classification Using Spectrograms
    Felipe, Gustavo Zanoni
    da Costa, Yandre Maldonado e Gomes
    Helal, Lucas Georges
    2017 36TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2017,
  • [5] Feature Alignment for Robust Acoustic Scene Classification Across Devices
    Zhao, Jingqiao
    Kong, Qiuqiang
    Song, Xiaoning
    Feng, Zhenhua
    Wu, Xiaojun
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 578 - 582
  • [6] Robust Acoustic Scene Classification in the Presence of Active Foreground Speech
    Song, Siyuan
    Desplanques, Brecht
    De Moor, Celest
    Demuynck, Kris
    Madhu, Nilesh
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 995 - 999
  • [7] Novel Augmentation Schemes for Device Robust Acoustic Scene Classification
    Sonowal, Sukanya
    Tamse, Anish
    INTERSPEECH 2022, 2022, : 4182 - 4186
  • [8] Missing data recovery using autoencoder for multi-channel acoustic scene classification
    Shiroma, Yuki
    Kinoshita, Yuma
    Imoto, Keisuke
    Shiota, Sayaka
    Ono, Nobutaka
    Kiya, Hitoshi
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 767 - 771
  • [9] Acoustic Scene Classification using Audio Tagging
    Jung, Jee-weon
    Shim, Hye-jin
    Kim, Ju-ho
    Kim, Seung-bin
    Yu, Ha-Jin
    INTERSPEECH 2020, 2020, : 1176 - 1180
  • [10] Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework
    Pham, Lam
    Phan, Huy
    Nguyen, Truc
    Palaniappan, Ramaswamy
    Mertins, Alfred
    McLoughlin, Ian
    DIGITAL SIGNAL PROCESSING, 2021, 110