Feature Alignment for Robust Acoustic Scene Classification Across Devices

被引:4
|
作者
Zhao, Jingqiao [1 ]
Kong, Qiuqiang [2 ]
Song, Xiaoning [1 ]
Feng, Zhenhua [2 ]
Wu, Xiaojun [1 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金;
关键词
Training; Acoustics; Performance evaluation; Task analysis; Kernel; Hidden Markov models; Feature extraction; Acoustic scene classification; domain adaption; feature alignment;
D O I
10.1109/LSP.2022.3145336
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter presents a feature alignment method for domain adaptive Acoustic Scene Classification (ASC) across recording devices. First, we design a two-stream network, in which each stream processes two features, i.e., Log-Mel spectrogram and delta-deltas, using two sub-networks. Second, we investigate different loss functions for feature alignment between the feature maps obtained by the source and target domains. Last, we present an alternate training strategy to deal with the data imbalance problem between paired and unpaired samples. The experimental results obtained on the DCASE benchmarks demonstrate the effectiveness and superiority of the proposed method. The source code of the proposed method is available at https://github.com/Jingqiao-Zhao/FAASC.
引用
收藏
页码:578 / 582
页数:5
相关论文
共 50 条
  • [21] FEATURE PROJECTION-BASED UNSUPERVISED DOMAIN ADAPTATION FOR ACOUSTIC SCENE CLASSIFICATION
    Mezza, Alessandro Ilic
    Habets, Emanuel A. P.
    Mueller, Meinard
    Sarti, Augusto
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [22] Adaptive time-frequency feature resolution network for acoustic scene classification
    Zhang, Tao
    Liang, Jinhua
    Feng, Guoqing
    APPLIED ACOUSTICS, 2022, 195
  • [23] LARGE-SCALE AUDIO FEATURE EXTRACTION AND SVM FOR ACOUSTIC SCENE CLASSIFICATION
    Geiger, Juergen T.
    Schuller, Bjoern
    Rigoll, Gerhard
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [24] ACOUSTIC SCENE CLASSIFICATION WITH MISMATCHED RECORDING DEVICES USING MIXTURE OF EXPERTS LAYER
    Truc Nguyen
    Pernkopf, Franz
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1666 - 1671
  • [25] A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION
    Hu, Hu
    Yang, Chao-Han Huck
    Xia, Xianjun
    Bai, Xue
    Tang, Xin
    Wang, Yajian
    Niu, Shutong
    Chai, Li
    Li, Juanjuan
    Zhu, Hongning
    Bao, Feng
    Zhao, Yuanjun
    Siniscalchi, Sabato Marco
    Wang, Yannan
    Du, Jun
    Lee, Chin-Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 845 - 849
  • [26] Acoustic Scene Classification with Mismatched Devices Using CliqueNets and Mixup Data Augmentation
    Nguyen, Truc
    Pernkopf, Franz
    INTERSPEECH 2019, 2019, : 2330 - 2334
  • [27] DOMAIN MISMATCH ROBUST ACOUSTIC SCENE CLASSIFICATION USING CHANNEL INFORMATION CONVERSION
    Mun, Seongkyu
    Shon, Suwon
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 845 - 849
  • [28] A Robust Feature Extraction Algorithm for the Classification of Acoustic Targets in Wild Environments
    Jingchang Huang
    Shiliang Xiao
    Qianwei Zhou
    Feng Guo
    Xing You
    Haiyan Li
    Baoqing Li
    Circuits, Systems, and Signal Processing, 2015, 34 : 2395 - 2406
  • [29] ROBUST ACOUSTIC FEATURE EXTRACTION FOR SOUND CLASSIFICATION BASED ON NOISE REDUCTION
    Ye, Jiaxing
    Kobayashi, Takumi
    Murakawa, Masahiro
    Higuchi, Tetsuya
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [30] A Robust Feature Extraction Algorithm for the Classification of Acoustic Targets in Wild Environments
    Huang, Jingchang
    Xiao, Shiliang
    Zhou, Qianwei
    Guo, Feng
    You, Xing
    Li, Haiyan
    Li, Baoqing
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (07) : 2395 - 2406