Feature Alignment for Robust Acoustic Scene Classification Across Devices

被引:4
|
作者
Zhao, Jingqiao [1 ]
Kong, Qiuqiang [2 ]
Song, Xiaoning [1 ]
Feng, Zhenhua [2 ]
Wu, Xiaojun [1 ]
机构
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Univ Surrey, Sch Comp Sci & Elect Engn, Guildford GU2 7XH, Surrey, England
基金
中国国家自然科学基金;
关键词
Training; Acoustics; Performance evaluation; Task analysis; Kernel; Hidden Markov models; Feature extraction; Acoustic scene classification; domain adaption; feature alignment;
D O I
10.1109/LSP.2022.3145336
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter presents a feature alignment method for domain adaptive Acoustic Scene Classification (ASC) across recording devices. First, we design a two-stream network, in which each stream processes two features, i.e., Log-Mel spectrogram and delta-deltas, using two sub-networks. Second, we investigate different loss functions for feature alignment between the feature maps obtained by the source and target domains. Last, we present an alternate training strategy to deal with the data imbalance problem between paired and unpaired samples. The experimental results obtained on the DCASE benchmarks demonstrate the effectiveness and superiority of the proposed method. The source code of the proposed method is available at https://github.com/Jingqiao-Zhao/FAASC.
引用
收藏
页码:578 / 582
页数:5
相关论文
共 50 条
  • [1] Acoustic Scene Classification Across Cities and Devices via Feature Disentanglement
    Tan, Yizhou
    Ai, Haojun
    Li, Shengchen
    Plumbley, Mark D.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1286 - 1297
  • [2] A Robust Framework For Acoustic Scene Classification
    Lam Pham
    McLoughlin, Ian
    Huy Phan
    Palaniappan, Ramaswamy
    INTERSPEECH 2019, 2019, : 3634 - 3638
  • [3] Multi-level distance embedding learning for robust acoustic scene classification with unseen devices
    Jiang, Gang
    Ma, Zhongchen
    Mao, Qirong
    Zhang, Jianming
    PATTERN ANALYSIS AND APPLICATIONS, 2023, 26 (03) : 1089 - 1099
  • [4] Multi-level distance embedding learning for robust acoustic scene classification with unseen devices
    Gang Jiang
    Zhongchen Ma
    Qirong Mao
    Jianming Zhang
    Pattern Analysis and Applications, 2023, 26 (3) : 1089 - 1099
  • [5] Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation
    Takeyama, Saori
    Komatsu, Tatsuya
    Miyazaki, Koichi
    Togami, Masahito
    Ono, Shunsuke
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 36 - 40
  • [6] Feature Extraction of Binaural Recordings for Acoustic Scene Classification
    Zielinski, Slawomir K.
    Lee, Hyunkook
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 585 - 588
  • [7] Constrained Learned Feature Extraction for Acoustic Scene Classification
    Zhang, Teng
    Wu, Ji
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (08) : 1216 - 1228
  • [8] ACOUSTIC SCENE CLASSIFICATION WITH MATRIX FACTORIZATION FOR UNSUPERVISED FEATURE LEARNING
    Bisot, Victor
    Serizel, Romain
    Essid, Slim
    Richard, Gael
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6445 - 6449
  • [9] REVERBERATION-BASED FEATURE EXTRACTION FOR ACOUSTIC SCENE CLASSIFICATION
    Markovic, Milos
    Geiger, Juergen
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 781 - 785
  • [10] Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification
    Bisot, Victor
    Serizel, Romain
    Essid, Slim
    Richard, Gael
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1216 - 1229