LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

Cited by: 2
Authors
Liu, Xuechen [1, 2]
Sahidullah, Md [2 ]
Kinnunen, Tomi [1 ]
Affiliations
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France
Keywords
Speaker Verification; Nonlinear Compression; Multi-Regime Compression; Recognition
DOI
10.1109/ICASSP43922.2022.9747185
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this study, we focus on nonlinear compression methods for spectral features in speaker verification based on deep neural networks. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose a multi-regime (MR) design for the nonlinearities, aimed at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by the proposed compression methods over both the commonly used logarithm and their static counterparts, especially for those based on the power function. While the CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
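To make the contrast between static and channel-dependent compression concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes filterbank energies stored as a (channels, frames) array; the function names, the toy data, and the exponent values are all hypothetical. In a learnable setup the per-channel exponents would be trainable parameters optimized jointly with the network, which this sketch does not show.

```python
import numpy as np


def log_compress(spec, eps=1e-6):
    """Static logarithmic compression (the common baseline)."""
    return np.log(spec + eps)


def power_compress(spec, alpha):
    """Power-law compression spec ** alpha.

    spec  : (channels, frames) array of non-negative energies
    alpha : scalar (static) or (channels,) array (channel-dependent)
    """
    alpha = np.asarray(alpha, dtype=spec.dtype)
    if alpha.ndim == 1:
        # One exponent per channel, broadcast across all frames.
        alpha = alpha[:, None]
    return np.power(spec, alpha)


# Toy 4-channel, 5-frame "spectrogram" (hypothetical data).
rng = np.random.default_rng(0)
spec = rng.uniform(0.1, 10.0, size=(4, 5))

# Static variant: a single exponent shared by every channel.
static_out = power_compress(spec, 0.1)

# Channel-dependent variant: hypothetical per-channel exponents.
cd_alpha = np.array([0.05, 0.10, 0.20, 0.30])
cd_out = power_compress(spec, cd_alpha)

log_out = log_compress(spec)
```

Each row of `cd_out` is compressed with its own exponent, whereas `static_out` applies the same exponent everywhere; `log_out` is the fixed logarithmic baseline for comparison.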
Pages: 7962 - 7966
Page count: 5
Related papers
50 in total
  • [1] Learnable MFCCs for Speaker Verification
    Liu, Xuechen
    Sahidullah, Md
    Kinnunen, Tomi
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [2] Learnable Sparse Filterbank for Speaker Verification
    Peng, Junyi
    Gu, Rongzhi
    Mosner, Ladislav
    Plchot, Oldrich
    Burget, Lukas
    Cernocky, Jan
    INTERSPEECH 2022, 2022, : 5110 - 5114
  • [3] DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
    Yi, Lu
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7662 - 7666
  • [4] Robust speaker identification and verification
    Wang, Jia-Ching
    Yang, Chung-Hsien
    Wang, Jhing-Fa
    Lee, Hsiao-Ping
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2007, 2 (02) : 52 - 59
  • [5] TOWARDS ROBUST SPEAKER VERIFICATION WITH TARGET SPEAKER ENHANCEMENT
    Zhang, Chunlei
    Yu, Meng
    Weng, Chao
    Yu, Dong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6693 - 6697
  • [6] LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification
    Chen, Xing
    Wang, Jie
    Zhang, Xiao-Lei
    Zhang, Wei-Qiang
    Yang, Kunde
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2476 - 2490
  • [7] Robust Speaker Verification for Mobile Transmission
    Manjusha, V.
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 518 - 521
  • [8] Disentangled Speaker and Nuisance Attribute Embedding for Robust Speaker Verification
    Kang, Woo Hyun
    Mun, Sung Hwan
    Han, Min Hyun
    Kim, Nam Soo
    IEEE ACCESS, 2020, 8 : 141838 - 141849
  • [9] A nonlinear autoregressive model for speaker verification
    Srinivasan, Sundararajan
    Ma, Tao
    Lazarou, Georgios
    Picone, Joseph
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 17 - 25
  • [10] Acoustic Factor Analysis for Robust Speaker Verification
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (04) : 842 - 853