LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

被引:2
|
作者
Liu, Xuechen [1 ,2 ]
Sahidullah, Md [2 ]
Kinnunen, Tomi [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France
关键词
Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;
D O I
10.1109/ICASSP43922.2022.9747185
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study, we focus on nonlinear compression methods in spectral features for speaker verification based on deep neural network. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose multi-regime (MR) design on the nonlinearities, at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by proposed compression methods over both the commonly-used logarithm and their static counterparts, especially for ones based on power function. While CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
引用
收藏
页码:7962 / 7966
页数:5
相关论文
共 50 条
  • [31] Robust Speaker Verification using Self Organizing Map
    Das, Pranab
    Bhatacharjee, Utpal
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT, 2014,
  • [32] Robust Methods for Text-Dependent Speaker Verification
    Bhukya, Ramesh K.
    Prasanna, S. R. Mahadeva
    Sarma, Biswajit Dev
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (11) : 5253 - 5288
  • [33] Robust Session Variability Compensation for SVM Speaker Verification
    Seo, Hyunson
    Jung, Chi-Sang
    Kang, Hong-Goo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (06): : 1631 - 1641
  • [34] Improved Multitaper PNCC Feature for Robust Speaker Verification
    Liu, Yi
    He, Liang
    Liu, Jia
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 168 - 172
  • [35] Robust Speaker Verification Under Additive Noise Condition
    Zhang E.-H.
    Wang M.-H.
    Tang Z.-M.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (06): : 1244 - 1250
  • [36] Robust Methods for Text-Dependent Speaker Verification
    Ramesh K. Bhukya
    S. R. Mahadeva Prasanna
    Biswajit Dev Sarma
    Circuits, Systems, and Signal Processing, 2019, 38 : 5253 - 5288
  • [37] Feature recovery for noise-robust speaker verification
    Huang, Houjun
    Xu, Yunfei
    Zhou, Ruohua
    Yan, Yonghong
    ELECTRONICS LETTERS, 2015, 51 (18) : 1459 - 1461
  • [38] Gradient Regularization for Noise-Robust Speaker Verification
    Li, Jianchen
    Han, Jiqing
    Song, Hongwei
    INTERSPEECH 2021, 2021, : 1074 - 1078
  • [39] Refining Cosine Distance Features for Robust Speaker Verification
    Balasingam, M. D.
    Kumar, C. Santhosh
    PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION AND SIGNAL PROCESSING (ICCSP), 2018, : 152 - 155
  • [40] A Fused Speech Enhancement Framework for Robust Speaker Verification
    Wu, Yanfeng
    Li, Taihao
    Zhao, Junan
    Wang, Qirui
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 883 - 887