LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

Cited by: 2

Authors:

Liu, Xuechen [1,2]

Sahidullah, Md [2]

Kinnunen, Tomi [1]

Affiliations:

[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland

[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Keywords:

Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;

DOI:

10.1109/ICASSP43922.2022.9747185

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this study, we focus on nonlinear compression methods for spectral features in speaker verification based on deep neural networks. We consider different kinds of channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose a multi-regime (MR) design for the nonlinearities, aimed at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements brought by the proposed compression methods over both the commonly used logarithm and their static counterparts, especially for the ones based on the power function. While the CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
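To make the contrast between the logarithm, a static power nonlinearity, and a channel-dependent one concrete, here is a minimal sketch. It is not the authors' implementation: the function names, the exponent values, and the toy frame are illustrative assumptions, and the exponents are fixed here, whereas the abstract describes them as optimized in a data-driven manner.

```python
import math

def log_compress(frame, eps=1e-6):
    """Baseline: logarithmic compression of one frame of non-negative
    spectral energies. The eps floor is an assumption to avoid log(0)."""
    return [math.log(x + eps) for x in frame]

def power_compress(frame, alphas):
    """Power-law compression with one exponent per frequency channel.

    frame  -- non-negative spectral energies, one value per channel
    alphas -- exponents, one per channel (hypothetical values; in a
              learnable setup these would be trainable parameters)
    """
    if len(frame) != len(alphas):
        raise ValueError("need exactly one exponent per channel")
    return [x ** a for x, a in zip(frame, alphas)]

# Toy frame with four channels (illustrative numbers only).
frame = [0.0, 1.0, 16.0, 81.0]

# Static variant: the same exponent is shared by every channel.
static_out = power_compress(frame, [0.5] * len(frame))

# Channel-dependent variant: each channel has its own exponent.
cd_out = power_compress(frame, [0.5, 0.5, 0.25, 0.25])

log_out = log_compress(frame)
```

In this sketch the static variant is simply the special case in which every channel shares one exponent, which is the sense in which the channel-dependent form generalizes it.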


Pages: 7962-7966

Page count: 5
