LEARNABLE NONLINEAR COMPRESSION FOR ROBUST SPEAKER VERIFICATION

Cited by: 2

Authors:

Liu, Xuechen [1,2]

Sahidullah, Md [2]

Kinnunen, Tomi [1]

Affiliations:

[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland

[2] Univ Lorraine, INRIA, CNRS, LORIA, F-54000 Nancy, France

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Keywords:

Speaker Verification; Nonlinear Compression; Multi-Regime Compression; RECOGNITION;

DOI:

10.1109/ICASSP43922.2022.9747185

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In this study, we focus on nonlinear compression methods for spectral features in speaker verification based on deep neural networks. We consider several channel-dependent (CD) nonlinear compression methods optimized in a data-driven manner. Our methods are based on power nonlinearities and dynamic range compression (DRC). We also propose a multi-regime (MR) design for the nonlinearities, aimed at improving robustness. Results on VoxCeleb1 and VoxMovies data demonstrate improvements from the proposed compression methods over both the commonly used logarithm and their static counterparts, especially for those based on the power function. While the CD generalization improves performance on VoxCeleb1, MR provides more robustness on VoxMovies, with a maximum relative equal error rate reduction of 21.6%.
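
The contrast the abstract draws between static compression and a channel-dependent (CD) power nonlinearity can be illustrated with a small sketch. This is not the authors' implementation: the function names, the toy energies, and the exponent values are all assumptions chosen for illustration, and the exponents are fixed here, whereas in a learnable setup they would be trainable parameters updated together with the network.

```python
import math

def log_compress(spec, floor=1e-10):
    """Static logarithmic compression (the common baseline).

    `floor` is an assumed small constant that avoids log(0).
    """
    return [[math.log(max(v, floor)) for v in frame] for frame in spec]

def power_compress(spec, exponents):
    """Power compression where channel k is raised to exponents[k].

    Passing the same value for every channel gives a static power law;
    passing different values gives a channel-dependent one.
    """
    return [[v ** a for v, a in zip(frame, exponents)] for frame in spec]

# Toy non-negative filterbank energies: 2 frames x 3 channels (made-up values).
spec = [[1.0, 4.0, 16.0],
        [0.25, 9.0, 1.0]]

log_feats = log_compress(spec)
static_feats = power_compress(spec, [0.5, 0.5, 0.5])   # one shared exponent
cd_feats = power_compress(spec, [0.5, 0.5, 0.25])      # per-channel exponents
```

In this sketch only the third channel differs between `static_feats` and `cd_feats`, which shows how a per-channel exponent lets each frequency channel receive its own degree of compression.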

Pages: 7962-7966

Page count: 5
