Investigation of Different Calibration Methods for Deep Speaker Embedding Based Verification Systems

被引:0
|
作者
Novoselov, Sergey [1 ]
Lavrentyeva, Galina [1 ]
Volokhov, Vladimir [1 ,2 ]
Volkova, Marina [1 ,2 ]
Khmelev, Nikita [1 ,2 ]
Akulov, Artem [1 ,2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] STC Ltd, St Petersburg, Russia
来源
关键词
Speaker verification; Calibration; MagNetO;
D O I
10.1007/978-3-031-48309-7_13
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep speaker embedding extractors have already become new state-of-the-art systems in the speaker verification field. However, the problem of verification score calibration for such systems often remains out of focus. An irrelevant score calibration leads to serious issues, especially in the case of unknown acoustic conditions, even if we use a strong speaker verification system in terms of threshold-free metrics. This paper presents an investigation over several methods of score calibration: a classical approach based on the logistic regression model; the recently presented magnitude estimation network MagNetO that uses activations from the pooling layer of the trained deep speaker extractor and generalization of such approach based on separate scale and offset prediction neural networks. An additional focus of this research is to estimate the impact of score normalization on the calibration performance of the system. The obtained results demonstrate that there are no serious problems if in-domain development data are used for calibration tuning. Otherwise, a trade-off between good calibration performance and threshold-free system quality arises. In most cases using adaptive s-norm helps to stabilize score distributions and to improve system performance.
引用
收藏
页码:159 / 168
页数:10
相关论文
共 50 条
  • [21] Voice-quality Features for Deep Neural Network Based Speaker Verification Systems
    Woubie, Abraham
    Koivisto, Lauri
    Backstrom, Tom
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 176 - 180
  • [22] Data Augmentation using Variational Autoencoder for Embedding based Speaker Verification
    Wu, Zhanghao
    Wang, Shuai
    Qian, Yanmin
    Yu, Kai
    INTERSPEECH 2019, 2019, : 1163 - 1167
  • [23] Mutual Information-based Embedding Decoupling for Generalizable Speaker Verification
    Li, Jianchen
    Han, Jiqing
    Deng, Shiwen
    Zheng, Tieran
    He, Yongjun
    Zheng, Guibin
    INTERSPEECH 2023, 2023, : 3147 - 3151
  • [24] Investigation of Bottleneck Features and Multilingual Deep Neural Networks for Speaker Verification
    Tian, Yao
    Cai, Meng
    He, Liang
    Liu, Jia
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1151 - 1155
  • [25] An Investigation into Direct Scoring Methods without SVM Training in Speaker Verification
    Zhang, Ce
    Zheng, Rong
    Xu, Bo
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1437 - 1440
  • [26] A Study on Angular Based Embedding Learning for Text-independent Speaker Verification
    Chen, Zhiyong
    Ren, Zongze
    Xu, Shugong
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 445 - 449
  • [27] Mitigate the reverberation effect on the speaker verification performance using different methods
    Khamis A. Al-Karawi
    International Journal of Speech Technology, 2021, 24 : 143 - 153
  • [28] Mitigate the reverberation effect on the speaker verification performance using different methods
    Al-Karawif, Khamis A.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (01) : 143 - 153
  • [29] Deep domain adaptation for anti-spoofing in speaker verification systems
    Himawan, Ivan
    Villavicencio, Fernando
    Sridharan, Sridha
    Fookes, Clinton
    COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 377 - 402
  • [30] Channel adaptation based on deep neural networks for speaker verification
    Long Y.
    Ni J.
    Ye H.
    2016, Sichuan University (48): : 151 - 155