Speaker verification using frame and utterance level likelihood normalization

Authors
Nakagawa, S
Markov, KP
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In this paper, we propose a new method in which likelihood normalization is applied at both the frame level and the utterance level. The method is based on Gaussian Mixture Models (GMMs): every frame of the test utterance is input to the claimed speaker model and to all background speaker models in parallel. Because the likelihoods from all background models are therefore available for each frame, they can be used to normalize the claimed speaker's likelihood at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. We evaluated the method on two databases, TIMIT and NTT. The results show that combining frame-level and utterance-level likelihood normalization reduces the equal error rate (EER) by more than a factor of two in some cases.
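The two normalization levels described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no formulas, so the choice of normalizer (the log of the mean background likelihood), the diagonal-covariance GMMs, the per-frame averaging, and all function names here are assumptions made for illustration. Weighting Models Rank is not defined in the abstract and is not shown.

```python
import numpy as np


def logsumexp(a, axis):
    """Numerically stable log(sum(exp(a))) along one axis."""
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def gmm_frame_loglik(frames, weights, means, variances):
    """Per-frame log-likelihood under a diagonal-covariance GMM (assumed form).

    frames: (T, D); weights: (K,); means, variances: (K, D). Returns (T,).
    """
    diff = frames[:, None, :] - means[None, :, :]  # (T, K, D)
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )  # (T, K)
    return logsumexp(np.log(weights) + log_gauss, axis=1)


def frame_normalized_score(frames, claimed, background):
    """Normalize the claimed model's log-likelihood at every frame.

    Assumption: the normalizer is the log of the mean background likelihood
    for that frame; the normalized values are then averaged over frames.
    """
    claimed_ll = gmm_frame_loglik(frames, *claimed)  # (T,)
    bg_ll = np.stack([gmm_frame_loglik(frames, *m) for m in background])  # (B, T)
    bg_norm = logsumexp(bg_ll, axis=0) - np.log(len(background))  # (T,)
    return float(np.mean(claimed_ll - bg_norm))


def utterance_normalized_score(frames, claimed, background):
    """Normalize once per utterance, using per-frame average log-likelihoods.

    Assumption: length-normalized utterance scores, same normalizer as above.
    """
    claimed_ll = np.mean(gmm_frame_loglik(frames, *claimed))
    bg_ll = np.array([np.mean(gmm_frame_loglik(frames, *m)) for m in background])
    bg_norm = logsumexp(bg_ll, axis=0) - np.log(len(background))
    return float(claimed_ll - bg_norm)
```

Each model is passed as a hypothetical `(weights, means, variances)` tuple. In this sketch a claim would be accepted when the score exceeds a threshold tuned on held-out data; how the paper combines the two levels into one decision is not stated in the abstract.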
Pages: 1087-1090
Page count: 4