Speaker verification using frame and utterance level likelihood normalization

被引:0
|
作者
Nakagawa, S
Markov, KP
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new method, where the likelihood normalization technique is applied at both the frame and utterance levels. In this method based on Gaussian Mixture Models (GMM), every frame of the test utterance is inputed to the claimed and all background speaker models in parallel. In this procedure, for each frame, likelihoods from all the background models are available, hence they can be used for normalization of the claimed speaker likelihood at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. We have evaluated our method using two databases - TIMIT and NTT. Results show that the combination of frame and utterance level likelihood normalization in some cases reduces the equal error rate (EER) more than twice.
引用
收藏
页码:1087 / 1090
页数:4
相关论文
共 50 条
  • [21] Speaker adaptive training: A maximum likelihood approach to speaker normalization
    Anastasakos, T
    McDonough, J
    Makhoul, J
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1043 - 1046
  • [22] Brief Review of Short Utterance Speaker Verification Systems
    Nirmal, Asmita
    Jayaswal, Deepak
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 419 - 426
  • [23] Compensating Utterance Information in Fixed Phrase Speaker Verification
    Das, Rohan Kumar
    Madhavi, Maulik
    Li, Haizhou
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1708 - 1712
  • [24] Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques
    Kanagasundaram, A.
    Dean, D.
    Sridharan, S.
    Gonzalez-Dominguez, J.
    Gonzalez-Rodriguez, J.
    Ramos, D.
    [J]. SPEECH COMMUNICATION, 2014, 59 : 69 - 82
  • [25] Speaker verification using speaker- and test-dependent fast score normalization
    Ramos-Castro, Daniel
    Fierrez-Aguilar, Julian
    Gonzalez-Rodriguez, Joaquin
    Ortega-Garcia, Javier
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (01) : 90 - 98
  • [26] Speaker Verification from Short Utterance Perspective: A Review
    Das, Rohan Kumar
    Prasanna, S. R. Mahadeva
    [J]. IETE TECHNICAL REVIEW, 2018, 35 (06) : 599 - 617
  • [27] Improving Short Utterance based I-vector Speaker Recognition using Source and Utterance-Duration Normalization Techniques
    Kanagasundaram, A.
    Dean, D.
    Gonzalez-Dominguez, J.
    Sridharan, S.
    Ramos, D.
    Gonzalez-Rodriguez, J.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2464 - 2468
  • [28] Speaker verification using normalized log-likelihood score
    Liu, CS
    Wang, HC
    Lee, CH
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (01): : 56 - 60
  • [29] A proposed likelihood transformation for speaker verification
    Tran, D
    Wagner, M
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1069 - 1072
  • [30] A new cohort normalization using local acoustic information for speaker verification
    Isobe, T
    Takahashi, J
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 841 - 844