Frame level likelihood normalization for text-independent speaker identification using Gaussian Mixture Models

被引:0
|
作者
Markov, K
Nakagawa, S
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose a new speaker identification system, where the likelihood normalization technique, widely used for speaker verification, is introduced. In the new system, which is based on Gaussian Mixture Models, every frame of the test utterance is inputed to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, hence they can be normalized at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. Experiments were performed using two databases - TIMIT and NTT. Evaluation results dearly show that frame level likelihood normalization technique is superior to the standard accumulated likelihood approach.
引用
收藏
页码:1764 / 1767
页数:4
相关论文
共 50 条
  • [21] Text-Independent Speaker Identification Using Vowel Formants
    Almaadeed, Noor
    Aggoun, Amar
    Amira, Abbes
    [J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (03): : 345 - 356
  • [22] A two-level classifier for text-independent speaker identification
    Hadjitodorov, S
    Boyanov, B
    Dalakchieva, N
    [J]. SPEECH COMMUNICATION, 1997, 21 (03) : 209 - 217
  • [23] Text-independent Speaker Identification in Birds
    Fox, E. J. S.
    Roberts, J. D.
    Bennamoun, M.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2122 - 2125
  • [24] DISTRIBUTED AUTOMATIC TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM SPEAKER MODELS
    Chowdhury, Md Foezur Rahman
    Selouani, Sid-Ahmed
    O'Shaughnessy, Douglas
    [J]. 2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 1039 - +
  • [25] Super-Dirichlet Mixture Models using Differential Line Spectral Frequencies for Text-Independent Speaker Identification
    Ma, Zhanyu
    Leijon, Arne
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2360 - +
  • [26] Score normalization for text-independent speaker verification systems
    Auckenthaler, R
    Carey, M
    Lloyd-Thomas, H
    [J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 42 - 54
  • [27] A New Score Normalization for Text-Independent Speaker Verification
    Ning, Hongke
    Zou, Y. X.
    Hu, Xuyan
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 636 - 639
  • [28] Text-independent speaker identification using fenonic speaker Markov modeling
    Birnbaum, M
    Brown, KL
    Bardenhagen, S
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 677 - 680
  • [29] Speaker verification using frame and utterance level likelihood normalization
    Nakagawa, S
    Markov, KP
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1087 - 1090
  • [30] A novel text-independent speaker identification method based on common Gaussian bases
    Hao, Chen
    Zhao, Rongchun
    [J]. 2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 72 - 78