Frame level likelihood normalization for text-independent speaker identification using Gaussian Mixture Models

被引：0

作者：

Markov, K

Nakagawa, S

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we propose a new speaker identification system, where the likelihood normalization technique, widely used for speaker verification, is introduced. In the new system, which is based on Gaussian Mixture Models, every frame of the test utterance is inputed to all the reference models in parallel. In this procedure, for each frame, likelihoods from all the models are available, hence they can be normalized at every frame. A special kind of likelihood normalization, called Weighting Models Rank, is also proposed. Experiments were performed using two databases - TIMIT and NTT. Evaluation results dearly show that frame level likelihood normalization technique is superior to the standard accumulated likelihood approach.

引用

页码：1764 / 1767

页数：4

共 50 条

[21] Text-Independent Speaker Identification Using Vowel Formants
Almaadeed, Noor
Aggoun, Amar
Amira, Abbes
[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (03): : 345 - 356
[22] A two-level classifier for text-independent speaker identification
Hadjitodorov, S
Boyanov, B
Dalakchieva, N
[J]. SPEECH COMMUNICATION, 1997, 21 (03) : 209 - 217
[23] Text-independent Speaker Identification in Birds
Fox, E. J. S.
Roberts, J. D.
Bennamoun, M.
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2122 - 2125
[24] DISTRIBUTED AUTOMATIC TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GMM-UBM SPEAKER MODELS
Chowdhury, Md Foezur Rahman
Selouani, Sid-Ahmed
O'Shaughnessy, Douglas
[J]. 2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, : 1039 - +
[25] Super-Dirichlet Mixture Models using Differential Line Spectral Frequencies for Text-Independent Speaker Identification
Ma, Zhanyu
Leijon, Arne
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2360 - +
[26] Score normalization for text-independent speaker verification systems
Auckenthaler, R
Carey, M
Lloyd-Thomas, H
[J]. DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 42 - 54
[27] A New Score Normalization for Text-Independent Speaker Verification
Ning, Hongke
Zou, Y. X.
Hu, Xuyan
[J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 636 - 639
[28] Text-independent speaker identification using fenonic speaker Markov modeling
Birnbaum, M
Brown, KL
Bardenhagen, S
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 677 - 680
[29] Speaker verification using frame and utterance level likelihood normalization
Nakagawa, S
Markov, KP
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1087 - 1090
[30] A novel text-independent speaker identification method based on common Gaussian bases
Hao, Chen
Zhao, Rongchun
[J]. 2005 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2005, : 72 - 78

← 1 2 3 4 5 →