COMPARISON OF TEXT-INDEPENDENT SPEAKER RECOGNITION METHODS USING VECTOR-QUANTIZATION DISTORTION AND DISCRETE AND CONTINUOUS HMMS

被引:0
|
作者
MATSUI, T
FURUI, S
机构
[1] NTT Human Interface Laboratories, Musashino
关键词
SPEAKER RECOGNITION; TEXT-INDEPENDENT; VECTOR QUANTIZATION; ERGODIC HMM; UTTERANCE VARIATION;
D O I
10.1002/ecjc.4430771207
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The results of speaker recognition methods using vector quantization (VQ) distortion and discrete or continuous ergodic hidden Markov models (HMMs) are compared. The effectiveness of these methods is examined from the viewpoint of robustness against utterance variation such as differences in content, temporal variation, and changes in utterance speed. It is shown that the continuous HMM performs much better than the discrete HMM and its performance is close to that of the VQ distortion method. When the amount of training data is limited, however, the VQ distortion method achieves a better recognition rate than the continuous HMM. The transition information between the states is shown to contribute little to identifying the individual characteristics of a voice. An increase in the number of states or in the number of mixture components in a state both have an equal effect, and recognition performance is almost completely determined by the product of these two numbers.
引用
收藏
页码:63 / 70
页数:8
相关论文
共 50 条
  • [1] Comparison of text-independent speaker recognition methods using vector-quantization distortion and discrete and continuous HMMs
    Matsui, Tomoko
    Furui, Sadaoki
    Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi), 1994, 77 (12): : 63 - 69
  • [2] Comparison of Text-Independent Speaker Recognition Methods Using VQ-Distortion and Discrete/Continuous HMM's
    Matsui, Tomoko
    Furui, Sadaoki
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 456 - 459
  • [3] Learning Vector Quantization in text-independent Automatic Speaker Recognition
    Filgueiras, TE
    Messina, RO
    Cabral, EF
    VTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, PROCEEDINGS, 1998, : 135 - 139
  • [4] On Text-independent Speaker Recognition via Improved Vector Quantization Method
    Liu Ting-ting
    Guan Sheng-xiao
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 3912 - 3916
  • [5] Text-independent speaker recognition using support vector machine
    Hou, FL
    Wang, BX
    2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C402 - C407
  • [6] Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition System
    Omer, Ala Eldin
    2017 INTERNATIONAL CONFERENCE ON COMMUNICATION, CONTROL, COMPUTING AND ELECTRONICS ENGINEERING (ICCCCEE), 2017,
  • [7] Ensemble of Support Vector Machine for Text-Independent Speaker Recognition
    Lei, Zhenchun
    Yang, Yingchun
    Wu, Zhaohui
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (5A): : 163 - 167
  • [8] Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch
    vanVuuren, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1788 - 1791
  • [10] Text-independent speaker recognition using graph matching
    Hautamaki, Ville
    Kinnunen, Tomi
    Franti, Pasi
    PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1427 - 1432