COMPARISON OF TEXT-INDEPENDENT SPEAKER RECOGNITION METHODS USING VECTOR-QUANTIZATION DISTORTION AND DISCRETE AND CONTINUOUS HMMS

被引:0
|
作者
MATSUI, T
FURUI, S
机构
[1] NTT Human Interface Laboratories, Musashino
关键词
SPEAKER RECOGNITION; TEXT-INDEPENDENT; VECTOR QUANTIZATION; ERGODIC HMM; UTTERANCE VARIATION;
D O I
10.1002/ecjc.4430771207
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The results of speaker recognition methods using vector quantization (VQ) distortion and discrete or continuous ergodic hidden Markov models (HMMs) are compared. The effectiveness of these methods is examined from the viewpoint of robustness against utterance variation such as differences in content, temporal variation, and changes in utterance speed. It is shown that the continuous HMM performs much better than the discrete HMM and its performance is close to that of the VQ distortion method. When the amount of training data is limited, however, the VQ distortion method achieves a better recognition rate than the continuous HMM. The transition information between the states is shown to contribute little to identifying the individual characteristics of a voice. An increase in the number of states or in the number of mixture components in a state both have an equal effect, and recognition performance is almost completely determined by the product of these two numbers.
引用
收藏
页码:63 / 70
页数:8
相关论文
共 50 条
  • [31] Robust Text-independent Speaker recognition with Short Utterances using Gaussian Mixture Models
    Chakroun, Rania
    Frikha, Mondher
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 2204 - 2209
  • [32] A Study on Text-Independent Speaker Recognition Systems in Emotional Conditions Using Different Pattern Recognition Models
    Alluri, K. N. R. K. Raju
    Achanta, Sivanand
    Prasath, Rajendra
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    MINING INTELLIGENCE AND KNOWLEDGE EXPLORATION (MIKE 2016), 2017, 10089 : 66 - 73
  • [33] Text-independent speaker identification system using discrete wavelet transform with linear prediction coding
    Othman Alrusaini
    Khaled Daqrouq
    Journal of Umm Al-Qura University for Engineering and Architecture, 2024, 15 (2): : 112 - 119
  • [34] Comparison of Text Independent Speaker Identification Systems using GMM and i-Vector Methods
    Nayana, P. A.
    Mathew, Dominic
    Thomas, Abraham
    7TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2017), 2017, 115 : 47 - 54
  • [35] Wavelet feature domain adaptive noise reduction using learning algorithm for text-independent speaker recognition
    Lung, Shung-Yung
    PATTERN RECOGNITION, 2007, 40 (09) : 2603 - 2606
  • [36] Feature extracted from wavelet decomposition using biorthogonal Riesz basis for text-independent speaker recognition
    Lung, Shung-Yung
    PATTERN RECOGNITION, 2008, 41 (10) : 3068 - 3070
  • [37] Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm
    M. Mallikarjunan
    P. Karmali Radha
    K. P. Bharath
    Rajesh Kumar Muthu
    Circuits, Systems, and Signal Processing, 2019, 38 : 2810 - 2828
  • [38] Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm
    Mallikarjunan, M.
    Radha, P. Karmali
    Bharath, K. P.
    Muthu, Rajesh Kumar
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (06) : 2810 - 2828
  • [39] Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram
    Ajmera, Pawan K.
    Jadhav, Dattatray V.
    Holambe, Raghunath S.
    PATTERN RECOGNITION, 2011, 44 (10-11) : 2749 - 2759
  • [40] Recognition of Speaker-Independent Isolated Persian Digits Using an Enhanced Vector Quantization Algorithm
    Jamali, Mobin
    Ghafarinia, Vahid
    Montazeri, Mohammad Ali
    2015 SIGNAL PROCESSING AND INTELLIGENT SYSTEMS CONFERENCE (SPIS), 2015, : 164 - 168