Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm

被引:5
|
作者
Mallikarjunan, M. [1 ]
Radha, P. Karmali [1 ]
Bharath, K. P. [1 ]
Muthu, Rajesh Kumar [1 ]
机构
[1] Vellore Inst Technol, Sch Elect Engn, Vellore 632014, Tamil Nadu, India
关键词
MFCC; Modified VQ-LBG; Feature extraction; GMM-UBM;
D O I
10.1007/s00034-018-0992-4
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker recognition is the process of identifying the proper speaker by analyzing the spectral shape of the speech signal. This process is done by extracting the desired features and matching the features of the speech signal. In this paper, we adopted the Mel frequency cepstrum coefficient (MFCC) technique for extracting the features from the speaker speech sample. These cepstrum coefficients are named as extracted features. The extracted MFCC features are given as input to the modified vector quantization via Linde-Buzo-Gray (modified VQ-LBG) process and expectation maximization (EM) algorithm. Vector quantization technique is mainly used for feature matching where a separate codebook will be generated for each speaker. The EM algorithm is utilized to develop the Gaussian mixture model-universal background model (GMM-UBM). In GMM-UBM model, k means cluster is summed up to consolidate data about the covariance structure of the information and the focuses of the inert Gaussians. From our analysis, the modified VQ-LBG algorithm gives better performance compared to the GMM-UBM model.
引用
收藏
页码:2810 / 2828
页数:19
相关论文
共 50 条
  • [1] Text-Independent Speaker Recognition in Clean and Noisy Backgrounds Using Modified VQ-LBG Algorithm
    M. Mallikarjunan
    P. Karmali Radha
    K. P. Bharath
    Rajesh Kumar Muthu
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 2810 - 2828
  • [2] Chinese Text Speech Recognition Derived from VQ-LBG Algorithm
    Wang, Xiaojun
    Lai, Weidong
    [J]. AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, 2012, 137 : 303 - 311
  • [3] VQ score normalisation for text-dependent and text-independent speaker recognition
    Finan, RA
    Sapeluk, AT
    Damper, RI
    [J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 211 - 218
  • [4] TEXT-INDEPENDENT SPEAKER RECOGNITION
    ATAL, BS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 52 (01): : 181 - &
  • [5] Text-independent speaker identification in noisy background
    Zhou, Y
    Xu, BL
    [J]. PROGRESS IN NATURAL SCIENCE, 2001, 11 : S384 - S387
  • [7] Adaptive fuzzy wavelet algorithm for text-independent speaker recognition
    Lung, SY
    [J]. PATTERN RECOGNITION, 2004, 37 (10) : 2095 - 2096
  • [8] Text-independent speaker recognition using graph matching
    Hautamaki, Ville
    Kinnunen, Tomi
    Franti, Pasi
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1427 - 1432
  • [9] Effect of Spoken Text on Text-independent Speaker Recognition
    Alsulaiman, Mansour
    [J]. PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 279 - 284
  • [10] Comparison of Text-Independent Speaker Recognition Methods Using VQ-Distortion and Discrete/Continuous HMM's
    Matsui, Tomoko
    Furui, Sadaoki
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (03): : 456 - 459