Speaker identification using hybrid Karhunen-Loeve transform and Gaussian mixture model approach

被引:3
|
作者
Chen, CCT [1 ]
Chen, CT [1 ]
Hou, CK [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
关键词
Karhunen-Loeve transform; Bhattacharyya distance; Gaussian mixture models; speaker identification; Mel frequency cepstral coefficients;
D O I
10.1016/j.patcog.2003.08.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a classification scheme that incorporates Karhunen-Loeve transform (KLT) and Gaussian mixture model (GMM) for text-independent speaker identification. Our results show that the combination is beneficial to both classification accuracy and computational cost. For a database with 500 Mandarin speakers, it is demonstrated that accuracy improvement of up to 4% and computational cost saving of 10 times compared to those of the conventional GMM model can be achieved. (C) 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1073 / 1075
页数:3
相关论文
共 50 条
  • [41] Modification of Karhunen-Loeve transform for pattern recognition
    Barat, P
    Roy, A
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1998, 23 (4): : 341 - 350
  • [42] SOME ASPECTS OF THE FAST KARHUNEN-LOEVE TRANSFORM
    KITAJIMA, H
    SHIMONO, T
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1980, 28 (09) : 1773 - 1776
  • [43] Model order selection for the singular value decomposition and the discrete Karhunen-Loeve transform using a Bayesian approach
    Rajan, JJ
    Rayner, PJW
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1997, 144 (02): : 116 - 123
  • [44] Nonlinear System Identification: An Effective Framework Based on the Karhunen-Loeve Transform
    Turchetti, Claudio
    Biagetti, Giorgio
    Gianfelici, Francesco
    Crippa, Paolo
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (02) : 536 - 550
  • [45] NEURAL MODEL FOR KARHUNEN-LOEVE TRANSFORM WITH APPLICATION TO ADAPTIVE IMAGE COMPRESSION
    ABBAS, HM
    FAHMY, MM
    IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1993, 140 (02): : 135 - 143
  • [46] KARHUNEN-LOEVE DECOMPOSITION OF GAUSSIAN MEASURES ON BANACH SPACES
    Bay, Xavier
    Croix, Jean-Charles
    PROBABILITY AND MATHEMATICAL STATISTICS-POLAND, 2019, 39 (02): : 279 - 297
  • [47] Improving Karhunen-Loeve based transform coding by using square isometries
    Breazu, M
    Volovici, D
    Mihu, IZ
    Brad, R
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 1881 - 1885
  • [48] Monitoring of signals from manufacturing processes using the Karhunen-Loeve transform
    Tumer, IY
    Wood, KL
    Busch-Vishniac, IJ
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2000, 14 (06) : 1011 - 1026
  • [49] Uncovering correlated variability in epigenomic datasets using the Karhunen-Loeve transform
    Madrigal, Pedro
    Krajewski, Pawel
    BIODATA MINING, 2015, 8
  • [50] Multispectral data restoration by the wavelet Karhunen-Loeve transform
    Starck, JL
    Querre, P
    SIGNAL PROCESSING, 2001, 81 (12) : 2449 - 2459