Speaker identification using hybrid Karhunen-Loeve transform and Gaussian mixture model approach

Cited by: 3
Authors
Chen, CCT [1]
Chen, CT [1]
Hou, CK [1]
Affiliation
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
Keywords
Karhunen-Loeve transform; Bhattacharyya distance; Gaussian mixture models; speaker identification; Mel frequency cepstral coefficients
DOI
10.1016/j.patcog.2003.08.013
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a classification scheme that incorporates Karhunen-Loeve transform (KLT) and Gaussian mixture model (GMM) for text-independent speaker identification. Our results show that the combination is beneficial to both classification accuracy and computational cost. For a database with 500 Mandarin speakers, it is demonstrated that accuracy improvement of up to 4% and computational cost saving of 10 times compared to those of the conventional GMM model can be achieved. © 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
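The abstract does not say how the KLT and the GMM are combined, so the following is only a rough sketch of one plausible reading, not the authors' method. It assumes the KLT is estimated from feature vectors (for example MFCC frames) and applied before scoring with a diagonal-covariance GMM per speaker. All function names (`fit_klt`, `apply_klt`, `gmm_avg_log_likelihood`, `identify`) and the model layout are hypothetical, and the Bhattacharyya distance listed in the keywords is not modeled here.

```python
import numpy as np

def fit_klt(features, n_components):
    """Estimate a KLT basis from feature vectors (rows are frames).

    Assumption: the KLT is taken as the eigenvectors of the sample
    covariance, ordered by decreasing eigenvalue.
    """
    mean = features.mean(axis=0)
    cov = np.cov(features - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order]

def apply_klt(features, mean, basis):
    """Project centered feature vectors onto the retained KLT basis."""
    return (features - mean) @ basis

def gmm_avg_log_likelihood(x, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    x: (T, D) frames; weights: (M,); means, variances: (M, D).
    """
    diff = x[:, None, :] - means[None, :, :]
    log_comp = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )
    log_weighted = log_comp + np.log(weights)
    # Log-sum-exp over mixture components for numerical stability.
    peak = log_weighted.max(axis=1, keepdims=True)
    frame_ll = peak[:, 0] + np.log(np.exp(log_weighted - peak).sum(axis=1))
    return frame_ll.mean()

def identify(x, speaker_models):
    """Return the speaker whose GMM gives the highest average log-likelihood.

    speaker_models maps a name to a (weights, means, variances) tuple.
    """
    return max(
        speaker_models,
        key=lambda name: gmm_avg_log_likelihood(x, *speaker_models[name]),
    )
```

In this sketch the projected features are decorrelated, which is one reason a diagonal-covariance GMM is a natural pairing with the KLT; whether the paper exploits this in the same way is not stated in the abstract.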
Pages: 1073-1075
Page count: 3