An experimental comparison of modelling techniques for speaker recognition under limited data condition

被引:13
|
作者
Jayanna, H. S. [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Commun Engn, Gauhati 781039, Assam, India
关键词
Speaker recognition; limited data; CVQ; FVQ; SOM; LVQ; GMM; GMM-UBM; IDENTIFICATION; SPEECH;
D O I
10.1007/s12046-009-0042-9
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Most of the existing modelling techniques for the speaker recognition task make an implicit assumption of sufficient data for speaker modelling and hence may lead to poor modelling under limited data condition. The present work gives an experimental evaluation of the modelling techniques like Crisp Vector Quantization (CVQ), Fuzzy Vector Quantization (FVQ), Self-Organizing Map (SOM), Learning Vector Quantization (LVQ), and Gaussian Mixture Model (GMM) classifiers. An experimental evaluation of the most widely used Gaussian Mixture Model-Universal Background Model (GMM-UBM) is also made. The experimental knowledge is then used to select a subset of classifiers for obtaining the combined classifiers. It is proposed that the combined LVQ and GMM-UBM classifier provides relatively better performance compared to all the individual as well as combined classifiers.
引用
收藏
页码:717 / 728
页数:12
相关论文
共 50 条
  • [1] An experimental comparison of modelling techniques for speaker recognition under limited data condition
    H. S. Jayanna
    S. R. Mahadeva Prasanna
    [J]. Sadhana, 2009, 34 : 717 - 728
  • [2] Speaker recognition under limited data condition by noise addition
    Krishnamoorthy, P.
    Jayanna, H. S.
    Prasanna, S. R. M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 13487 - 13490
  • [3] Multiple frame size and rate analysis for speaker recognition under limited data condition
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    [J]. IET SIGNAL PROCESSING, 2009, 3 (03) : 189 - 204
  • [4] Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    [J]. RADIOENGINEERING, 2009, 18 (03) : 307 - 316
  • [5] Automatic Speaker Recognition with Limited Data
    Li, Ruirui
    Jiang, Jyun-Yu
    Liu, Jiahao
    Hsieh, Chu-Cheng
    Wang, Wei
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 340 - 348
  • [6] Speaker recognition under stressed condition
    Senthil Raja G.
    Dandapat S.
    [J]. International Journal of Speech Technology, 2010, 13 (03) : 141 - 161
  • [7] Fuzzy vector quantization for speaker recognition under limited data conditions
    Jayanna, H. S.
    Prasanna, S. R. Mahadeva
    [J]. 2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 124 - 127
  • [8] Combination of System and Source Characteristics for Speaker Verification Under Limited Data Condition
    Kumari, T. R. Jayanthi
    Jayanna, H. S.
    [J]. 2016 IEEE 12TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA), 2016, : 157 - 161
  • [9] Comparison of Various Techniques for Speaker Recognition
    Kumar, Ajay
    Singh, Ravindra
    Kavita
    Sehgal, Shravan Kumar
    [J]. PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 938 - 942
  • [10] Training speaker recognition systems with limited data
    Vaessen, Nik
    van Leeuwen, David A.
    [J]. INTERSPEECH 2022, 2022, : 4760 - 4764