Speaker Recognition for Hindi Speech Signal using MFCC-GMM Approach

被引:32
|
作者
Maurya, Ankur [1 ]
Kumar, Divya [1 ]
Agarwal, R. K. [2 ]
机构
[1] Motilal Nehru Natl Inst Technol Allahabad, Allahabad 211004, Uttar Pradesh, India
[2] Natl Inst Technol Kurukshetra, Kurukshetra 136119, Haryana, India
关键词
Identification rate (IR); MFCC-GMM; MFCC-VQ; VERIFICATION; IDENTIFICATION;
D O I
10.1016/j.procs.2017.12.112
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Speaker recognition for different languages is still a big challenge for researchers. The accuracy of identification rate (IR) is great issue, if the utterance of speech sample is less. This paper aims to implement speaker recognition for Hindi speech samples using Mel frequency cepestral coffiecient vector quantization (MFCC-VQ) and Mel frequency cepestral cofficient-Gaussian mixture model (MFCC-GMM) for text dependent and text independent phrases. The accuracy of text independent recognition by MFCC-VQ and MFCC-GMM for Hindi speech sample is 77.64% and 86.27% respectively. However, the accuracy has increased significantly for text dependent recognition. The accuracy of Hindi speech samples are 85.49 % and 94.12 % using MFCC-VQ and MFCC-GMM approach. We have tested 15 speakers consisting 10 male and 5 female speakers. The total number of trails for each speaker is 17. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:880 / 887
页数:8
相关论文
共 50 条
  • [31] Higher Accuracy of Hindi Speech Recognition Due to Online Speaker Adaptation
    Sivaraman, Ganesh
    Malta, Swapnil
    Nabar, Neeraj
    Samudravijaya, K.
    TECHNOLOGY SYSTEMS AND MANAGEMENT, 2011, 145 : 233 - +
  • [32] Throat Microphone Speech Recognition using MFCC
    Vijayan, Amritha
    Mathai, Bipil Mary
    Valsalan, Karthik
    Johnson, Riyanka Raji
    Mathew, Lani Rachel
    Gopakumar, K.
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 392 - 395
  • [33] Emotion Recognition in Speech Using MFCC and Classifiers
    Ajitha, G.
    Prashanth, Addagatla
    Radhika, Chelle
    Chaitanya, Kancharapu
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING ( ICCVBIC 2021), 2022, 1420 : 197 - 207
  • [34] AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES
    He, Ling
    Lech, Margaret
    Allen, Nicholas
    2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 695 - 699
  • [35] Speaker recognition system using MFCC features and vector quantization
    Wang, Wei
    Deng, Huiwen
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2006, 27 (SUPPL.): : 2253 - 2255
  • [36] Discriminative speaker recognition using large margin GMM
    Jourani, Reda
    Daoudi, Khalid
    Andre-Obrecht, Regine
    Aboutajdine, Driss
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
  • [37] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
    Bagul, S. G.
    Shastri, R. K.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [38] Discriminative speaker recognition using large margin GMM
    Reda Jourani
    Khalid Daoudi
    Régine André-Obrecht
    Driss Aboutajdine
    Neural Computing and Applications, 2013, 22 : 1329 - 1336
  • [39] User Identification System Using Biometrics Speaker Recognition by MFCC and DTW along with signal processing package
    Muttaqi, Tazwar
    Mousavinezhad, S. Hossein
    Mahamud, Shaikh
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 79 - 83
  • [40] Text dependant speaker recognition using MFCC, LPC and DWT
    Chelali F.Z.
    Djeradi A.
    International Journal of Speech Technology, 2017, 20 (03) : 725 - 740