Speaker Recognition for Hindi Speech Signal using MFCC-GMM Approach

被引：32

作者：

Maurya, Ankur ^{[1
]}

Kumar, Divya ^{[1
]}

Agarwal, R. K. ^{[2
]}

机构：

[1] Motilal Nehru Natl Inst Technol Allahabad, Allahabad 211004, Uttar Pradesh, India

[2] Natl Inst Technol Kurukshetra, Kurukshetra 136119, Haryana, India

来源：

6TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS | 2018年 / 125卷

关键词：

Identification rate (IR); MFCC-GMM; MFCC-VQ; VERIFICATION; IDENTIFICATION;

D O I：

10.1016/j.procs.2017.12.112

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Speaker recognition for different languages is still a big challenge for researchers. The accuracy of identification rate (IR) is great issue, if the utterance of speech sample is less. This paper aims to implement speaker recognition for Hindi speech samples using Mel frequency cepestral coffiecient vector quantization (MFCC-VQ) and Mel frequency cepestral cofficient-Gaussian mixture model (MFCC-GMM) for text dependent and text independent phrases. The accuracy of text independent recognition by MFCC-VQ and MFCC-GMM for Hindi speech sample is 77.64% and 86.27% respectively. However, the accuracy has increased significantly for text dependent recognition. The accuracy of Hindi speech samples are 85.49 % and 94.12 % using MFCC-VQ and MFCC-GMM approach. We have tested 15 speakers consisting 10 male and 5 female speakers. The total number of trails for each speaker is 17. (C) 2018 The Authors. Published by Elsevier B.V.

引用

页码：880 / 887

页数：8

共 50 条

[31] Higher Accuracy of Hindi Speech Recognition Due to Online Speaker Adaptation
Sivaraman, Ganesh
Malta, Swapnil
Nabar, Neeraj
Samudravijaya, K.
TECHNOLOGY SYSTEMS AND MANAGEMENT, 2011, 145 : 233 - +
[32] Throat Microphone Speech Recognition using MFCC
Vijayan, Amritha
Mathai, Bipil Mary
Valsalan, Karthik
Johnson, Riyanka Raji
Mathew, Lani Rachel
Gopakumar, K.
2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 392 - 395
[33] Emotion Recognition in Speech Using MFCC and Classifiers
Ajitha, G.
Prashanth, Addagatla
Radhika, Chelle
Chaitanya, Kancharapu
COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING ( ICCVBIC 2021), 2022, 1420 : 197 - 207
[34] AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES
He, Ling
Lech, Margaret
Allen, Nicholas
2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 695 - 699
[35] Speaker recognition system using MFCC features and vector quantization
Wang, Wei
Deng, Huiwen
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2006, 27 (SUPPL.): : 2253 - 2255
[36] Discriminative speaker recognition using large margin GMM
Jourani, Reda
Daoudi, Khalid
Andre-Obrecht, Regine
Aboutajdine, Driss
NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8): : 1329 - 1336
[37] TEXT INDEPENDENT SPEAKER RECOGNITION SYSTEM USING GMM
Bagul, S. G.
Shastri, R. K.
2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
[38] Discriminative speaker recognition using large margin GMM
Reda Jourani
Khalid Daoudi
Régine André-Obrecht
Driss Aboutajdine
Neural Computing and Applications, 2013, 22 : 1329 - 1336
[39] User Identification System Using Biometrics Speaker Recognition by MFCC and DTW along with signal processing package
Muttaqi, Tazwar
Mousavinezhad, S. Hossein
Mahamud, Shaikh
2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 79 - 83
[40] Text dependant speaker recognition using MFCC, LPC and DWT
Chelali F.Z.
Djeradi A.
International Journal of Speech Technology, 2017, 20 (03) : 725 - 740

← 1 2 3 4 5 →