A MFCC based Hindi Speech Recognition Technique using HTK Toolkit

Cited by: 0
Authors
Tripathy, Shweta [1]
Baranwal, Neha [1]
Nandi, G. C. [1]
Affiliation
[1] Indian Inst Informat Technol, Robot & AI Lab, Dewghat, Jhalwa Allahaba, India
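Keywords
MFCC; LPC; HMM; Speech recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract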
To make use of robots' capabilities, we must be able to communicate with them efficiently, which is why human-robot interaction is attracting considerable research attention. In this paper, a speech recognition system is developed using two feature extraction techniques, MFCC (mel-frequency cepstral coefficients) and LPC (linear predictive coding), with an HMM (hidden Markov model) as the classifier. Little work in this field has addressed the Hindi language, and the existing work uses fairly small vocabularies. The work in this paper therefore uses a Hindi database with a somewhat larger vocabulary. The HMM is implemented using the HTK Toolkit, and the performance of the two feature extraction techniques is then compared. Audacity was used for the sound recordings, and Cygwin was used to run the HTK commands in a Linux-like environment on the Windows platform. The system was tested in both speaker-dependent and speaker-independent settings; the performance results and the comparison graph show that MFCC outperforms LPC in every condition tested.
Pages: 539-544
Page count: 6