Continuous Hindi Speech Recognition Model Based on Kaldi ASR Toolkit

被引:0
|
作者
Upadhyaya, Prashant [1 ]
Farooq, Omar [1 ]
Abidi, Musiur Raza [1 ]
Varshney, Yash Vardhan [1 ]
机构
[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India
关键词
Kaldi ASR; Weight Finite State Transducers; MFCC; Speech recognition;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, continuous Hindi speech recognition model using Kaldi toolkit is presented. For recognition, MFCC and PLP features are extracted from 1000 phonetically balanced Hindi sentence from AMUAV corpus. Acoustic modeling was performed using GMM-HMM and decoding is performed on so called HCLG which is construted from Weight Finite State Transducers (WFSTs). Performance of both monophone and triphone model using N-gram language model is reported which is computed in term of word error rate (WER). A significant reduction in word error rate (WER) was observed using the triphone model. Further, it was found that MFCC feature provide higher recognition accuracy than PLP feature. Goal is to show the performance of Hindi language using present state-of-the-art (Kaldi) system.
引用
收藏
页码:786 / 789
页数:4
相关论文
共 50 条
  • [1] Continuous Punjabi speech recognition model based on Kaldi ASR toolkit
    Guglani, Jyoti
    Mishra, A. N.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2018, 21 (02) : 211 - 216
  • [2] Amazigh speech recognition based on the Kaldi ASR toolkit
    Barkani F.
    Hamidi M.
    Laaidi N.
    Zealouk O.
    Satori H.
    Satori K.
    [J]. International Journal of Information Technology, 2023, 15 (7) : 3533 - 3540
  • [3] Continuous Hindi Speech Recognition Using Kaldi ASR Based on Deep Neural Network
    Upadhyaya, Prashant
    Mittal, Sanjeev Kumar
    Farooq, Omar
    Varshney, Yash Vardhan
    Abidi, Musiur Raza
    [J]. MACHINE INTELLIGENCE AND SIGNAL ANALYSIS, 2019, 748 : 303 - 311
  • [4] Speaker Adaptive Model for Hindi Speech using Kaldi Speech Recognition toolkit
    Upadhyaya, Prashant
    Mittal, Sanjeev Kumar
    Varshney, Yash Vardhan
    Farooq, Omar
    Abidi, Musiur Raza
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 222 - 226
  • [5] DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit
    Guglani, Jyoti
    Mishra, A. N.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (01) : 41 - 45
  • [6] DNN based continuous speech recognition system of Punjabi language on Kaldi toolkit
    Jyoti Guglani
    A. N. Mishra
    [J]. International Journal of Speech Technology, 2021, 24 : 41 - 45
  • [7] Deep Neural Network Based Continuous Speech Recognition for Serbian Using the Kaldi Toolkit
    Popovic, Branislav
    Ostrogonac, Stevan
    Pakoci, Edvin
    Jakovljevic, Niksa
    Delic, Vlado
    [J]. SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 186 - 192
  • [8] THE PYTORCH-KALDI SPEECH RECOGNITION TOOLKIT
    Ravanelli, Mirco
    Parcollet, Titouan
    Bengio, Yoshua
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6465 - 6469
  • [9] How to Add Word Classes to the Kaldi Speech Recognition Toolkit
    Horndasch, Axel
    Kaufhold, Caroline
    Noeth, Elmar
    [J]. TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 486 - 494
  • [10] Performance analysis of ASR Model for Santhali language on Kaldi and Matlab Toolkit
    Kumar, Arvind
    Kumar, Rampravesh
    Kishore, Kamlesh
    [J]. 2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS ON ELECTRONICS, INFORMATION, COMMUNICATION & TECHNOLOGY (RTEICT-2020), 2020, : 88 - 92