Continuous Hindi Speech Recognition Model Based on Kaldi ASR Toolkit

Authors
Upadhyaya, Prashant [1]
Farooq, Omar [1]
Abidi, Musiur Raza [1]
Varshney, Yash Vardhan [1]
Affiliations
[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India
Keywords
Kaldi ASR; Weighted Finite State Transducers; MFCC; Speech recognition
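DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification codes
0808; 0809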
Abstract
In this paper, a continuous Hindi speech recognition model built with the Kaldi toolkit is presented. For recognition, MFCC and PLP features are extracted from 1000 phonetically balanced Hindi sentences from the AMUAV corpus. Acoustic modeling is performed using GMM-HMM, and decoding is performed on the so-called HCLG graph, which is constructed from Weighted Finite State Transducers (WFSTs). The performance of both monophone and triphone models using an N-gram language model is reported in terms of word error rate (WER). A significant reduction in WER was observed with the triphone model. Further, MFCC features were found to provide higher recognition accuracy than PLP features. The goal is to show the performance achievable for the Hindi language using a present state-of-the-art system (Kaldi).
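Since the abstract reports results as word error rate, a minimal sketch of how WER is conventionally computed may help: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; this is not the scoring code used in the paper or in Kaldi.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Illustrative sketch only: tokens are split on whitespace, which is an
    assumption and not necessarily how the paper's transcripts were tokenized.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference. Comparing monophone and triphone systems, as the abstract does, amounts to comparing this ratio accumulated over the same set of test sentences.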
Pages: 786-789
Number of pages: 4