Continuous Hindi Speech Recognition Model Based on Kaldi ASR Toolkit

Cited by: 0

Authors:

Upadhyaya, Prashant [1]

Farooq, Omar [1]

Abidi, Musiur Raza [1]

Varshney, Yash Vardhan [1]

Affiliations:

[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India

Source:

2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET) | 2017

Keywords:

Kaldi ASR; Weighted Finite State Transducers; MFCC; Speech recognition;

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

In this paper, a continuous Hindi speech recognition model using the Kaldi toolkit is presented. For recognition, MFCC and PLP features are extracted from 1000 phonetically balanced Hindi sentences from the AMUAV corpus. Acoustic modeling was performed using GMM-HMM, and decoding was performed on the so-called HCLG graph, which is constructed from Weighted Finite State Transducers (WFSTs). The performance of both monophone and triphone models using an N-gram language model is reported in terms of word error rate (WER). A significant reduction in WER was observed using the triphone model. Further, it was found that MFCC features provide higher recognition accuracy than PLP features. The goal is to show the performance achievable for the Hindi language using a present state-of-the-art system (Kaldi).
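
For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of how word error rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the sample sentences are illustrative assumptions; this is not the paper's scoring script, and it does not reproduce any result from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; words are split on whitespace.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Made-up example sentences, not taken from the AMUAV corpus.
    ref = "मैं घर जा रहा हूँ"
    hyp = "मैं घर जा रही हूँ"
    print(f"WER = {word_error_rate(ref, hyp):.2f}")
```

Because insertions count as errors, a WER computed this way can exceed 1.0 when the hypothesis is much longer than the reference.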


Pages: 786-789

Page count: 4

50 records in total

[21] Continuous Hindi Speech Recognition Using Gaussian Mixture HMM
Kumar, Ankit
Dua, Mohit
Choudhary, Tripti
[J]. 2014 IEEE STUDENTS' CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER SCIENCE (SCEECS), 2014,
[22] Hindi phoneme-viseme recognition from continuous speech
Mishra, A. N.
Chandra, Mahesh
Biswas, Astik
Sharan, S. N.
[J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2013, 6 (03) : 164 - 171
[23] Automatic Speech Recognition of Bengali Using Kaldi
Guchhait, Subhadeep
Hans, Arnold Sachith A.
Augustine, Jacob
[J]. PROCEEDINGS OF SECOND INTERNATIONAL CONFERENCE ON SUSTAINABLE EXPERT SYSTEMS (ICSES 2021), 2022, 351 : 153 - 166
[24] Confusion analysis in phoneme based speech recognition in Hindi
Bhatt, Shobha
Dev, Amita
Jain, Anurag
[J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 (10) : 4213 - 4238
[25] Confusion analysis in phoneme based speech recognition in Hindi
Shobha Bhatt
Amita Dev
Anurag Jain
[J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 4213 - 4238
[26] Confusion analysis in phoneme based speech recognition in Hindi
Bhatt, Shobha
Dev, Amita
Jain, Anurag
[J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (10) : 4213 - 4238
[27] Evaluation of two simultaneous continuous speech recognition with ICA BSS and MFT-Based ASR
Takeda, Ryu
Yamamoto, Shun'ichi
Komatani, Kazunori
Ogata, Tetsuya
Okuno, Hiroshi G.
[J]. NEW TRENDS IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4570 : 384 - +
[28] A KALDI-BASED ASR SOLUTION FOR THE ROMANIAN JUDICIAL SYSTEM
Zalhan, Paula-Georgiana
Stan, Alexandru
Teodorescu, Lucian-Radu
Saupe, Andrei-Bogdan
Duma, Melania
[J]. INTERNATIONAL CONFERENCE ON INFORMATICS IN ECONOMY, IE 2016: EDUCATION, RESEARCH & BUSINESS TECHNOLOGIES, 2016, : 191 - 197
[29] Development and Comparison of ASR Models using Kaldi for Noisy and Enhanced Kannada Speech Data
Yadava, Thimmaraja G.
Jayanna, H. S.
[J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1832 - 1838
[30] DNN-Based Acoustic Modeling for Russian Speech Recognition Using Kaldi
Kipyatkova, Irina
Karpov, Alexey
[J]. SPEECH AND COMPUTER, 2016, 9811 : 246 - 253
