Syllable based Hindi speech recognition

Cited by: 7
Authors
Bhatt, Shobha [1]
Jain, Anurag [1]
Dev, Amita [2]
Affiliations
[1] Guru Gobind Singh Indraprastha Univ, Univ Sch Informat & Commun Technol, Sect 16 C, New Delhi 110078, India
[2] Indira Gandhi Delhi Tech Univ Women, Dept Informat Technol, New Delhi 110006, India
Source
Keywords
Speech recognition; Syllable; Acoustic model; HMM; PLP; Hindi speech
DOI
10.1080/02522667.2020.1809091
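Chinese Library Classification
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501
Abstract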
This paper uses the syllable, one of the acoustic units of speech, to develop a continuous Hindi speech recognition system. The syllable is a larger acoustic unit that overcomes contextual effects and requires fewer training samples than triphone-based and word-based models. Phoneme-based units suffer from contextual influences, while context-dependent triphones suffer from the unavailability of triphone patterns and need large memory storage for their numerous models. Earlier work on Hindi speech recognition used word, phoneme, and context-dependent models. The authors propose a syllable-based system because syllable units offer several advantages: longer acoustic units, fast decoding, reduced contextual effects, and fewer irregularities due to phonemes. Hindi is widely spoken in India and in other parts of the world. The experiments were performed on continuous Hindi speech using the widely known Hidden Markov Model (HMM) with perceptual linear predictive (PLP) coefficients. The results show that using syllables improved system performance by 27% over phoneme-based models and by 20% over triphone-based models. The findings indicate that selecting an appropriate acoustic unit for Hindi may improve the performance of a speech recognition system. The study also provides useful insights for developing a syllable-based pronunciation dictionary that may be used in speech recognition, speaker identification, and text-to-speech conversion systems.
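As a rough illustration of the syllable-based pronunciation dictionary the abstract mentions, the sketch below expands a word sequence into acoustic units under a phoneme lexicon and under a syllable lexicon. Everything in it is an assumption made for illustration: the romanized entries, the unit inventories, and the names `PHONEME_LEXICON`, `SYLLABLE_LEXICON`, and `expand` are hypothetical and are not taken from the paper.

```python
from typing import Dict, List

# Hypothetical toy lexicons in romanized form; not the paper's dictionary.
PHONEME_LEXICON: Dict[str, List[str]] = {
    "namaste": ["n", "a", "m", "a", "s", "t", "e"],
    "bharat": ["bh", "aa", "r", "a", "t"],
}
SYLLABLE_LEXICON: Dict[str, List[str]] = {
    "namaste": ["na", "mas", "te"],
    "bharat": ["bhaa", "rat"],
}


def expand(words: List[str], lexicon: Dict[str, List[str]]) -> List[str]:
    """Expand a word sequence into the acoustic units listed in the lexicon."""
    units: List[str] = []
    for word in words:
        if word not in lexicon:
            raise KeyError(f"word not in lexicon: {word}")
        units.extend(lexicon[word])
    return units


utterance = ["namaste", "bharat"]
phoneme_units = expand(utterance, PHONEME_LEXICON)
syllable_units = expand(utterance, SYLLABLE_LEXICON)

# Each unit would correspond to one acoustic model (for example, one HMM).
print(len(phoneme_units), phoneme_units)
print(len(syllable_units), syllable_units)
```

In this toy case the same utterance maps to fewer, longer units under the syllable lexicon, which is the property the abstract associates with syllable-based modelling.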
Pages: 1333-1351
Page count: 19