Syllable based Hindi speech recognition

被引:7
|
作者
Bhatt, Shobha [1 ]
Jain, Anurag [1 ]
Dev, Amita [2 ]
机构
[1] Guru Gobind Singh Indraprastha Univ, Univ Sch Informat & Commun Technol, Sect 16 C, New Delhi 110078, India
[2] Indira Gandhi Delhi Tech Univ Women, Dept Informat Technol, New Delhi 110006, India
来源
关键词
Speech recognition; Syllable; Acoustic model; HMM; PLP; Hindi speech;
D O I
10.1080/02522667.2020.1809091
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
In this paper, one of the acoustic units of speech, the syllable, is used for the development of a continuous Hindi speech recognition system. The syllable is a larger acoustic unit that overcomes the contextual effects and requires fewer training samples in comparison to triphone based and word-based models. Other acoustic units such as phoneme-based suffer from contextual influences, and context-dependent triphones suffer due to the non-availability of triphone patterns with a large memory storage for numerous models. Earlier research works related to Hindi speech recognition were performed using the word, phoneme, and context-dependent models. The authors proposed a syllable based Hindi speech recognition system in this study due to different advantages of syllable units such as longer acoustic units, fast decoding, reducing contextual effects, and reduction of irregularities due to phonemes. The continuous Hindi speech recognition system was developed utilizing syllable based acoustic units. Hindi is widely spoken in India and other parts of the world also. The experiments are performed on Continuous Hindi speech by using a widely known Hidden Markov Model (HMM) with perceptual linear predictive coefficients(PLPs). The research outcomes reveal that by using syllables, the performance of the system was increased by 27% than phoneme and 20% than triphones. Research findings indicate that by selecting an appropriate acoustic unit for Hindi, the performance of the speech recognition system may be improved. Further, the study also provides useful insights to develop a syllable based pronunciation dictionary that may be used in speech recognition, speaker identification, and text to speech conversion systems.
引用
收藏
页码:1333 / 1351
页数:19
相关论文
共 50 条
  • [41] Grapheme Gaussian Model and Prosodic Syllable Based Tamil Speech Recognition System
    Ganesh, Akila. A.
    Ravichandran, Chandra
    [J]. 2013 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSC), 2013, : 401 - 406
  • [42] Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish
    Majewski, Piotr
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 397 - 401
  • [43] Deep Neural Networks for Syllable based Acoustic Modeling in Chinese Speech Recognition
    Li, Xiangang
    Hong, Caifu
    Yang, Yuning
    Wu, Xihong
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [44] A study on conventional and syllable-based approaches for automatic speech recognition in Malayalam
    Jasmin S
    Ashish Abraham Samuel
    Rajeev Rajan
    [J]. Sādhanā, 47
  • [45] Syllable-Based Automatic Arabic Speech Recognition in Different Conditions of Noise
    Azmi, Mohamed M.
    Tolba, Hesham
    [J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 601 - +
  • [46] A study on conventional and syllable-based approaches for automatic speech recognition in Malayalam
    Jasmin, S.
    Samuel, Ashish Abraham
    Rajan, Rajeev
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2022, 47 (04):
  • [47] Attention based end to end Speech Recognition for Voice Search in Hindi and English
    Joshi, Raviraj
    Kannan, Venkateshan
    [J]. FIRE 2021: PROCEEDINGS OF THE 13TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION, 2021, : 107 - 113
  • [48] Performance enhancement of syllable based Tamil speech recognition system using time normalization and rate of speech
    A. Akila
    E. Chandra
    [J]. CSI Transactions on ICT, 2014, 2 (2) : 77 - 84
  • [49] SPEECH RECOGNITION PERFORMANCE ON A MODIFIED NONSENSE SYLLABLE TEST
    GELFAND, SA
    SCHWANDER, T
    LEVITT, H
    WEISS, M
    SILMAN, S
    [J]. JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1992, 29 (01): : 53 - 60
  • [50] PARALLEL ALGORITHMS FOR SYLLABLE RECOGNITION IN CONTINUOUS SPEECH.
    De Mori, Renato
    Laface, Pietro
    Yu Mong
    [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1985, PAMI-7 (01) : 56 - 69