Automatic speech segmentation in syllable centric speech recognition system

Cited by: 26
Authors
Panda S.P. [1]
Nayak A.K. [2]
Affiliations
[1] Department of CSE, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha
[2] Department of CS&IT, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha
Keywords
Indian languages; Speech recognition; Speech segmentation; Syllable; Vowel offset point; Vowel onset point; Zero crossing rate
DOI
10.1007/s10772-015-9320-6
Abstract
Speech recognition is the process by which a computer understands human (natural language) speech. A syllable-centric speech recognition system identifies the syllable boundaries in the input speech and converts the resulting units into the corresponding written script or text units. Appropriate segmentation of the acoustic speech signal into syllabic units is therefore an important step in developing a highly accurate speech recognition system. This paper presents an automatic syllable-based segmentation technique for segmenting continuous speech signals in Indian languages at syllable boundaries. To analyze the performance of the proposed technique, a set of experiments is carried out on different speech samples in three Indian languages (Hindi, Bengali, and Odia), and the results are compared with those of the existing group-delay-based segmentation technique and with manual segmentation. The results of all the experiments show that the proposed technique is effective in segmenting syllable units from the original speech samples compared to the existing techniques. © 2015, Springer Science+Business Media New York.
Pages: 9-18
Page count: 9