Automatic speech segmentation in syllable centric speech recognition system

被引：26

作者：

Panda S.P. ^{[1
]}

Nayak A.K. ^{[2
]}

机构：

[1] Department of CSE, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha

[2] Department of CS&IT, Institute of Technical Education and Research, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha

来源：

International Journal of Speech Technology | 2016年 / 19卷 / 1期

关键词：

Indian languages; Speech recognition; Speech segmentation; Syllable; Vowel offset point; Vowel onset point; Zero crossing rate;

D O I：

10.1007/s10772-015-9320-6

中图分类号：

学科分类号：

摘要：

Speech recognition is the process of understanding the human or natural language speech by a computer. A syllable centric speech recognition system in this aspect identifies the syllable boundaries in the input speech and converts it into the respective written scripts or text units. Appropriate segmentation of the acoustic speech signal into syllabic units is an important task for development of highly accurate speech recognition system. This paper presents an automatic syllable based segmentation technique for segmenting continuous speech signals in Indian languages at syllable boundaries. To analyze the performance of the proposed technique, a set of experiments are carried out on different speech samples in three Indian languages Hindi, Bengali and Odia and are compared with the existing group delay based segmentation technique along with the manual segmentation technique. The results of all our experiments show the effectiveness of the proposed technique in segmenting the syllable units from the original speech samples compared to the existing techniques. © 2015, Springer Science+Business Media New York.

引用

页码：9 / 18

页数：9

共 50 条

[21] Automatic syllable segmentation algorithm of Chinese speech based on MF-DFA
He, Shaofang
Zhao, Huan
SPEECH COMMUNICATION, 2017, 92 : 42 - 51
[22] Syllable based Hindi speech recognition
Bhatt, Shobha
Jain, Anurag
Dev, Amita
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2020, 41 (06): : 1333 - 1351
[23] Automatic Acoustic Segmentation for Speech Recognition on Broadcast Recordings
Peng, Gang
Hwang, Mei-Yuh
Ostendorf, Mari
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2580 - 2583
[24] Game Theoretic Approach for Automatic Speech Segmentation and Recognition
Rekha, J. Ujwala
Chatrapati, K. Shahu
Babu, A. Vinaya
2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
[25] Speech segmentation without speech recognition
Wang, D
Lu, L
Zhang, HJ
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 468 - 471
[26] An Algorithm to Identify Syllable from a Visual Speech Recognition System
Subhashini, J.
Kumar, C. Manoj
WIRELESS PERSONAL COMMUNICATIONS, 2019, 107 (04) : 2105 - 2121
[27] Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference
Lee, Byeongwook
Cho, Kwang-Hyun
SCIENTIFIC REPORTS, 2016, 6
[28] Speech segmentation without speech recognition
Wang, D
Lu, L
Zhang, HJ
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 405 - 408
[29] Brain-inspired speech segmentation for automatic speech recognition using the speech envelope as a temporal reference
Byeongwook Lee
Kwang-Hyun Cho
Scientific Reports, 6
[30] An Algorithm to Identify Syllable from a Visual Speech Recognition System
J. Subhashini
C. Manoj Kumar
Wireless Personal Communications, 2019, 107 : 2105 - 2121

← 1 2 3 4 5 →