Speech recognition using syllable-like units

被引:0
|
作者
Hu, ZH
Schalkwyk, J
Barnard, E
Cole, R
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
It is well known that speech is dynamic and that frame-based systems lack the ability to realistically model the dynamics of speech. Segment-based systems offer the potential to integrate the dynamics of speech, at least within the phoneme boundaries, although it is difficult to obtain accurate phonemic segmentation in fluent speech. In this paper we propose a new approach which uses syllable-like units in recognition. In the proposed approach, syllable-like units are defined by rules and used as the basic units of recognition. The motivation for using syllable-like units is (1) by modeling perceptually more meaningful units, better modeling of speech can be achieved; and (2) this method provides a better framework for incorporating dynamic modeling techniques into the recognition system. The proposed approach has achieved the same recognition performance on the task of recognizing months of the year as compared to the best frame-based recognizer available.
引用
收藏
页码:1117 / 1120
页数:4
相关论文
共 50 条
  • [1] Automatic Segmentation of Hindi Speech into Syllable-Like Units
    Kumari, Ruchika
    Dev, Amita
    Kumar, Ashwani
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 400 - 406
  • [2] Automatic segmentation of Hindi speech into syllable-like units
    Kumari, Ruchika
    Dev, Amita
    Kumar, Ashwani
    [J]. International Journal of Advanced Computer Science and Applications, 2020, 11 (05): : 400 - 406
  • [3] Pre-linguistic segmentation of speech into syllable-like units
    Rasanen, Okko
    Doyle, Gabriel
    Frank, Michael C.
    [J]. COGNITION, 2018, 171 : 130 - 150
  • [4] Unsupervised word discovery from speech using automatic segmentation into syllable-like units
    Rasanen, Okko
    Doyle, Gabriel
    Frank, Michael C.
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3204 - 3208
  • [5] Automatic transcription of continuous speech into syllable-like units for Indian languages
    Sarada, G. Lakshmi
    Lakshmi, A.
    Murthy, Hema A.
    Nagarajan, T.
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2009, 34 (02): : 221 - 233
  • [6] Language identification using parallel syllable-like unit recognition
    Nagarajan, T
    Murthy, HA
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 401 - 404
  • [7] Subband-based group delay segmentation of spontaneous speech Into syllable-like units
    [J]. Nagarajan, T. (raju@lantana.iitm.ernet.in), 1600, Hindawi Publishing Corporation (2004):
  • [8] Subband-Based Group Delay Segmentation of Spontaneous Speech into Syllable-Like Units
    T. Nagarajan
    H.A. Murthy
    [J]. EURASIP Journal on Advances in Signal Processing, 2004
  • [9] Subband-based group delay segmentation of spontaneous speech into syllable-like units
    Nagarajan, T
    Murthy, HA
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (17) : 2614 - 2625
  • [10] Automatic Segmentation of Chinese Mandarin Speech into Syllable-like
    Li, Jian
    Shen, Furao
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 57 - 60