Burst Onset Landmark Detection and Its Application to Speech Recognition

Cited by: 8
Authors
Lin, Chi-Yueh [1]
Wang, Hsiao-Chuan [1]
Affiliations
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
Keywords
Affricate consonant; burst onset; random forest; speech recognition; stop consonant
DOI
10.1109/TASL.2010.2089518
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
The reliable detection of salient acoustic-phonetic cues in the speech signal plays an important role in landmark-based speech recognition. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants. The proposed detector is based purely on the random forest technique, an ensemble of tree-structured classifiers. A series of experiments on the TIMIT database shows that, with a special asymmetric bootstrapping method, the proposed detector detects burst onsets efficiently and accurately. When the detection results are appended to mel-frequency cepstral coefficient vectors, the augmented feature vectors improve the correctness of hidden Markov models in recognizing stop and affricate consonants in continuous speech.
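The abstract does not spell out the asymmetric bootstrapping scheme or how the detection results are appended. The sketch below is one plausible reading, not the authors' implementation. It assumes "asymmetric" means that each tree's bootstrap sample draws a separately chosen number of burst-onset and non-onset frames, and that one detector score is appended to each frame's MFCC vector. The function names `asymmetric_bootstrap` and `augment_frames` and the parameters `n_pos` and `n_neg` are hypothetical.

```python
import random


def asymmetric_bootstrap(labels, n_pos, n_neg, rng):
    """Draw frame indices with replacement, separately for each class.

    ASSUMPTION: "asymmetric" is read here as class-dependent sample
    counts (label 1 = burst onset, label 0 = everything else). The
    paper's actual scheme may differ.
    """
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    # random.Random.choices samples with replacement, as a bootstrap requires.
    return rng.choices(pos, k=n_pos) + rng.choices(neg, k=n_neg)


def augment_frames(mfcc_frames, detector_scores):
    """Append one detector score to each MFCC frame vector."""
    if len(mfcc_frames) != len(detector_scores):
        raise ValueError("one score per frame is required")
    return [list(frame) + [score]
            for frame, score in zip(mfcc_frames, detector_scores)]


# Toy usage: 5 onset frames among 100, rebalanced to 20 + 20 draws.
rng = random.Random(0)
labels = [0] * 95 + [1] * 5
sample = asymmetric_bootstrap(labels, n_pos=20, n_neg=20, rng=rng)
augmented = augment_frames([[1.0, 2.0], [3.0, 4.0]], [0.9, 0.1])
```

Under this reading, each tree in the forest would be trained on its own such sample, so the rare onset frames are not swamped by the much more frequent non-onset frames. The augmented vectors would then replace the plain MFCC vectors as input to the hidden Markov models.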
Pages: 1253-1264
Page count: 12