Burst Onset Landmark Detection and Its Application to Speech Recognition

被引:8
|
作者
Lin, Chi-Yueh [1 ]
Wang, Hsiao-Chuan [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
关键词
Affricate consonant; burst onset; random forest; speech recognition; stop consonant;
D O I
10.1109/TASL.2010.2089518
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The reliable detection of salient acoustic-phonetic cues in speech signal plays an important role in speech recognition based on speech landmarks. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants. The proposed detector is purely based on a random forest technique, which belongs to an ensemble of tree-structured classifiers. By adopting a special asymmetric bootstrapping method, a series of experiments conducted on the TIMIT database demonstrate that the proposed detector is an efficient and accurate method for detecting burst onsets. When the detection results are appended to mel frequency cepstral coefficient vectors, the augmented feature vectors enhance the recognition correctness of hidden Markov models in recognizing stop and affricate consonants in continuous speech.
引用
收藏
页码:1253 / 1264
页数:12
相关论文
共 50 条
  • [31] Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition
    Lee, Jung-In
    Choi, Jeung-Yoon
    Kang, Hong-Goo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (03) : 746 - 749
  • [32] Cooperative Learning and its Application to Emotion Recognition from Speech
    Zhang, Zixing
    Coutinho, Eduardo
    Deng, Jun
    Schuller, Bjoern
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (01) : 115 - 126
  • [33] Partly Hidden Markov Model and its application to speech recognition
    Waseda Univ, Tokyo, Japan
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (121-124):
  • [34] Partly hidden Markov model and its application to speech recognition
    Kobayashi, T
    Furuyama, J
    Masumitsu, K
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 121 - 124
  • [35] DEVELOPMENT OF WALSH LINEAR CODING AND ITS APPLICATION TO SPEECH RECOGNITION
    FELDMAN, FA
    HAQUE, T
    SPEECH COMMUNICATION, 1991, 10 (01) : 91 - 97
  • [36] A study of speech emotion recognition and its application to mobile services
    Yoon, Won-Joong
    Cho, Youn-Ho
    Park, Kyu-Sik
    UBIQUITOUS INTELLIGENCE AND COMPUTING, PROCEEDINGS, 2007, 4611 : 758 - +
  • [37] Landmark-based Approach to Speech Recognition: An Alternative to HMMs
    Espy-Wilson, Carol Y.
    Pruthi, Tarun
    Juneja, Amit
    Deshmukh, Om
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2516 - +
  • [38] Studies on inter-speaker variability in speech and its application in automatic speech recognition
    S UMESH
    Sadhana, 2011, 36 : 853 - 883
  • [39] A Review of Signal Subspace Speech Enhancement and Its Application to Noise Robust Speech Recognition
    Kris Hermus
    Patrick Wambacq
    Hugo Van hamme
    EURASIP Journal on Advances in Signal Processing, 2007
  • [40] Studies on inter-speaker variability in speech and its application in automatic speech recognition
    Umesh, S.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 853 - 883