Burst Onset Landmark Detection and Its Application to Speech Recognition

被引:8
|
作者
Lin, Chi-Yueh [1 ]
Wang, Hsiao-Chuan [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
关键词
Affricate consonant; burst onset; random forest; speech recognition; stop consonant;
D O I
10.1109/TASL.2010.2089518
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The reliable detection of salient acoustic-phonetic cues in speech signal plays an important role in speech recognition based on speech landmarks. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants. The proposed detector is purely based on a random forest technique, which belongs to an ensemble of tree-structured classifiers. By adopting a special asymmetric bootstrapping method, a series of experiments conducted on the TIMIT database demonstrate that the proposed detector is an efficient and accurate method for detecting burst onsets. When the detection results are appended to mel frequency cepstral coefficient vectors, the augmented feature vectors enhance the recognition correctness of hidden Markov models in recognizing stop and affricate consonants in continuous speech.
引用
收藏
页码:1253 / 1264
页数:12
相关论文
共 50 条
  • [21] SpeechMark: Landmark Detection Tool for Speech Analysis
    Boyce, Suzanne
    Fell, Harriet
    MacAuslan, Joel
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1892 - 1895
  • [22] Comparative landmark detection on stops of dysarthric speech
    Sunitha, S. V.
    Shivaputra
    Soundeswaran, S.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [23] Investigation of Speech Landmark Patterns for Depression Detection
    Huang, Zhaocheng
    Epps, Julien
    Joachim, Dale
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (02) : 666 - 679
  • [24] Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition
    Chao, Hao
    Song, Cheng
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2016, 12 (03): : 410 - 421
  • [25] Toward clinical application of landmark-based speech analysis: Landmark expression in normal adult speech
    Ishikawa, Keiko
    MacAuslan, Joel
    Boyce, Suzanne
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (05): : EL441 - EL447
  • [26] LANDMARK OF MANDARIN NASAL CODAS AND ITS APPLICATION IN PRONUNCIATION ERROR DETECTION
    Xie, Yanlu
    Hasegawa-Johnson, Mark
    Qu, Leyuan
    Zhang, Jinson
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5370 - 5374
  • [27] EMG burst presence probability: A joint time-frequency representation of muscle activity and its application to onset detection
    Liu, Jie
    Ying, Dongwen
    Rymer, William Zev
    JOURNAL OF BIOMECHANICS, 2015, 48 (06) : 1193 - 1197
  • [28] A robust ward boundary detection algorithm with application to speech recognition
    Agaiby, H
    Moir, TJ
    DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 753 - 755
  • [29] Speech rate estimation in disordered speech based on spectral landmark detection
    Huici, Hernandez-Diaz
    Kairuz, Hector A.
    Martens, Heidi
    Van Nuffelen, Gwen
    De Bodt, Marc
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2016, 27 : 1 - 6
  • [30] SPEECH LANDMARK BIGRAMS FOR DEPRESSION DETECTION FROM NATURALISTIC SMARTPHONE SPEECH
    Huang, Zhaocheng
    Epps, Julien
    Joachim, Dale
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5856 - 5860