Robust speech recognition using time boundary detection

被引:1
|
作者
Mohajer, K [1 ]
Hu, ZM [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
关键词
speech recognition; Hidden Markov Models; time boundary detection; speech segmentation;
D O I
10.1117/12.488199
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the benefits of including time boundary information in Hidden Markov Model based speech recognition systems. Traditional systems normally feed the parameterized data into the HMM recognizer, which result in relatively complicated models and computationally expensive search steps. We propose a few methods of detecting time boundaries prior to parameterization, and present a novel way of including this additional information in the recognizer. The result is significant simplification in the model prototypes, higher accuracy and faster performance.
引用
收藏
页码:335 / 343
页数:9
相关论文
共 50 条
  • [41] Speech parameters for the robust emotional speech recognition
    Kim W.-G.
    [J]. Journal of Institute of Control, Robotics and Systems, 2010, 16 (12) : 1137 - 1142
  • [42] Robust recognition of fast speech
    Lee, Ki-Seung
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (08) : 2456 - 2459
  • [43] Japanese speech databases for robust speech recognition
    Nakamura, A
    Matsunaga, S
    Shimizu, T
    Tonomura, M
    Sagisaka, Y
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2199 - 2202
  • [44] Robust speech detector for speech recognition applications
    Liang, WQ
    Chen, YN
    Shan, YX
    Liu, J
    Liu, RS
    [J]. 2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 453 - 456
  • [45] Speech Recognition Using Dynamic Time Warping
    Amin, Talal Bin
    Mahmood, Iftekhar
    [J]. ICAST 2008: PROCEEDINGS OF 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SPACE TECHNOLOGIES, 2008, : 72 - 77
  • [46] Robust endpoint detection for speech recognition based on discriminative feature extraction
    Yamamoto, Koichi
    Jabloun, Firas
    Reinhard, Klaus
    Kawamura, Akinori
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 805 - 808
  • [47] Multi-sensory microphones for robust speech detection, enhancement and recognition
    Zhang, ZY
    Liu, ZC
    Sinclair, M
    Acero, A
    Deng, L
    Droppo, J
    Huang, XD
    Zheng, YL
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 781 - 784
  • [48] Time-Frequency Masking For Large Scale Robust Speech Recognition
    Wang, Yuxuan
    Misra, Ananya
    Chine, Kean K.
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2469 - 2473
  • [49] Transition features for CRF-based speech recognition and boundary detection
    Dimopoulos, Spiros
    Fosler-Lussier, Eric
    Lee, Chin-Hui
    Potamianos, Alexandros
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 99 - +
  • [50] Recursive estimation of time-varying environments for robust speech recognition
    Zhao, YX
    Wang, SJ
    Yen, KC
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 225 - 228