Robust speech recognition using time boundary detection

被引:1
|
作者
Mohajer, K [1 ]
Hu, ZM [1 ]
机构
[1] Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
关键词
speech recognition; Hidden Markov Models; time boundary detection; speech segmentation;
D O I
10.1117/12.488199
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the benefits of including time boundary information in Hidden Markov Model based speech recognition systems. Traditional systems normally feed the parameterized data into the HMM recognizer, which result in relatively complicated models and computationally expensive search steps. We propose a few methods of detecting time boundaries prior to parameterization, and present a novel way of including this additional information in the recognizer. The result is significant simplification in the model prototypes, higher accuracy and faster performance.
引用
收藏
页码:335 / 343
页数:9
相关论文
共 50 条
  • [1] A robust ward boundary detection algorithm with application to speech recognition
    Agaiby, H
    Moir, TJ
    [J]. DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 753 - 755
  • [2] An Evaluation of Keyword Detection Using ACF of Pitch for Robust Speech Recognition
    Tang, Jiayue
    Tian, Yu
    Jiang, Xiaonan
    Tsutsui, Hiroshi
    Miyanaga, Yoshikazu
    [J]. 2018 18TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2018, : 96 - 100
  • [3] Robust edge detection method for speech recognition
    Dai, HS
    Zhu, XY
    Luo, YP
    Yang, SY
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 609 - 612
  • [4] Robust speech detection method for telephone speech recognition system
    Kuroiwa, S
    Naito, M
    Yamamoto, S
    Higuchi, N
    [J]. SPEECH COMMUNICATION, 1999, 27 (02) : 135 - 148
  • [5] Robust End Point Detection for Robot Speech Recognition Using Double Talk Detection
    Moon, Sung-Kyu
    Park, Jin-Soo
    Ko, Hanseok
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (03): : 161 - 169
  • [6] Robust distributed speech recognition using speech enhancement
    Flynn, Ronan
    Jones, Edward
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2008, 54 (03) : 1267 - 1273
  • [7] Robust recognition of noisy speech using speech enhancement
    Xu, YF
    Zhang, JJ
    Yao, KS
    Cao, ZG
    Ma, ZX
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737
  • [8] ROBUST WORD BOUNDARY DETECTION IN SPONTANEOUS SPEECH USING ACOUSTIC AND LEXICAL CUES
    Tsiartas, Andreas
    Ghosh, Prasanta Kumar
    Georgiou, Panayiotis
    Narayanan, Shrikanth
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4785 - 4788
  • [9] Robust endpoint detection and energy normalization for real-time speech and speaker recognition
    Li, Q
    Zheng, JS
    Tsai, A
    Zhou, QR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (03): : 146 - 157
  • [10] Robust end-of-utterance detection for real-time speech recognition applications
    Hariharan, R
    Häkkinen, J
    Laurila, K
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 249 - 252