Improvement in N-best search for continuous speech recognition

被引:0
|
作者
Illina, I
Gong, YF
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, several techniques for reducing the search complexity of beam search for continuous speech recognition task are proposed. Six heuristic methods for pruning ate described and the parameters of the pruning are adjusted to keep constant the word error rate while reducing the computational complexity and memory demand. The evaluation of the effect of each pruning method is performed in Mixture Stochastic Trajectory Model (MSTM). MSTM is a segment-based model using phonemes as the speech units. The set of tests in a speaker-dependent continuous speech recognition task shows that using the pruning methods, a substantial reduction of 67% of search effort is obtained in term of number of hypothesised phonemes during the search. All proposed techniques are independent of the acoustic models and therefore are applicable to other acoustic modeling techniques.
引用
收藏
页码:2147 / 2150
页数:4
相关论文
共 50 条
  • [31] Segmental search for continuous speech recognition
    Laface, P
    Fissore, L
    Maro, A
    Ravera, F
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2155 - 2158
  • [32] Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition
    Yongzhe Shi
    Wei-Qiang Zhang
    Meng Cai
    Jia Liu
    EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [33] Discriminative keyword spotting using triphones information and N-best search
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    INFORMATION SCIENCES, 2018, 423 : 157 - 171
  • [34] IMPROVING NONNATIVE SPEECH UNDERSTANDING USING CONTEXT AND N-BEST MEANING FUSION
    Xu, Yushi
    Seneff, Stephanie
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4977 - 4980
  • [35] DISCRIMINATIVE LEARNING USING LINGUISTIC FEATURES TO RESCORE N-BEST SPEECH HYPOTHESES
    Georgescul, Maria
    Rayner, Manny
    Bouillon, Pierrette
    Tsourakis, Nikos
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 97 - 100
  • [36] SEARCH RESULTS BASED N-BEST HYPOTHESIS RESCORING WITH MAXIMUM ENTROPY CLASSIFICATION
    Peng, Fuchun
    Roy, Scott
    Shahshahani, Ben
    Beaufays, Francoise
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 422 - 427
  • [37] Improved Deep Duel Model for Rescoring N-best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders
    Ogawa, Atsunori
    Delcroix, Marc
    Karita, Shigeki
    Nakatani, Tomohiro
    INTERSPEECH 2019, 2019, : 3900 - 3904
  • [38] Utterance verification using search confusion rate and its N-best approach
    Kim, K
    Kim, H
    Hahn, M
    ETRI JOURNAL, 2005, 27 (04) : 461 - 464
  • [39] Dynamic programming search for continuous speech recognition
    Ney, H
    Ortmanns, S
    IEEE SIGNAL PROCESSING MAGAZINE, 1999, 16 (05) : 64 - 83
  • [40] Fast search algorithms for continuous speech recognition
    Zhao, J
    Hamaker, J
    Deshmukh, N
    Ganapathiraju, A
    Picone, J
    IEEE SOUTHEASTCON '99, PROCEEDINGS, 1999, : 36 - 39