Fast decoding in large vocabulary name dialing

Cited by: 0
Authors
Suontausta, J [1]
Häkkinen, J [1]
Viikki, O [1]
Affiliations
[1] Nokia Res Ctr, Speech & Audio Syst Lab, Tampere, Finland
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Fast decoding is a key challenge in virtually all practical real-time speech recognition systems, since model decoding is still by far the most time-consuming operation in Automatic Speech Recognition (ASR). Current speech recognizers typically trade off the desired vocabulary size, the processing power available for recognition, and the recognition accuracy. Fast decoding methods are often needed to meet the real-time requirements set for a system, but their use must not, of course, degrade the recognition accuracy. In this paper, we investigate the performance of efficient decoding methods in large vocabulary name dialing. A tree-structured lexicon, fast observation probability evaluation, and adaptive Viterbi beam search are developed and integrated into a name dialing system. The system is tested with lexicons ranging from 100 to 3000 entries. With a lexicon of 1000 words, the fast decoding methods speed up the system by 282%, while degrading the recognition accuracy by only 0.95%.
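As a rough illustration of the tree-structured lexicon named in the abstract, the sketch below merges pronunciations that share a phoneme prefix into one prefix tree, so that shared prefixes need to be evaluated only once during decoding. It is not the authors' implementation: the class `LexiconNode`, the helper functions, and the phoneme transcriptions are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class LexiconNode:
    """One phoneme node in the prefix tree; `word` is set where an entry ends."""
    children: dict = field(default_factory=dict)
    word: Optional[str] = None


def build_tree_lexicon(pronunciations):
    """Merge phoneme sequences that share a prefix into a single tree."""
    root = LexiconNode()
    for word, phonemes in pronunciations.items():
        node = root
        for phoneme in phonemes:
            node = node.children.setdefault(phoneme, LexiconNode())
        node.word = word
    return root


def count_nodes(node):
    """Count phoneme nodes below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


# Hypothetical phoneme transcriptions, for illustration only.
names = {
    "anna": ["a", "n", "a"],
    "anne": ["a", "n", "e"],
    "antti": ["a", "n", "t", "i"],
    "olli": ["o", "l", "i"],
}

tree = build_tree_lexicon(names)
flat_size = sum(len(p) for p in names.values())  # one separate chain per name
tree_size = count_nodes(tree)                    # shared prefixes stored once

print(f"flat lexicon: {flat_size} phoneme nodes, tree lexicon: {tree_size}")
```

In this toy lexicon the three names beginning with "a n" share two nodes, which is the kind of saving a tree-structured lexicon is meant to provide; how much a real name-dialing vocabulary benefits depends on how many entries share prefixes.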
Pages: 1535-1538
Page count: 4