Large-vocabulary continuous speech recognition: Advances and applications

被引:31
|
作者
Gauvain, JL [1 ]
Lamel, L [1 ]
机构
[1] CNRS, LIMSI, F-91403 Orsay, France
关键词
acoustic modeling; continuous speech recognition; dictation; large vocabulary; model adaptation; multilinguality; portability; speaker-independent; speech recognition; spoken language system;
D O I
10.1109/5.880079
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The past decade has witnessed substantial advances in speech-recognition technology, which when combined with the increase in computational power and storage capacity has resulted in a variety of commercial products already or soon to be on the market. In this paper, we review the state-of-the-art in core technology large-vocabulary continuous speech recognition, with a view toward highlighting recent advances. We then highlight issues in moving toward applications, discussing system efficiency, portability across languages and tasks, and enhancing the system output by adding tags and nonlinguistic information. Current performance in speech recognition and outstanding challenges for three classes of applications-dictation, audio indexation, and spoken language dialogue systems-are discussed.
引用
收藏
页码:1181 / 1200
页数:20
相关论文
共 50 条
  • [1] Large-Vocabulary Continuous Speech Recognition Systems
    Saon, George
    Chien, Jen-Tzung
    IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 18 - 33
  • [2] Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition
    Van Segbroeck, Maarten
    Van Hamme, Hugo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 123 - 137
  • [3] Large-Vocabulary Continuous Speech Recognition of Lhasa Tibetan
    Li, Guanyu
    Yu, Hongzhi
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 802 - 806
  • [5] A large-vocabulary continuous speech recognition system for Hindi
    Kumar, M
    Rajput, N
    Verma, A
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2004, 48 (5-6) : 703 - 715
  • [6] Combining spectral representations for large-vocabulary continuous speech recognition
    Garau, Giulia
    Renals, Steve
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 508 - 518
  • [7] Acoustic models of the elderly for large-vocabulary continuous speech recognition
    Baba, A
    Yoshizawa, S
    Yamada, M
    Lee, A
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (07): : 49 - 57
  • [9] Large-vocabulary speech recognition algorithms
    Padmanabhan, M
    Picheny, M
    COMPUTER, 2002, 35 (04) : 42 - +
  • [10] SPEECH RECOGNITION FOR LARGE-VOCABULARY SYSTEMS
    JACOB, B
    ANDREOBRECHT, R
    JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 489 - 492