Large-vocabulary continuous speech recognition: Advances and applications

被引:31
|
作者
Gauvain, JL [1 ]
Lamel, L [1 ]
机构
[1] CNRS, LIMSI, F-91403 Orsay, France
关键词
acoustic modeling; continuous speech recognition; dictation; large vocabulary; model adaptation; multilinguality; portability; speaker-independent; speech recognition; spoken language system;
D O I
10.1109/5.880079
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The past decade has witnessed substantial advances in speech-recognition technology, which when combined with the increase in computational power and storage capacity has resulted in a variety of commercial products already or soon to be on the market. In this paper, we review the state-of-the-art in core technology large-vocabulary continuous speech recognition, with a view toward highlighting recent advances. We then highlight issues in moving toward applications, discussing system efficiency, portability across languages and tasks, and enhancing the system output by adding tags and nonlinguistic information. Current performance in speech recognition and outstanding challenges for three classes of applications-dictation, audio indexation, and spoken language dialogue systems-are discussed.
引用
收藏
页码:1181 / 1200
页数:20
相关论文
共 50 条
  • [41] A segmental framework for fully-unsupervised large-vocabulary speech recognition
    Kamper, Herman
    Jansen, Aren
    Goldwater, Sharon
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 154 - 174
  • [42] Large-vocabulary speech recognition system for Taiwanese (Min-nan)
    Lyu, Ren-yuan
    Chiang, Yuang-chin
    Hsieh, Wen-ping
    Fang, Ren-zhou
    Chen, Chih-yu
    Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an, 2000, 7 (02): : 123 - 136
  • [43] A VLSI GRAMMAR PROCESSING SUBSYSTEM FOR A REAL-TIME LARGE-VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
    CHEN, DC
    YU, R
    RABAEY, J
    BRODERSEN, RW
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1991, 26 (03) : 443 - 448
  • [44] Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition
    Yu, Wentao
    Zeiler, Steffen
    Kolossa, Dorothea
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 341 - 345
  • [45] Improved phoneme-history-dependent search method for large-vocabulary continuous-speech recognition
    Hori, T
    Noda, Y
    Matsunaga, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06): : 1059 - 1067
  • [46] LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION WITH SEMICONTINUOUS HIDDEN MARKOV-MODELS
    HUANG, XD
    HON, HW
    LEE, KF
    SPEECH AND NATURAL LANGUAGE, 1989, : 276 - 279
  • [47] ADAPTABLE PHONEME-BASED MODELS FOR LARGE-VOCABULARY SPEECH RECOGNITION
    BAMBERG, PG
    MANDEL, MA
    SPEECH COMMUNICATION, 1991, 10 (5-6) : 437 - 451
  • [48] Speech recognition on Mandarin Call Home: A large-vocabulary, conversational, and telephone speech corpus
    Liu, FH
    Picheny, M
    Srinivasa, P
    Monkowski, M
    Chen, JL
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 157 - 160
  • [49] ADVANCES IN LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION IN GREEK: MODELING AND NONLINEAR FEATURES
    Rodomagoulakis, Isidoros
    Potamianos, Gerasimos
    Maragos, Petros
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [50] Issues in large-vocabulary interactive speech systems
    Attwater, DJ
    Whittaker, SJ
    BT TECHNOLOGY JOURNAL, 1996, 14 (01): : 177 - 186