Large-vocabulary continuous speech recognition: Advances and applications

Cited by: 31

Authors:

Gauvain, JL [1]

Lamel, L [1]

Affiliation:

[1] CNRS, LIMSI, F-91403 Orsay, France

Source:

PROCEEDINGS OF THE IEEE | 2000 / Vol. 88 / No. 08

Keywords:

acoustic modeling; continuous speech recognition; dictation; large vocabulary; model adaptation; multilinguality; portability; speaker-independent; speech recognition; spoken language system;

DOI:

10.1109/5.880079

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

The past decade has witnessed substantial advances in speech-recognition technology, which, when combined with the increase in computational power and storage capacity, has resulted in a variety of commercial products already or soon to be on the market. In this paper, we review the state of the art in core technology in large-vocabulary continuous speech recognition, with a view toward highlighting recent advances. We then highlight issues in moving toward applications, discussing system efficiency, portability across languages and tasks, and enhancing the system output by adding tags and nonlinguistic information. Current performance in speech recognition and outstanding challenges for three classes of applications (dictation, audio indexation, and spoken language dialogue systems) are discussed.


Pages: 1181-1200

Number of pages: 20

50 records in total

[41] A segmental framework for fully-unsupervised large-vocabulary speech recognition
Kamper, Herman
Jansen, Aren
Goldwater, Sharon
COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 154 - 174
[42] Large-vocabulary speech recognition system for Taiwanese (Min-nan)
Lyu, Ren-yuan
Chiang, Yuang-chin
Hsieh, Wen-ping
Fang, Ren-zhou
Chen, Chih-yu
Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an, 2000, 7 (02): 123 - 136
[43] A VLSI GRAMMAR PROCESSING SUBSYSTEM FOR A REAL-TIME LARGE-VOCABULARY CONTINUOUS SPEECH RECOGNITION SYSTEM
CHEN, DC
YU, R
RABAEY, J
BRODERSEN, RW
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1991, 26 (03) : 443 - 448
[44] Multimodal Integration for Large-Vocabulary Audio-Visual Speech Recognition
Yu, Wentao
Zeiler, Steffen
Kolossa, Dorothea
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 341 - 345
[45] Improved phoneme-history-dependent search method for large-vocabulary continuous-speech recognition
Hori, T
Noda, Y
Matsunaga, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06): 1059 - 1067
[46] LARGE-VOCABULARY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNITION WITH SEMICONTINUOUS HIDDEN MARKOV-MODELS
HUANG, XD
HON, HW
LEE, KF
SPEECH AND NATURAL LANGUAGE, 1989, : 276 - 279
[47] ADAPTABLE PHONEME-BASED MODELS FOR LARGE-VOCABULARY SPEECH RECOGNITION
BAMBERG, PG
MANDEL, MA
SPEECH COMMUNICATION, 1991, 10 (5-6) : 437 - 451
[48] Speech recognition on Mandarin Call Home: A large-vocabulary, conversational, and telephone speech corpus
Liu, FH
Picheny, M
Srinivasa, P
Monkowski, M
Chen, JL
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 157 - 160
[49] ADVANCES IN LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION IN GREEK: MODELING AND NONLINEAR FEATURES
Rodomagoulakis, Isidoros
Potamianos, Gerasimos
Maragos, Petros
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[50] Issues in large-vocabulary interactive speech systems
Attwater, DJ
Whittaker, SJ
BT TECHNOLOGY JOURNAL, 1996, 14 (01): 177 - 186
