Recent improvements of the RWTH large vocabulary speech recognition system on spontaneous speech

Cited by: 0

Authors:

Sixtus, A [1]

Molau, S [1]

Kanthak, S [1]

Schlüter, R [1]

Ney, H [1]

Affiliation:

[1] RWTH Aachen Univ Technol, Lehrstuhl Informat VI, D-52056 Aachen, Germany

Source:

2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI | 2000

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper presents recent improvements of the RWTH large vocabulary continuous speech recognition (LVCSR) system. In particular, we report on the integration of across-word models into the first recognition pass and describe better algorithms for fast vocal tract normalization (VTN). We focus both on improvements in word error rate and on speeding up the recognizer with only minimal loss in recognition accuracy. Implementation details and experimental results are given for the VerbMobil task, a German spontaneous speech corpus. The 25.0% word error rate (WER) of our within-word baseline system was reduced to 21.4% with VTN and across-word models. Decreasing the real-time factor (RTF) by up to 85% resulted in only a small degradation in recognition performance of 2% relative on average.
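
The abstract mixes absolute and relative figures, so it can help to put the two WER values on the same footing. The following is a minimal sketch that computes the absolute and relative WER reduction from the numbers quoted above; the function name `relative_reduction` is a hypothetical helper introduced here for illustration and does not come from the paper.

```python
def relative_reduction(baseline: float, improved: float) -> float:
    """Return the reduction from baseline to improved as a fraction of baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (baseline - improved) / baseline


# WER figures quoted in the abstract, in percent.
baseline_wer = 25.0   # within-word baseline system
improved_wer = 21.4   # with VTN and across-word models

absolute_gain = baseline_wer - improved_wer
relative_gain = relative_reduction(baseline_wer, improved_wer)

print(f"absolute WER reduction: {absolute_gain:.1f} points")
print(f"relative WER reduction: {relative_gain:.1%}")
```

Under these figures the reduction is 3.6 points absolute, or roughly 14.4% relative to the baseline.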

Pages: 1671-1674

Page count: 4
