Recent improvements of the RWTH large vocabulary speech recognition system on spontaneous speech

Cited by: 0
Authors
Sixtus, A [1 ]
Molau, S [1 ]
Kanthak, S [1 ]
Schlüter, R [1 ]
Ney, H [1 ]
Affiliation
[1] RWTH Aachen Univ Technol, Lehrstuhl Informat VI, D-52056 Aachen, Germany
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
This paper presents recent improvements to the RWTH large vocabulary continuous speech recognition (LVCSR) system. In particular, we report on the integration of across-word models into the first recognition pass and describe improved algorithms for fast vocal tract normalization (VTN). We focus both on reducing the word error rate (WER) and on speeding up the recognizer with only minimal loss in recognition accuracy. Implementation details and experimental results are given for the VerbMobil task, a German spontaneous speech corpus. The 25.0% WER of our within-word baseline system was reduced to 21.4% with VTN and across-word models. Decreasing the real-time factor (RTF) by up to 85% resulted in only a small degradation in recognition performance of 2% relative on average.
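The abstract quotes absolute WER figures but a relative figure for the speed/accuracy trade-off, so it can help to put the WER improvement on the same relative scale. The sketch below is illustrative only: the function names are hypothetical and not taken from the paper, and it assumes the usual definitions (relative reduction as the change divided by the baseline, and RTF as processing time divided by audio duration).

```python
def relative_reduction(baseline: float, improved: float) -> float:
    """Relative reduction of `improved` with respect to `baseline`, in percent."""
    return 100.0 * (baseline - improved) / baseline


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF under the usual definition: processing time per second of audio."""
    return processing_seconds / audio_seconds


# WER figures quoted in the abstract: 25.0% baseline, 21.4% with VTN and
# across-word models.
wer_gain = relative_reduction(25.0, 21.4)
print(f"relative WER reduction: {wer_gain:.1f}%")  # prints 14.4%
```

On these definitions, the reported drop from 25.0% to 21.4% WER is a 3.6-point absolute and roughly 14.4% relative reduction.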
Pages: 1671 - 1674
Number of pages: 4