Recent improvements of the RWTH large vocabulary speech recognition system on spontaneous speech

被引:0
|
作者
Sixtus, A [1 ]
Molau, S [1 ]
Kanthak, S [1 ]
Schlüter, R [1 ]
Ney, H [1 ]
机构
[1] RWTH Aachen Univ Technol, Lehrstuhl Informat VI, D-52056 Aachen, Germany
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents recent improvements of the RWTH large vocabulary continuous speech recognition system (LSCSR). In particular, we will report on the integration of across-word models into the first recognition pass, and describe better algorithms for fast vocal tract normalization (VTN). We will focus both on the improvements in word error rate and how to speed up the recognizer with only minimal loss in recognition accuracy. Implementation details and experimental results are given for the VerbMobil task, a German spontaneous speech corpus. The 25.0% word error rate (WER) of our within-word baseline system was reduced to 21.4% with VTN and across-word models. Decreasing the real-time factor (RTF) by up to 85% resulted in only a small degradation in recognition performance of 2% relative on average.
引用
收藏
页码:1671 / 1674
页数:4
相关论文
共 50 条
  • [1] The RWTH large vocabulary continuous speech recognition system
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, F
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
  • [2] Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach
    Beyerlein, P
    Aubert, X
    Haeb-Umbach, R
    Harris, M
    Klakow, D
    Wendemuth, A
    Molau, S
    Ney, H
    Pitz, M
    Sixtus, A
    SPEECH COMMUNICATION, 2002, 37 (1-2) : 109 - 131
  • [3] IMPROVEMENTS ON BOTTLENECK FEATURE FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Tuerxun, Maimaitiaili
    Zhang, Shiliang
    Bao, Yebo
    Dai, Lirong
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 516 - 520
  • [4] Recent experiments in large vocabulary conversational speech recognition
    Billa, J.
    Colhurst, T.
    El-Jaroudi, A.
    Iyer, R.
    Ma, K.
    Matsoukas, S.
    Quillen, C.
    Richardson, F.
    Siu, M.
    Zavaliagkos, G.
    Gish, H.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 41 - 44
  • [5] Recent experiments in Large Vocabulary Conversational Speech Recognition
    Billa, J
    Colhurst, T
    El-Jaroudi, A
    Iyer, R
    Ma, K
    Matsoukas, S
    Quillen, C
    Richardson, F
    Siu, M
    Zavaliagkos, G
    Gish, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 41 - 44
  • [6] Recent Developments in Large Vocabulary Continuous Speech Recognition
    Saon, George
    Chien, Jen-Tzung
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [7] Estonian Large Vocabulary Speech Recognition System for Radiology
    Alumaee, Tanel
    Meister, Einar
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, 2010, 219 : 33 - 38
  • [8] The titech large vocabulary WFST speech recognition system
    Dixon, Paul R.
    Caseiro, Diamantino A.
    Oonishi, Tasuku
    Furui, Sadaoki
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 443 - +
  • [9] Chinese speech recognition system with very large vocabulary
    Qin, Y
    Mo, FY
    Li, CL
    Guan, DH
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 817 - 820
  • [10] A Myanmar Large Vocabulary Continuous Speech Recognition System
    Naing, Hay Mar Soe
    Hlaing, Aye Mya
    Pa, Win Pa
    Hu, Xinhui
    Thu, Ye Kyaw
    Hori, Chiori
    Kawai, Hisashi
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 320 - 327