A LAYERED APPROACH FOR DUTCH LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

Cited by: 0
Authors
Pelemans, Joris [1]
Demuynck, Kris [1]
Wambacq, Patrick [1]
Affiliations
[1] Katholieke Univ Leuven, Dept ESAT, B-3001 Louvain, Belgium
Keywords
LVCSR; phone lattice decoding; ASR architecture; phone confusion matrix; accented speech;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this paper we investigate whether a layered architecture that has already proven its value for small tasks also works for a system with large lexica (400k words) and language models (5-grams). The architecture was designed to decouple phone recognition from word recognition, which allows more complex linguistic components to be integrated, especially at the sub-word level. It was tested on Dutch, which, with its large variety of accents and rich morphology, is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive with an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement of the system based on a conditional phone confusion model are suggested.
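The decoupling of phone and word recognition described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: it assumes the first layer has already produced a single best phone string (rather than a phone lattice), and it matches that string against a toy lexicon using a weighted edit distance whose substitution costs come from a small phone confusion table. The lexicon, the phone symbols, the cost values and all function names are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical confusion costs: lower cost means the two phones are more
# easily confused. The values are invented for illustration only.
CONFUSION_COST: Dict[Tuple[str, str], float] = {
    ("s", "z"): 0.3,
    ("z", "s"): 0.3,
    ("f", "v"): 0.3,
    ("v", "f"): 0.3,
}
DEFAULT_SUBSTITUTION_COST = 1.0  # assumed cost for any unlisted phone pair
INSERT_DELETE_COST = 1.0         # assumed cost for a missing or extra phone


def substitution_cost(observed: str, canonical: str) -> float:
    """Cost of recognising `observed` where the lexicon expects `canonical`."""
    if observed == canonical:
        return 0.0
    return CONFUSION_COST.get((observed, canonical), DEFAULT_SUBSTITUTION_COST)


def phone_distance(observed: List[str], canonical: List[str]) -> float:
    """Weighted edit distance between two phone sequences."""
    n, m = len(observed), len(canonical)
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i * INSERT_DELETE_COST
    for j in range(1, m + 1):
        dist[0][j] = j * INSERT_DELETE_COST
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dist[i][j] = min(
                dist[i - 1][j] + INSERT_DELETE_COST,   # extra observed phone
                dist[i][j - 1] + INSERT_DELETE_COST,   # missing canonical phone
                dist[i - 1][j - 1]
                + substitution_cost(observed[i - 1], canonical[j - 1]),
            )
    return dist[n][m]


def best_word(observed: List[str], lexicon: Dict[str, List[str]]) -> str:
    """Second layer: pick the lexicon entry closest to the observed phones."""
    return min(lexicon, key=lambda word: phone_distance(observed, lexicon[word]))


# Toy lexicon with made-up phone transcriptions.
LEXICON: Dict[str, List[str]] = {
    "zes": ["z", "E", "s"],
    "vis": ["v", "I", "s"],
    "zee": ["z", "e:"],
}

if __name__ == "__main__":
    # First-layer output in which the initial /z/ was recognised as /s/.
    print(best_word(["s", "E", "s"], LEXICON))
```

Because the word-level search only sees phone strings and a cost table, the confusion model can be swapped out (for example, for one conditioned on context) without touching the acoustic layer, which is the kind of flexibility the layered design aims for.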
Pages: 4421-4424
Page count: 4
Related papers
50 in total
  • [1] Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch
    Demuynck, K
    Duchateau, J
    VanCompernolle, D
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2289 - 2292
  • [2] A Segmental CRF Approach to Large Vocabulary Continuous Speech Recognition
    Zweig, Geoffrey
    Nguyen, Patrick
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 152 - 157
  • [3] Vietnamese Large Vocabulary Continuous Speech Recognition
    Ngoc Thang Vu
    Schultz, Tanja
    [J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
  • [4] Advances in large vocabulary continuous speech recognition
    Zweig, G
    Picheny, M
    [J]. ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
  • [5] Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach
    Beyerlein, P
    Aubert, X
    Haeb-Umbach, R
    Harris, M
    Klakow, D
    Wendemuth, A
    Molau, S
    Ney, H
    Pitz, M
    Sixtus, A
    [J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 109 - 131
  • [6] Developments in large vocabulary, continuous speech recognition of German
    AddaDecker, M
    Adda, G
    Lamel, L
    Gauvain, JL
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 153 - 156
  • [7] Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition
    Palecek, Karel
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 767 - 776
  • [8] The RWTH large vocabulary continuous speech recognition system
    Ney, H
    Welling, L
    Ortmanns, S
    Beulen, K
    Wessel, F
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
  • [9] Combating Reverberation in Large Vocabulary Continuous Speech Recognition
    Mitra, Vikramjit
    Van Hout, Julien
    McLaren, Mitchell
    Wang, Wen
    Graciarena, Martin
    Vergyri, Dimitra
    Franco, Horacio
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2449 - 2453
  • [10] Experimenting with lipreading for large vocabulary continuous speech recognition
    Palecek, Karel
    [J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2018, 12 (04) : 309 - 318