A LAYERED APPROACH FOR DUTCH LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

被引：0

作者：

Pelemans, Joris ^{[1
]}

Demuynck, Kris ^{[1
]}

Wambacq, Patrick ^{[1
]}

机构：

[1] Katholieke Univ Leuven, Dept ESAT, B-3001 Louvain, Belgium

来源：

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年

关键词：

LVCSR; phone lattice decoding; ASR architecture; phone confusion matrix; accented speech;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper we investigate whether a layered architecture that has already proven its value for small tasks, works for a system with large lexica (400k words) and language models (5-grams) as well. The architecture was designed to decouple phone and word recognition which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch language which - with its large variety of accents and rich morphology - is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive to an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement to the system based on a conditional phone confusion model are suggested.

引用

页码：4421 / 4424

页数：4

共 50 条

[1] Reduced semi-continuous models for large vocabulary continuous speech recognition in Dutch
Demuynck, K
Duchateau, J
VanCompernolle, D
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2289 - 2292
[2] A Segmental CRF Approach to Large Vocabulary Continuous Speech Recognition
Zweig, Geoffrey
Nguyen, Patrick
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 152 - 157
[3] Vietnamese Large Vocabulary Continuous Speech Recognition
Ngoc Thang Vu
Schultz, Tanja
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 333 - 338
[4] Advances in large vocabulary continuous speech recognition
Zweig, G
Picheny, M
[J]. ADVANCES IN COMPUTERS, VOL. 60: INFORMATION SECURITY, 2004, 60 : 249 - 291
[5] Large vocabulary continuous speech recognition of Broadcast News - The Philips/RWTH approach
Beyerlein, P
Aubert, X
Haeb-Umbach, R
Harris, M
Klakow, D
Wendemuth, A
Molau, S
Ney, H
Pitz, M
Sixtus, A
[J]. SPEECH COMMUNICATION, 2002, 37 (1-2) : 109 - 131
[6] Developments in large vocabulary, continuous speech recognition of German
AddaDecker, M
Adda, G
Lamel, L
Gauvain, JL
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 153 - 156
[7] Utilizing Lipreading in Large Vocabulary Continuous Speech Recognition
Palecek, Karel
[J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 767 - 776
[8] The RWTH large vocabulary continuous speech recognition system
Ney, H
Welling, L
Ortmanns, S
Beulen, K
Wessel, F
[J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 853 - 856
[9] Combating Reverberation in Large Vocabulary Continuous Speech Recognition
Mitra, Vikramjit
Van Hout, Julien
McLaren, Mitchell
Wang, Wen
Graciarena, Martin
Vergyri, Dimitra
Franco, Horacio
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2449 - 2453
[10] Experimenting with lipreading for large vocabulary continuous speech recognition
Palecek, Karel
[J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2018, 12 (04) : 309 - 318

← 1 2 3 4 5 →