A LAYERED APPROACH FOR DUTCH LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

被引:0
|
作者
Pelemans, Joris [1 ]
Demuynck, Kris [1 ]
Wambacq, Patrick [1 ]
机构
[1] Katholieke Univ Leuven, Dept ESAT, B-3001 Louvain, Belgium
关键词
LVCSR; phone lattice decoding; ASR architecture; phone confusion matrix; accented speech;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we investigate whether a layered architecture that has already proven its value for small tasks, works for a system with large lexica (400k words) and language models (5-grams) as well. The architecture was designed to decouple phone and word recognition which allows for the integration of more complex linguistic components, especially at the sub-word level. It was tested on the Dutch language which - with its large variety of accents and rich morphology - is ideally suited to benefit from this integration. The results reveal that the architecture is already competitive to an all-in-one approach in which acoustic models, language models and lexicon are all applied simultaneously. Candidates for further improvement to the system based on a conditional phone confusion model are suggested.
引用
收藏
页码:4421 / 4424
页数:4
相关论文
共 50 条
  • [31] A large vocabulary continuous speech recognition system for Persian language
    Hossein Sameti
    Hadi Veisi
    Mohammad Bahrani
    Bagher Babaali
    Khosro Hosseinzadeh
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011
  • [32] A word graph algorithm for large vocabulary continuous speech recognition
    Ortmanns, S
    Ney, H
    Aubert, X
    [J]. COMPUTER SPEECH AND LANGUAGE, 1997, 11 (01): : 43 - 72
  • [33] Large Vocabulary Continuous Audio-Visual Speech Recognition
    Sterpu, George
    [J]. ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, : 538 - 541
  • [34] On designing pronunciation lexicons for large vocabulary, continuous speech recognition
    Lamel, L
    Adda, G
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 6 - 9
  • [35] An overview of decoding techniques for large vocabulary continuous speech recognition
    Aubert, XL
    [J]. COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 89 - 114
  • [36] Large-Vocabulary Continuous Speech Recognition of Lhasa Tibetan
    Li, Guanyu
    Yu, Hongzhi
    [J]. COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 802 - 806
  • [37] DEEP-FSMN FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
    Zhang, Shiliang
    Lei, Ming
    Yan, Zhijie
    Dai, Lirong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5869 - 5873
  • [38] Large-vocabulary continuous speech recognition: Advances and applications
    Gauvain, JL
    Lamel, L
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1181 - 1200
  • [39] A large-vocabulary continuous speech recognition system for Hindi
    Kumar, M
    Rajput, N
    Verma, A
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2004, 48 (5-6) : 703 - 715
  • [40] Phone deactivation pruning in large vocabulary continuous speech recognition
    Renals, S
    [J]. IEEE SIGNAL PROCESSING LETTERS, 1996, 3 (01) : 4 - 6