The ESAT 2008 System for N-Best Dutch Speech Recognition Benchmark

Cited by: 12
Authors
Demuynck, Kris [1 ]
Puurula, Antti [1 ]
Van Compernolle, Dirk [1 ]
Wambacq, Patrick [1 ]
Affiliations
[1] Katholieke Univ Leuven, Dept Elect Engn, B-3001 Louvain, Belgium
DOI
10.1109/ASRU.2009.5373311
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes the ESAT 2008 Broadcast News transcription system for the N-Best 2008 benchmark, developed in part for testing the recent SPRAAK Speech Recognition Toolkit. The ESAT system was developed for the Southern Dutch Broadcast News subtask of N-Best using standard methods of modern speech recognition. A combination of improvements was made in commonly overlooked areas such as text normalization, pronunciation modeling, lexicon selection, and morphological modeling, virtually solving the out-of-vocabulary (OOV) problem for Dutch by reducing the OOV rate to 0.06% on the N-Best development data and 0.23% on the evaluation data. Recognition experiments were run with several configurations comparing one-pass vs. two-pass decoding, high-order vs. low-order n-gram models, lexicon sizes, and different types of morphological modeling. The system achieved a 7.23% word error rate (WER) on the broadcast news development data and 20.3% on the much more difficult evaluation data of N-Best.
Pages: 339-344
Page count: 6