Investigating Morphological Decomposition for Transcription of Arabic Broadcast News and Broadcast Conversation Data

Cited by: 0
Authors
Lamel, Lori [1]
Messaoudi, Abdel [1]
Gauvain, Jean-Luc [1]
Affiliation
[1] LIMSI CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
Keywords
Morphological decomposition; Arabic speech recognition;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One of the challenges of Arabic speech recognition is dealing with the huge lexical variety. Morphological decomposition has been proposed to address this problem by increasing lexical coverage, thereby reducing errors that are due to words unknown to the system. In our previous attempts to develop an Arabic speech-to-text (STT) transcription system with morphological decomposition, an increase in word error rate of about 2% absolute was observed relative to a comparable word-based system. Based on an error analysis and a comparison of our approach with that of other sites, two modifications were made. The first was to not decompose the most frequent words. The second was to not decompose the prefix 'Al' for words starting with a solar consonant, because assimilation of the prefix with the following consonant made deletion of the prefix one of the most frequent errors. Comparable recognition performance was achieved using word-based and morphologically decomposed language models, and since the errors made by the systems are different, combining the two gave a performance gain.
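The two modifications described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes words are written in Buckwalter transliteration, handles only the 'Al' prefix (the abstract does not list the full affix set), and uses a hypothetical `Al+` marker for the split-off prefix. The function name `decompose` and the `frequent_words` parameter are likewise illustrative.

```python
# Solar (sun) consonants in Buckwalter transliteration (assumed encoding):
# t, v (th), d, * (dh), r, z, s, $ (sh), S, D, T, Z, l, n
SOLAR_CONSONANTS = set("tvd*rzs$SDTZln")


def decompose(word, frequent_words):
    """Split off the 'Al' prefix unless one of the two exceptions applies.

    Exception 1: the word is among the most frequent words.
    Exception 2: the stem starts with a solar consonant, where the
    prefix assimilates with the following consonant.
    """
    if word in frequent_words:
        return [word]
    if word.startswith("Al") and len(word) > 2:
        if word[2] in SOLAR_CONSONANTS:
            return [word]
        # Hypothetical notation: 'Al+' marks a detached prefix.
        return ["Al+", word[2:]]
    return [word]


if __name__ == "__main__":
    frequent = {"AlEAlm"}  # illustrative frequent-word list, not real data
    for w in ["Al$ms", "Alqmr", "AlEAlm", "ktAb"]:
        print(w, decompose(w, frequent))
```

In this sketch the exceptions are checked before any splitting, so a frequent word stays whole even when its stem begins with a non-solar consonant.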
Pages: 1429-1432
Page count: 4
Related papers
50 in total
  • [1] Morphological decomposition for arabic broadcast news transcription
    Xiang, Bing
    Nguyen, Kham
    Nguyen, Long
    Schwartz, Richard
    Makhoul, John
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1089 - 1092
  • [2] IMPROVED MORPHOLOGICAL DECOMPOSITION FOR ARABIC BROADCAST NEWS TRANSCRIPTION
    Ng, Tim
    Nguyen, Kham
    Zbib, Rabih
    Nguyen, Long
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4309 - +
  • [3] Arabic broadcast news transcription system
    Alghamdi, Mansour
    Elshafei, Moustafa
    Al-Muhtaseb, Husni
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (04) : 183 - 195
  • [4] Advances in arabic broadcast news transcription at RWTH
    Rybach, David
    Hahn, Stefan
    Gollan, Christian
    Schlueter, Ralf
    Ney, Hermann
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 449 - 454
  • [5] Unsupervised training for Mandarin Broadcast News and Conversation transcription
    Wang, L.
    Gales, M. J. F.
    Woodland, P. C.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 353 - +
  • [6] Automatic transcription of Broadcast News data
    Pallett, DS
    Lamel, L
    SPEECH COMMUNICATION, 2002, 37 (1-2) : 1 - 2
  • [7] Broadcast news transcription
    Kubala, F
    Jin, H
    Matsoukas, S
    Nguyen, L
    Schwartz, R
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 203 - 206
  • [8] Transcription of Catalan Broadcast Conversation
    Schulz, Henrik
    Fonollosa, Jose A. R.
    Rybach, David
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 154 - +
  • [9] Japanese broadcast news transcription
    BBN Technologies, 10 Moulton St., Cambridge
    MA
    02138, United States
    Int. Conf. Spok. Lang. Process., ICSLP, (1749-1752):
  • [10] Unsupervised training on a large amount of arabic broadcast news data
    Ma, Jeff
    Matsoukas, Spyros
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 349 - +