Investigating Morphological Decomposition for Transcription of Arabic Broadcast News and Broadcast Conversation Data

被引:0
|
作者
Lamel, Lori [1 ]
Messaoudi, Abdel. [1 ]
Gauvain, Jean-Luc [1 ]
机构
[1] LIMSI CNRS, Spoken Language Proc Grp, F-91403 Orsay, France
关键词
Morphological decomposition; Arabic speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the challenges of Arabic speech recognition is to deal with the huge lexical variety. Morphological decomposition has been proposed to address this problem by increasing lexical coverage, thereby reducing errors that are due to words that are unknown to the system. In our previous attempts to develop an Arabic speech-to-text (STT) transcription system with morphological decomposition, an increase in word error rate of about 2% absolute was observed relative to a comparable word based system. Based on an error analysis and a comparison of our approach with that of other sites, two modifications were made. The first modification was to not decompose the most frequent words; and the second to not decompose the prefix 'A1' for words starting with a solar consonant since due to assimilation with the following consonant, deletion of the prefix was one of the most frequent errors. Comparable recognition performance was achieved using word-based and morphologically decomposed language models, and since the errors made by the systems are different, combining the two gave a performance gain.
引用
收藏
页码:1429 / 1432
页数:4
相关论文
共 50 条
  • [41] Slovak Broadcast News Speech Recognition and Transcription System
    Lojka, Martin
    Viszlay, Peter
    Stas, Jan
    Hladek, Daniel
    Juhar, Jozef
    ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2018, 2019, 22 : 385 - 394
  • [42] IMPROVING ARABIC BROADCAST TRANSCRIPTION USING AUTOMATIC TOPIC CLUSTERING
    Chu, Stephen M.
    Mangu, Lidia
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4449 - 4452
  • [43] Unsupervised Language Model Adaptation for Mandarin Broadcast Conversation Transcription
    Mrva, David
    Woodland, Philip C.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2210 - 2213
  • [44] Recognition of Visual Arabic Scripting News Ticker From Broadcast Stream
    Tayyab, Moeen
    Hussain, Ayyaz
    Alshara, Mohammed Ali
    Khan, Shakir
    Alotaibi, Reemiah Muneer
    Baig, Abdul Rauf
    IEEE ACCESS, 2022, 10 : 59189 - 59204
  • [45] Improved Acoustic Modeling for Transcribing Arabic Broadcast Data
    Lamel, Lori
    Messaoudi, Abdel.
    Gauvain, Jean-Luc
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1117 - 1120
  • [46] Clustering-based topic identification of transcribed Arabic broadcast news
    Jafar, Ahmed Abdelaziz
    Fakhr, Mohamed Waleed
    Farouk, Mohamed Hesham
    Lecture Notes in Electrical Engineering, 2015, 312 : 253 - 260
  • [47] Visual News Ticker Surveillance Approach from Arabic Broadcast Streams
    Tayyab, Moeen
    Hussain, Ayyaz
    Mir, Usama
    Iqbal, M. Aqeel
    Haneef, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 6177 - 6193
  • [48] From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
    Maamouri, Mohamed
    Bies, Ann
    Kulick, Seth
    Zaghouani, Wajdi
    Graff, David
    Ciul, Michael
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2117 - 2122
  • [49] Czech-to-Slovak Adapted Broadcast News Transcription System
    Nouza, Jan
    Silovsky, Jan
    Zdansky, Jindrich
    Cerva, Petr
    Kroul, Martin
    Chaloupka, Josef
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2683 - 2686
  • [50] Audio Partitioning and Transcription for Broadcast Data Indexation
    J.L. Gauvain
    L. Lamel
    G. Adda
    Multimedia Tools and Applications, 2001, 14 : 187 - 200