THE IBM 2009 GALE ARABIC SPEECH TRANSCRIPTION SYSTEM

被引:0
|
作者
Kingsbury, Brian [1 ]
Soltau, Hagen [1 ]
Saon, George [1 ]
Chu, Stephen [1 ]
Kuo, Hong-Kwang [1 ]
Mangu, Lidia [1 ]
Ravuri, Suman
Morgan, Nelson
Janin, Adam
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
large vocabulary speech recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We describe the Arabic broadcast transcription system fielded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements to context-dependent modeling in vowelized Arabic acoustic models; the use of neural-network features provided by the International Computer Science Institute; Model M language models; a neural network language model that uses syntactic and morphological features; and improvements to our system combination strategy. These advances were instrumental in achieving a word error rate of 8.9% on the Phase 4 evaluation set, and an absolute improvement of 1.6% word error rate over our 2008 system on the unsequestered Phase 3.5 evaluation data.
引用
收藏
页码:4672 / 4675
页数:4
相关论文
共 50 条
  • [41] Text-to-speech synthesis system with Arabic diacritic recognition system
    Rebai, Ilyes
    BenAyed, Yassine
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 34 (01): : 43 - 60
  • [42] Developing high performance ASR in the IBM multilingual speech-to-speech translation system
    Cui, Xiaodong
    Gu, Liang
    Xiang, Bing
    Zhang, Wei
    Gao, Yuqing
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 5121 - 5124
  • [43] The IBM Speech Activity Detection System for the DARPA RATS Program
    Saon, George
    Thomas, Samuel
    Soltau, Hagen
    Ganapathy, Sriram
    Kingsbury, Brian
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3464 - 3468
  • [44] Recent Advances of IBM's Handheld Speech Translation System
    Zhu, Weizhong
    Zhou, Bowen
    Prosser, Charles
    Krbec, Pavel
    Gao, Yuqing
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1181 - 1184
  • [45] The IBM 2015 English Conversational Telephone Speech Recognition System
    Saon, George
    Kuo, Hong-Kwang J.
    Rennie, Steven
    Picheny, Michael
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3140 - 3144
  • [46] ESTIMATION OF PROBABILITIES IN THE LANGUAGE MODEL OF THE IBM SPEECH RECOGNITION SYSTEM
    NADAS, A
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (04): : 859 - 861
  • [47] The IBM 2016 English Conversational Telephone Speech Recognition System
    Saon, George
    Sercu, Tom
    Rennie, Steven
    Kuo, Hong-Kwang J.
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 7 - 11
  • [48] A robust speech disorders correction system for Arabic language using visual speech recognition
    Farag, Ahmed
    El Adawy, Mohamed
    Ismail, Ahmed
    [J]. BIOMEDICAL RESEARCH-INDIA, 2013, 24 (02): : 185 - 192
  • [49] An Avatar Based Translation System from Arabic Speech to Arabic Sign Language for Deaf People
    Halawani, Sami M.
    Daman, Daut
    Kari, Sarudin
    Ahmad, Ab Rahman
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (12): : 43 - 52
  • [50] Efficient and Robust Arabic Automotive Speech Command Recognition System
    Ouali, Soufiyan
    El Garouani, Said
    [J]. ALGORITHMS, 2024, 17 (09)