THE IBM 2009 GALE ARABIC SPEECH TRANSCRIPTION SYSTEM

Cited by: 0
Authors
Kingsbury, Brian [1]
Soltau, Hagen [1]
Saon, George [1]
Chu, Stephen [1]
Kuo, Hong-Kwang [1]
Mangu, Lidia [1]
Ravuri, Suman
Morgan, Nelson
Janin, Adam
Affiliations
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
large vocabulary speech recognition
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We describe the Arabic broadcast transcription system fielded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements to context-dependent modeling in vowelized Arabic acoustic models; the use of neural-network features provided by the International Computer Science Institute; Model M language models; a neural network language model that uses syntactic and morphological features; and improvements to our system combination strategy. These advances were instrumental in achieving a word error rate of 8.9% on the Phase 4 evaluation set, and an absolute improvement of 1.6% word error rate over our 2008 system on the unsequestered Phase 3.5 evaluation data.
Pages: 4672-4675
Page count: 4
Related papers
50 in total
  • [1] THE IBM 2008 GALE ARABIC SPEECH TRANSCRIPTION SYSTEM
    Saon, George
    Soltau, Hagen
    Chaudhari, Upendra
    Chu, Stephen
    Kingsbury, Brian
    Kuo, Hong-Kwang
    Mangu, Lidia
    Povey, Daniel
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4378 - 4381
  • [2] Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program
    Soltau, Hagen
    Saon, George
    Kingsbury, Brian
    Kuo, Hong-Kwang Jeff
    Mangu, Lidia
    Povey, Daniel
    Emami, Ahmad
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 884 - 894
  • [3] THE 2009 IBM GALE MANDARIN BROADCAST TRANSCRIPTION SYSTEM
    Chu, Stephen M.
    Povey, Daniel
    Kuo, Hong-Kwang
    Mangu, Lidia
    Zhang, Shilei
    Shi, Qin
    Qin, Yong
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4374 - 4377
  • [4] The IBM 2006 GALE Arabic ASR system
    Soltau, Hagen
    Saon, George
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 349 - +
  • [5] Recent advances in the IBM GALE Mandarin transcription system
    Chu, Stephen M.
    Kuo, Hong-Kwang
    Mangu, Lidia
    Liu, Ji
    Qin, Yong
    Shi, Qin
    Zhang, Shi Lei
    Aronowitz, Hagai
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4329 - 4332
  • [6] Advances in Mandarin broadcast speech transcription at IBM under the DARPA GALE program
    Qin, Yong
    Shi, Qin
    Liu, Yi Y.
    Aronowitz, Hagai
    Chu, Stephen M.
    Kuo, Hong-Kwang
    Zweig, Geoffrey
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 410 - +
  • [7] The IBM BOLT Speech Transcription System
    Thomas, Samuel
    Saon, George
    Kuo, Hong-Kwang
    Mangu, Lidia
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3150 - 3153
  • [8] The IBM Mandarin broadcast speech transcription system
    Chu, Stephen M.
    Kuo, Hong-kwang
    Liu, Yi Y.
    Qin, Yong
    Shi, Qin
    Zweig, Geoffrey
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 345 - +
  • [9] The IBM 2007 speech transcription system for European parliamentary speeches
    Ramabhadran, Bhuvana
    Siohan, Olivier
    Sethy, Abhinav
    [J]. 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 472 - +
  • [10] The IBM 2006 Speech Transcription System for European Parliamentary Speeches
    Ramabhadran, B.
    Siohan, O.
    Mangu, L.
    Zweig, G.
    Westphal, M.
    Schulz, H.
    Soneiro, A.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1225 - +