Evaluation of 2-way Iraqi Arabic-English speech translation systems using automated metrics

Cited by: 5
Authors
Condon, Sherri [1]
Arehart, Mark [1]
Parvaz, Dan [2]
Sanders, Gregory [3]
Doran, Christy [4]
Aberdeen, John [4]
Affiliations
[1] MITRE Corp, McLean, VA 22102 USA
[2] MITRE Corp, Orlando, FL USA
[3] National Institute of Standards and Technology, Gaithersburg, MD 20899 USA
[4] MITRE Corp, Bedford, MA 01730 USA
Keywords
Arabic; Machine translation; Evaluation; Automated metrics; Speech translation
DOI
10.1007/s10590-011-9105-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The Defense Advanced Research Projects Agency (DARPA) Spoken Language Communication and Translation System for Tactical Use (TRANSTAC) program (http://1.usa.gov/transtac) faced many challenges in applying automated measures of translation quality to Iraqi Arabic-English speech translation dialogues. Features of speech data in general and of Iraqi Arabic data in particular undermine basic assumptions of automated measures that depend on matching system outputs to reference translations. These features are described along with the challenges they present for evaluating machine translation quality using automated metrics. We show that scores for translation into Iraqi Arabic exhibit higher correlations with human judgments when they are computed from normalized system outputs and reference translations. Orthographic normalization, lexical normalization, and operations involving light stemming resulted in higher correlations with human judgments.
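The effect of normalization on a reference-matching metric can be illustrated with a minimal sketch. It is not the authors' procedure: the diacritic range, the letter mappings, the article-stripping rule, and the helper names `normalize`, `light_stem`, and `unigram_precision` are all assumptions chosen for illustration, using conventions that are common in Arabic text processing. The score is a simple clipped unigram precision standing in for the automated metrics discussed in the abstract.

```python
import re
from collections import Counter

# ASSUMPTION: common Arabic normalization conventions, not the paper's rules.
# Short vowels, tanwin, shadda, sukun (U+064B-U+0652) and tatweel (U+0640).
_DIACRITICS = re.compile("[\u064B-\u0652\u0640]")
_LETTER_MAP = str.maketrans({
    "\u0623": "\u0627",  # alef with hamza above -> bare alef
    "\u0625": "\u0627",  # alef with hamza below -> bare alef
    "\u0622": "\u0627",  # alef with madda -> bare alef
    "\u0649": "\u064A",  # alef maqsura -> ya
})

def normalize(token):
    """Orthographic normalization: drop diacritics, unify letter variants."""
    return _DIACRITICS.sub("", token).translate(_LETTER_MAP)

def light_stem(token):
    """Hypothetical light stemmer: strip the definite article 'al-'
    when at least three letters remain."""
    if token.startswith("\u0627\u0644") and len(token) >= 5:
        return token[2:]
    return token

def unigram_precision(hyp, ref, prep=lambda t: t):
    """Fraction of hypothesis tokens found in the reference (clipped counts),
    after applying `prep` to every token on both sides."""
    hyp_tokens = [prep(t) for t in hyp.split()]
    ref_counts = Counter(prep(t) for t in ref.split())
    matched = 0
    for t in hyp_tokens:
        if ref_counts[t] > 0:
            matched += 1
            ref_counts[t] -= 1
    return matched / len(hyp_tokens) if hyp_tokens else 0.0

# The same word spelled with and without hamza on the initial alef.
hyp = "\u0623\u062D\u0645\u062F"
ref = "\u0627\u062D\u0645\u062F"
raw_score = unigram_precision(hyp, ref)
norm_score = unigram_precision(hyp, ref, prep=normalize)
```

In this toy case the unnormalized comparison finds no match because the two spellings differ in one character, while the normalized comparison matches the token. Whether such matches track human judgments is the empirical question the article addresses; the sketch only shows the mechanism.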
Pages: 159-176
Page count: 18