Evaluation of Context-dependent Phrasal Translation Lexicons for Statistical Machine Translation

被引:0
|
作者
Carpuat, Marine [1 ]
Wu, Dekai [1 ]
机构
[1] Univ Sci & Technol, Dept Comp Sci & Engn, Human Language Technol Ctr, HKUST, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
We present new direct data analysis showing that dynamically-built context-dependent phrasal translation lexicons are more useful resources for phrase-based statistical machine translation (SMT) than conventional static phrasal translation lexicons, which ignore all contextual information. After several years of surprising negative results, recent work suggests that context-dependent phrasal translation lexicons are an appropriate framework to successfully incorporate Word Sense Disambiguation (WSD) modeling into SMT. However, this approach has so far only been evaluated using automatic translation quality metrics, which are important, but aggregate many different factors. A direct analysis is still needed to understand how context-dependent phrasal translation lexicons impact translation quality, and whether the additional complexity they introduce is really necessary. In this paper, we focus on the impact of context-dependent translation lexicons on lexical choice in phrase-based SMT and show that context-dependent lexicons are more useful to a phrase-based SMT system than a conventional lexicon. A typical phrase-based SMT system makes use of more and longer phrases with context modeling, including phrases that were not seen very frequently in training. Even when the segmentation is identical, the context-dependent lexicons yields translations that match references more often than conventional lexicons.
引用
收藏
页码:3520 / 3527
页数:8
相关论文
共 50 条
  • [31] Syntax-Based Context Representation for Statistical Machine Translation
    Chen, Kehai
    Zhao, Tiejun
    Yang, Muyun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (12) : 3226 - 3237
  • [32] Context Dependent Word Modeling for Statistical Machine Translation Using Part-of-Speech Tags
    Sarikaya, Ruhi
    Deng, Yonggang
    Gao, Yuqing
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2201 - 2204
  • [33] Principles of context-based machine translation evaluation
    Hovy, Eduard
    King, Margaret
    Popescu-Belis, Andrei
    Machine Translation, 2002, 17 (01) : 43 - 75
  • [34] A survey of context in neural machine translation and its evaluation
    Castilho, Sheila
    Knowles, Rebecca
    NATURAL LANGUAGE PROCESSING, 2024,
  • [35] Evaluation of Machine Translation Output in Context of Inflectional Languages
    Munkova, Dasa
    Kapusta, Jozef
    Munk, Michal
    Reichel, Jaroslav
    2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 85 - 89
  • [36] Translation Model of Myanmar Phrases for Statistical Machine Translation
    Zin, Thet Thet
    Soe, Khin Mar
    Thein, Ni Lar
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2012, 6839 : 235 - +
  • [37] Automated Translation of Android Context-Dependent Gestures to Visual GUI Test Instructions
    Coppola, Riccardo
    Ardito, Luca
    Torchiano, Marco
    A-TEST '21: PROCEEDINGS OF THE 12TH INTERNATIONAL WORKSHOP ON AUTOMATING TEST CASE DESIGN, SELECTION, AND EVALUATION, 2021, : 17 - 24
  • [38] Evaluation of the Translation of Separable Phrasal Verbs Generated by ChatGPT
    Alosaimi, Basmah Abdulmohsen
    Alawad, Nouf Abdulaziz
    ARAB WORLD ENGLISH JOURNAL, 2024, : 282 - 291
  • [39] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [40] Reduction of Neural Machine Translation Failures by Incorporating Statistical Machine Translation
    Dugonik, Jani
    Maucec, Mirjam Sepesy
    Verber, Domen
    Brest, Janez
    MATHEMATICS, 2023, 11 (11)