Evaluation of Context-dependent Phrasal Translation Lexicons for Statistical Machine Translation

Citations: 0
Authors
Carpuat, Marine [1]
Wu, Dekai [1]
Affiliation
[1] Univ Sci & Technol, Dept Comp Sci & Engn, Human Language Technol Ctr, HKUST, Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
H0 [Linguistics];
Subject classification codes
030303; 0501; 050102
Abstract
We present new direct data analysis showing that dynamically built context-dependent phrasal translation lexicons are more useful resources for phrase-based statistical machine translation (SMT) than conventional static phrasal translation lexicons, which ignore all contextual information. After several years of surprising negative results, recent work suggests that context-dependent phrasal translation lexicons are an appropriate framework for successfully incorporating Word Sense Disambiguation (WSD) modeling into SMT. However, this approach has so far been evaluated only with automatic translation quality metrics, which are important but aggregate many different factors. A direct analysis is still needed to understand how context-dependent phrasal translation lexicons affect translation quality, and whether the additional complexity they introduce is really necessary. In this paper, we focus on the impact of context-dependent translation lexicons on lexical choice in phrase-based SMT and show that context-dependent lexicons are more useful to a phrase-based SMT system than a conventional lexicon. With context modeling, a typical phrase-based SMT system makes use of more and longer phrases, including phrases that were not seen very frequently in training. Even when the segmentation is identical, the context-dependent lexicons yield translations that match references more often than conventional lexicons do.
Pages: 3520-3527
Page count: 8