Context Sensitive Word Deletion Model for Statistical Machine Translation

被引:0
|
作者
Li, Qiang [1 ]
Han, Yaqian [1 ]
Xiao, Tong [1 ]
Zhu, Jingbo [1 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, NiuTrans Lab, Shenyang, Liaoning, Peoples R China
基金
美国国家科学基金会;
关键词
Natural language processing; Statistical machine translation; Word deletion; ALIGNMENT;
D O I
10.1007/978-3-319-69005-6_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word deletion (WD) errors can lead to poor comprehension of the meaning of source translated sentences in phrase-based statistical machine translation (SMT), and have a critical impact on the adequacy of the translation results generated by SMT systems. In this paper, first we classify the word deletion into two categories, wanted and unwanted word deletions. For these two kinds of word deletions, we propose a maximum entropy based word deletion model to improve the translation quality in phrase-based SMT. Our proposed model are based on features automatically learned from a real-word bitext. In our experiments on Chinese-to-English news and web translation tasks, the results show that our approach is capable of generating more adequate translations compared with the baseline system, and our proposed word deletion model yields a +0.99 BLEU improvement and a -2.20 TER reduction on the NIST machine translation evaluation corpora.
引用
收藏
页码:73 / 84
页数:12
相关论文
共 50 条
  • [1] Better Addressing Word Deletion for Statistical Machine Translation
    Li, Qiang
    Zhang, Dongdong
    Li, Mu
    Xiao, Tong
    Zhu, Jingbo
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 91 - 102
  • [2] Using a Bilingual Context in Word-Based Statistical Machine Translation
    Schmidt, Christoph
    Vilar, David
    Ney, Herrnann
    [J]. PATTERN RECOGNITION IN INFORMATION SYSTEMS, PROCEEDINGS, 2008, : 144 - 153
  • [3] A Context-Aware Topic Model for Statistical Machine Translation
    Su, Jinsong
    Xiong, Deyi
    Liu, Yang
    Han, Xianpei
    Lin, Hongyu
    Yao, Junfeng
    Zhang, Min
    [J]. PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 229 - 238
  • [4] Incorporating Statistical Machine Translation Word Knowledge Into Neural Machine Translation
    Wang, Xing
    Tu, Zhaopeng
    Zhang, Min
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (12) : 2255 - 2266
  • [5] Generation of word graphs in statistical machine translation
    Ueffing, N
    Och, FJ
    Ney, H
    [J]. PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 156 - 163
  • [6] Grammatical and context-sensitive error correction using a statistical machine translation framework
    Ehsan, Nava
    Faili, Heshaam
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2013, 43 (02): : 187 - 206
  • [7] Context Dependent Word Modeling for Statistical Machine Translation Using Part-of-Speech Tags
    Sarikaya, Ruhi
    Deng, Yonggang
    Gao, Yuqing
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2201 - 2204
  • [8] Modeling Indicative Context for Statistical Machine Translation
    Wu, Shuangzhi
    Zhang, Dongdong
    Liu, Shujie
    Zhou, Ming
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 224 - 232
  • [9] A Novel Word Reordering Method for Statistical Machine Translation
    Zang, Shuo
    Zhao, Hai
    Wu, Chunyang
    Wang, Rui
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 843 - 848
  • [10] Measuring word alignment quality for statistical machine translation
    Fraser, Alexander
    Marcu, Daniel
    [J]. COMPUTATIONAL LINGUISTICS, 2007, 33 (03) : 293 - 303