Training Set Similarity Based Parameter Selection for Statistical Machine Translation

Times cited: 0
Authors
Shi, Xuewen [1 ]
Huang, Heyan [1 ]
Jian, Ping [1 ]
Tang, Yi-Kun [1 ]
Affiliation
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing Engn Res Ctr High Volume Language Informa, Beijing 100081, Peoples R China
Source
WEB AND BIG DATA (APWEB-WAIM 2018), PT I | 2018 / Vol. 10987
Funding
National Natural Science Foundation of China;
Keywords
Statistical machine translation; Log-linear model; Parameter selection;
DOI
10.1007/978-3-319-96890-2_6
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Statistical machine translation (SMT) systems based on log-linear models are usually composed of multiple feature functions, each of which is assigned a weight as a model parameter. In this paper, we argue that different input source sentences may call for different model parameters. To adapt the model to different inputs, we propose a model parameter selection method for log-linear SMT systems. The method relies mainly on the characteristics of the feature functions themselves and makes no assumptions about unseen test sets. Experimental results on two language pairs (Zh-En and Ug-Zh) show that our method yields improvements of up to 2.4 and 2.2 BLEU points, respectively, and that the proposed method offers good interpretability.
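As a rough illustration of why the choice of weights matters in a log-linear model, the sketch below scores candidate translations as a weighted sum of feature values and shows that two weight vectors can prefer different candidates. It is only a minimal sketch of generic log-linear scoring, not the selection method proposed in the paper; the feature names (`tm`, `lm`), the feature values, and both weight vectors are hypothetical.

```python
from typing import Dict, List

Features = Dict[str, float]


def loglinear_score(features: Features, weights: Dict[str, float]) -> float:
    """Weighted sum of feature-function values: sum_i lambda_i * h_i."""
    return sum(weights[name] * value for name, value in features.items())


def best_candidate(candidates: List[dict], weights: Dict[str, float]) -> str:
    """Return the text of the candidate with the highest log-linear score."""
    top = max(candidates, key=lambda c: loglinear_score(c["features"], weights))
    return top["text"]


# Hypothetical feature values (log-probabilities) for two candidates:
# "tm" stands for a translation-model feature, "lm" for a language-model feature.
candidates = [
    {"text": "candidate A", "features": {"tm": -2.0, "lm": -6.0}},
    {"text": "candidate B", "features": {"tm": -4.0, "lm": -3.0}},
]

# Two hypothetical weight vectors, e.g. one per group of input sentences.
weights_tm_heavy = {"tm": 1.0, "lm": 0.2}
weights_lm_heavy = {"tm": 0.2, "lm": 1.0}

choice_tm = best_candidate(candidates, weights_tm_heavy)
choice_lm = best_candidate(candidates, weights_lm_heavy)
```

With these made-up numbers the first weight vector favours candidate A and the second favours candidate B, which is the kind of input-dependent behaviour a per-sentence parameter selection scheme would exploit.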
Pages: 63-71
Page count: 9
Related papers
50 items in total
  • [21] A Sense-Based Translation Model for Statistical Machine Translation
    Xiong, Deyi
    Zhang, Min
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1459 - 1469
  • [22] DOCUMENT TRANSLATION RETRIEVAL BASED ON STATISTICAL MACHINE TRANSLATION TECHNIQUES
    Sanchez-Martinez, Felipe
    Carrasco, Rafael C.
    APPLIED ARTIFICIAL INTELLIGENCE, 2011, 25 (05) : 329 - 340
  • [23] Syntax-Based Statistical Machine Translation
    Hadiwinoto, Christian
    COMPUTATIONAL LINGUISTICS, 2017, 43 (04) : 893 - 896
  • [24] Statistical machine translation decoder based on phrase
    ATR Spoken Language Translation Research Laboratories, 2-2-2 Hikaridai Seika-cho, Soraku-gun, Kyoto
    619-0288, Japan
    Unknown
    606-8501, Japan
    Int. Conf. Spok. Lang. Process., ICSLP, (1889-1892):
  • [25] Phrase-based statistical machine translation
    Zens, R
    Och, FJ
    Ney, H
    KI2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2479 : 18 - 32
  • [26] The Training Set Selection Methods of microRNA Precursors Prediction Based on Machine Learning Approaches
    Liu Wenyuan
    Ma Jing
    Wang Changwu
    Wang Baowen
    Li Yongqiang
    2013 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2013, : 1566 - 1569
  • [27] MACHINE TRANSLATION: A CRITICAL LOOK AT THE PERFORMANCE OF RULE-BASED AND STATISTICAL MACHINE TRANSLATION
    Banitz, Brita
    CADERNOS DE TRADUCAO, 2020, 40 (01): : 54 - 71
  • [28] Statistical machine translation
    Sanchez-Martinez, Felipe
    Antonio Perez-Ortiz, Juan
    MACHINE TRANSLATION, 2010, 24 (3-4) : 273 - 278
  • [29] Statistical Machine Translation
    Vandeghinste, Vincent
    Van Eynde, Frank
    TARGET-INTERNATIONAL JOURNAL OF TRANSLATION STUDIES, 2012, 24 (01) : 157 - 159
  • [30] Statistical Machine Translation
    Vatsa, Mukesh G. S.
    Joshi, Nikita
    Goswami, Sumit
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2010, 30 (04): : 25 - 32