Improvement of Machine Translation Evaluation by Simple Linguistically Motivated Features

Authors
Mu-Yun Yang
Shu-Qi Sun
Jun-Guo Zhu
Sheng Li
Tie-Jun Zhao
Xiao-Ning Zhu
Affiliation
[1] Harbin Institute of Technology, School of Computer Science and Technology
Keywords
machine translation; automatic evaluation; regression SVM (support vector machine); linguistic feature
Abstract
Adopting the regression SVM framework, this paper proposes a linguistically motivated feature engineering strategy for developing an MT evaluation metric that correlates better with human assessments. In contrast to the current practice of “greedy” combination of all available features, six features are proposed according to human intuition about translation quality. The contribution of the linguistic features is then examined and analyzed via a hill-climbing strategy. Experiments indicate that, compared with either the SVM-ranking model or previous attempts at exhaustive linguistic features, the regression SVM model with six linguistically informed features generalizes better across different datasets, and that augmenting these linguistic features with suitable non-linguistic metrics yields additional improvements.
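
The hill-climbing analysis mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the six features, so `feat_a`, `feat_b`, and `feat_c` are hypothetical placeholders, the data are toy values, and the scoring function simply averages the selected feature columns and measures Pearson correlation with the human scores, as a stand-in for training a regression SVM and evaluating it on held-out data.

```python
import math
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either input is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def hill_climb(candidates, score):
    """Greedy forward selection: add the feature that raises the score
    the most, and stop as soon as no remaining feature improves it."""
    selected, best = [], float("-inf")
    remaining = list(candidates)
    while remaining:
        top_score, top_feat = max(
            ((score(selected + [f]), f) for f in remaining),
            key=lambda pair: pair[0],
        )
        if top_score <= best:
            break
        selected.append(top_feat)
        remaining.remove(top_feat)
        best = top_score
    return selected, best


# Toy data (hypothetical): one value per translated segment.
human = [1.0, 2.0, 3.0, 4.0]
features = {
    "feat_a": [1.0, 2.0, 2.0, 4.0],
    "feat_b": [0.0, 0.0, 1.0, 0.0],
    "feat_c": [4.0, 3.0, 2.0, 1.0],
}


def score(names):
    # Stand-in for "train a regression SVM on these features and
    # correlate its predictions with human judgments".
    combined = [mean(features[n][i] for n in names) for i in range(len(human))]
    return pearson(combined, human)


selected, best = hill_climb(features, score)
print(selected, best)
```

In a real setting the `score` callable would wrap model training and evaluation; the selection loop itself is independent of that choice.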
Pages: 57-67
Page count: 10
Source
Journal of Computer Science & Technology, 2011, 26 (01) : 57 - 67