Automatic Machine Translation Evaluation with Part-of-Speech Information

被引:0
|
作者
Han, Aaron L. -F. [1 ]
Wong, Derek F. [1 ]
Chao, Lidia S. [1 ]
He, Liangye [1 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
来源
关键词
Natural language processing; Machine translation evaluation; Part-of-Speech; Reference translation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One problem of automatic translation is the evaluation of the result. The result should be as close to a human reference translation as possible, but varying word order or synonyms have to be taken into account for the evaluation of the similarity of both. In the conventional methods, researchers tend to employ many resources such as the synonyms vocabulary, paraphrasing, and text entailment data, etc. To make the evaluation model both accurate and concise, this paper explores the evaluation only using Part-of-Speech information of the words, which means the method is based only on the consilience of the POS strings of the hypothesis translation and reference. In this developed method, the POS also acts as the similar function with the synonyms in addition to its syntactic or morphological behaviour of the lexical item in question. Measures for the similarity between machine translation and human reference are dependent on the language pair since the word order or the number of synonyms may vary, for instance. This new measure solves this problem to a certain extent by introducing weights to different sources of information. The experiment results on English, German and French languages correlate on average better with the human reference than some existing measures, such as BLEU, AMBER and MP4IBM1.
引用
收藏
页码:121 / 128
页数:8
相关论文
共 50 条
  • [1] Part-of-speech tagging based on machine translation techniques
    Gasco i Mora, Guillem
    Sanchez Peiro, Joan Andreu
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 1, PROCEEDINGS, 2007, 4477 : 257 - +
  • [2] Latent Part-of-Speech Sequences for Neural Machine Translation
    Yang, Xuewen
    Liu, Yingru
    Xie, Dongliang
    Wang, Xin
    Balasubramanian, Niranjan
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 780 - 790
  • [3] Using target-language information to train part-of-speech taggers for machine translation
    Sanchez-Martinez, Felipe
    Antonio Perez-Ortiz, Juan
    Forcada, Mikel L.
    [J]. MACHINE TRANSLATION, 2008, 22 (1-2) : 29 - 66
  • [4] Part-of-speech persistence: The influence of part-of-speech information on lexical processes
    Melinger, Alissa
    Koenig, Jean-Pierre
    [J]. JOURNAL OF MEMORY AND LANGUAGE, 2007, 56 (04) : 472 - 489
  • [5] Part-of-Speech Tags Guide Low-Resource Machine Translation
    Kadeer, Zaokere
    Yi, Nian
    Wumaier, Aishan
    [J]. ELECTRONICS, 2023, 12 (16)
  • [6] Standards for automatic part-of-speech tagging
    Minnaja, DC
    [J]. 15TH INTERNATIONAL CONGRESS ON CYBERNETICS, PROCEEDINGS, 1999, : 745 - 750
  • [7] Exploring the use of target-language information to train the part-of-speech tagger of machine translation-systems
    Sánchez-Martínez, F
    Pérez-Ortiz, JA
    Forcada, ML
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, 2004, 3230 : 137 - 148
  • [8] AUTOMATIC RECOGNITION OF PART-OF-SPEECH FOR ENGLISH TEXTS
    JANAS, JM
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1977, 13 (04) : 205 - 213
  • [9] Context Dependent Word Modeling for Statistical Machine Translation Using Part-of-Speech Tags
    Sarikaya, Ruhi
    Deng, Yonggang
    Gao, Yuqing
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2201 - 2204
  • [10] Evidence for the shared representation of part-of-speech information
    Melinger, A
    Koenig, JP
    [J]. LACUS FORUM XXVI: THE LEXICON, 2000, 26 : 533 - 541