Analysing terminology translation errors in statistical and neural machine translation

Cited by: 7
Authors
Haque, Rejwanul [1 ]
Hasanuzzaman, Mohammed [1 ]
Way, Andy [1 ]
Affiliations
[1] Dublin City Univ, ADAPT Ctr, Sch Comp, Dublin, Ireland
Funding
Science Foundation Ireland
Keywords
Terminology translation; Machine translation; Phrase-based statistical machine translation; Neural machine translation; QUALITY;
DOI
10.1007/s10590-020-09251-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Terminology translation plays a critical role in domain-specific machine translation (MT). Phrase-based statistical MT (PB-SMT) has been the dominant approach to MT for the past 30 years, both in academia and industry. Neural MT (NMT), an end-to-end learning approach to MT, is steadily taking the place of PB-SMT. In this paper, we conduct comparative qualitative evaluation and comprehensive error analysis on terminology translation in PB-SMT and NMT in two translation directions: English-to-Hindi and Hindi-to-English. To the best of our knowledge, there is no gold standard available for evaluating terminology translation quality in MT. For this reason we select an evaluation test set from a legal domain corpus and create a gold standard for evaluating terminology translation in MT. We also propose an error typology taking the terminology translation errors in MT into consideration. We translate sentences of the test set with our MT systems and terminology translations are manually classified as per the error typology. We evaluate the MT system's performance on terminology translation, and demonstrate our findings, unraveling strengths, weaknesses, and similarities of PB-SMT and NMT in the area of term translation.
Pages: 149-195
Number of pages: 47