A Study of Morphological Robustness of Neural Machine Translation

被引:0
|
作者
Jayanthi, Sai Muralidhar [1 ]
Pratapa, Adithya [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we analyze the robustness of neural machine translation systems towards grammatical perturbations in the source. In particular, we focus on morphological inflection related perturbations. While this has been recently studied for English -> French translation (MORPHEUS) (Tan et al., 2020), it is unclear how this extends to Any -> English translation systems. We propose MORPHEUS-MULTILINGUAL that utilizes UniMorph dictionaries to identify morphological perturbations to source that adversely affect the translation models. Along with an analysis of stateof-the-art pretrained MT systems, we train and analyze systems for 11 language pairs using the multilingual TED corpus (Qi et al., 2018). We also compare this to actual errors of non-native speakers using Grammatical Error Correction datasets. Finally, we present a qualitative and quantitative analysis of the robustness of Any!English translation systems. Code for our work is publicly available.
引用
收藏
页码:49 / 59
页数:11
相关论文
共 50 条
  • [1] SENTENCE BOUNDARY AUGMENTATION FOR NEURAL MACHINE TRANSLATION ROBUSTNESS
    Li, Daniel
    Te, I
    Arivazhagan, Naveen
    Cherry, Colin
    Padfield, Dirk
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7553 - 7557
  • [2] Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences
    Wang, Chaojun
    Liu, Yang
    Lam, Wai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9425 - 9439
  • [3] Is Robustness Transferable across Languages in Multilingual Neural Machine Translation?
    Pan, Leiyu
    Supryadi
    Xiong, Deyi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14114 - 14125
  • [4] Understanding and Improving the Robustness of Terminology Constraints in Neural Machine Translation
    Zhang, Huaao
    Wang, Qiang
    Qin, Bo
    Shi, Zelin
    Wang, Haibo
    Chen, Ming
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 6029 - 6042
  • [5] Improving Robustness of Neural Machine Translation with Multi-task Learning
    Zhou, Shuyan
    Zeng, Xiangkai
    Zhou, Yingqi
    Anastasopoulos, Antonios
    Neubig, Graham
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 565 - 571
  • [6] Morphological segmentation method for Turkic language neural machine translation
    Tukeyev, U.
    Karibayeva, A.
    Zhumanov, Z. h
    COGENT ENGINEERING, 2020, 7 (01):
  • [7] NICT's Supervised Neural Machine Translation Systems for the WMT19 Translation Robustness Task
    Dabre, Raj
    Sumita, Eiichiro
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 533 - 536
  • [8] Improving Mongolian-Chinese Neural Machine Translation with Morphological Noise
    Ji, Yatu
    Hou, Hongxu
    Wu, Nier
    Chen, Junjie
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019:): STUDENT RESEARCH WORKSHOP, 2019, : 123 - 135
  • [9] A Study of Reinforcement Learning for Neural Machine Translation
    Wu, Lijun
    Tian, Fei
    Qin, Tao
    Lai, Jianhuang
    Liu, Tie-Yan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3612 - 3621
  • [10] Morphological mismatches in machine translation
    Mel'cuk, Igor
    Wanner, Leo
    MACHINE TRANSLATION, 2008, 22 (03) : 101 - 152