Improving Adversarial Neural Machine Translation for Morphologically Rich Language

被引:9
|
作者
Mi, Chenggang [1 ]
Xie, Lei [1 ]
Zhang, Yanning [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Neural machine translation (NMT); morphologically rich language; adversarial training; morphological word embedding; multiple references;
D O I
10.1109/TETCI.2019.2960546
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generative adversarial networks (GAN) have great successes on natural language processing (NLP) and neural machine translation (NMT). However, the existing discriminator in GAN for NMT only combines two words as one query to train the translation models, which restrict the discriminator to be more meaningful and fail to apply rich monolingual information. Recent studies only consider one single reference translation during model training, this limit the GAN model to learn sufficient information about the representation of source sentence. These situations are even worse when languages are morphologically rich. In this article, an extended version of GAN model for neural machine translation is proposed to optimize the performance of morphologically rich language translation. In particular, we use the morphological word embedding instead of word embedding as input in GAN model to enrich the representation of words and overcome the data sparsity problem during model training. Moreover, multiple references are integrated into discriminator to make the model consider more context information and adapt to the diversity of different languages. Experimental results on German <-> English, French <-> English, Czech <-> English, Finnish <-> English, Turkish <-> English, Chinese <-> English, Finnish <-> Turkish and Turkish <-> Czech translation tasks demonstrate that our method achieves significant improvements over baseline systems.
引用
收藏
页码:417 / 426
页数:10
相关论文
共 50 条
  • [1] IMPROVING ADVERSARIAL NEURAL MACHINE TRANSLATION WITH PRIOR KNOWLEDGE
    Yang, Yating
    Li, Xiao
    Jiang, Tonghai
    Kong, Jinying
    Ma, Bo
    Zhou, Xi
    Wang, Lei
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 1373 - 1377
  • [2] Compositional Representation of Morphologically-Rich Input for Neural Machine Translation
    Ataman, Duygu
    Federico, Marcello
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 305 - 311
  • [3] Addressing data sparsity for neural machine translation between morphologically rich languages
    Garcia-Martinez, Mercedes
    Aransa, Walid
    Bougares, Fethi
    Barrault, Loic
    [J]. MACHINE TRANSLATION, 2020, 34 (01) : 1 - 20
  • [4] Statistical machine translation into a morphologically complex language
    Oflazer, Kemal
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 376 - 387
  • [5] Improved Unsupervised Neural Machine Translation with Semantically Weighted Back Translation for Morphologically Rich and Low Resource Languages
    Chauhan, Shweta
    Saxena, Shefali
    Daniel, Philemon
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1707 - 1726
  • [6] Improved Unsupervised Neural Machine Translation with Semantically Weighted Back Translation for Morphologically Rich and Low Resource Languages
    Shweta Chauhan
    Shefali Saxena
    Philemon Daniel
    [J]. Neural Processing Letters, 2022, 54 : 1707 - 1726
  • [7] Machine Translation Evaluation: Unveiling the Role of Dense Sentence Vector Embedding for Morphologically Rich Language
    Tripathi, Samiksha
    Kansal, Vineet
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (01)
  • [8] Improving English-to-Indian Language Neural Machine Translation Systems
    Kandimalla, Akshara
    Lohar, Pintu
    Maji, Souvik Kumar
    Way, Andy
    [J]. INFORMATION, 2022, 13 (05)
  • [9] Crafting Adversarial Examples for Neural Machine Translation
    Zhang, Xinze
    Zhang, Junzhe
    Chen, Zhenhua
    He, Kun
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1967 - 1977
  • [10] Manifold Adversarial Augmentation for Neural Machine Translation
    Chen, Guandan
    Fan, Kai
    Zhang, Kaibo
    Chen, Boxing
    Huang, Zhongqiang
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3184 - 3189