Compositional Representation of Morphologically-Rich Input for Neural Machine Translation

被引:0
|
作者
Ataman, Duygu [1 ,2 ]
Federico, Marcello [1 ,3 ]
机构
[1] FBK, Trento, Italy
[2] Univ Trento, Trento, Italy
[3] MMT Srl, Trento, Italy
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural machine translation (NMT) models are typically trained with fixed-size input and output vocabularies, which creates an important bottleneck on their accuracy and generalization capability. As a solution, various studies proposed segmenting words into sub-word units and performing translation at the sub-lexical level. However, statistical word segmentation methods have recently shown to be prone to morphological errors, which can lead to inaccurate translations. In this paper, we propose to overcome this problem by replacing the source-language embedding layer of NMT with a bi-directional recurrent neural network that generates compositional representations of the input at any desired level of granularity. We test our approach in a low-resource setting with five languages from different morphological typologies, and under different composition assumptions. By training NMT to compose word representations from character trigrams, our approach consistently outperforms (from 1.71 to 2.48 BLEU points) NMT learning embeddings of statistically generated sub-word units.
引用
收藏
页码:305 / 311
页数:7
相关论文
共 50 条
  • [1] Improving Adversarial Neural Machine Translation for Morphologically Rich Language
    Mi, Chenggang
    Xie, Lei
    Zhang, Yanning
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (04): : 417 - 426
  • [2] Addressing data sparsity for neural machine translation between morphologically rich languages
    Garcia-Martinez, Mercedes
    Aransa, Walid
    Bougares, Fethi
    Barrault, Loic
    [J]. MACHINE TRANSLATION, 2020, 34 (01) : 1 - 20
  • [3] Towards More Diverse Input Representation for Neural Machine Translation
    Chen, Kehai
    Wang, Rui
    Utiyama, Masao
    Sumita, Eiichiro
    Zhao, Tiejun
    Yang, Muyun
    Zhao, Hai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1586 - 1597
  • [4] Portable Lexical Analysis for Parsing of Morphologically-Rich Languages
    Medved, Marek
    Jakubicek, Milos
    [J]. RASLAN 2013: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2013, : 21 - 26
  • [5] On Compositional Generalization of Neural Machine Translation
    Li, Yafu
    Yin, Yongjing
    Chen, Yulong
    Zhang, Yue
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4767 - 4780
  • [6] AMR Alignment for Morphologically-rich and Pro-drop Languages
    Oral, Elif
    Eryigit, Gulsen
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP, 2022, : 143 - 152
  • [7] Morphologically Motivated Input Variations and Data Augmentation in Turkish-English Neural Machine Translation
    Yirmibesoglu, Zeynep
    Gungor, Tunga
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [8] Improved Unsupervised Neural Machine Translation with Semantically Weighted Back Translation for Morphologically Rich and Low Resource Languages
    Chauhan, Shweta
    Saxena, Shefali
    Daniel, Philemon
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (03) : 1707 - 1726
  • [9] Improved Unsupervised Neural Machine Translation with Semantically Weighted Back Translation for Morphologically Rich and Low Resource Languages
    Shweta Chauhan
    Shefali Saxena
    Philemon Daniel
    [J]. Neural Processing Letters, 2022, 54 : 1707 - 1726
  • [10] Neural Machine Translation for Morphologically Rich Languages with Improved Sub-word Units and Synthetic Data
    Pinnis, Marcis
    Krislauks, Rihards
    Deksne, Daiga
    Miks, Toms
    [J]. TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 237 - 245