Improving tree-based neural machine translation with dynamic lexicalized dependency encoding

Cited by: 14
Authors
Yang, Baosong [1 ]
Wong, Derek F. [1 ]
Chao, Lidia S. [1 ]
Zhang, Min [2 ]
Affiliations
[1] Univ Macau, Nat Language Proc & Portuguese Chinese Machine Tr, Dept Comp & Informat Sci, Macau, Peoples R China
[2] Soochow Univ, Inst Artificial Intelligence, Suzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Syntactic modeling; Dynamic parameters; Tree-RNN; Neural machine translation (NMT);
DOI
10.1016/j.knosys.2019.105042
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Tree-to-sequence neural machine translation models have proven effective at learning semantic representations from the syntactic structure they exploit. Despite their success, tree-to-sequence models have two major issues: (1) the embeddings of constituents at higher tree levels tend to contribute less to translation; and (2) a single set of model parameters struggles to fully capture the syntactic and semantic richness of linguistic phrases. To address the first problem, we propose a lexicalized dependency model, in which the source-side lexical representations are learned in a head-dependent fashion following a dependency graph. Since the number of dependents is variable, we propose a variant recurrent neural network (RNN) to jointly consider the long-distance dependencies and the sequential information of words. To address the second problem, we adopt a latent vector to dynamically condition the parameters for the composition of each node representation. Experimental results reveal that the proposed model significantly outperforms recently proposed tree-based methods on English-Chinese and English-German translation tasks, while using far fewer parameters. (C) 2019 Elsevier B.V. All rights reserved.
Pages: 10
Related papers
50 in total
  • [21] Tree-based dynamic classifier chains
    Mencia, Eneldo Loza
    Kulessa, Moritz
    Bohlender, Simon
    Fuernkranz, Johannes
    MACHINE LEARNING, 2023, 112 (11) : 4129 - 4165
  • [22] Tree-based dynamic classifier chains
    Eneldo Loza Mencía
    Moritz Kulessa
    Simon Bohlender
    Johannes Fürnkranz
    Machine Learning, 2023, 112 : 4129 - 4165
  • [23] Analyzing and improving reliability: A tree-based approach
    Southern Methodist University, United States
    Unknown
    Unknown
    IEEE Software, 2 (97-104):
  • [24] Analyzing and improving reliability: A tree-based approach
    Tian, J
    Palma, J
    IEEE SOFTWARE, 1998, 15 (02) : 97 - +
  • [25] Improving Neural Machine Translation with Neural Sentence Rewriting
    Wu, Tian
    He, Zhongjun
    Chen, Enhong
    Wang, Haifeng
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 147 - 152
  • [26] Neural machine translation with Gumbel Tree-LSTM based encoder
    Su, Chao
    Huang, Heyan
    Shi, Shumin
    Jian, Ping
    Shi, Xuewen
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [27] Improving Neural Machine Translation with Neural Syntactic Distance
    Ma, Chunpeng
    Tamura, Akihiro
    Utiyama, Masao
    Zhao, Tiejun
    Sumita, Eiichiro
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2032 - 2037
  • [28] Improve Neural Machine Translation by Syntax Tree
    Chen, Siyu
    Yu, Qingsong
    ISCSIC'18: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, 2018,
  • [29] Syntactic Features of the Noun A dependency tree-based quantitative Study
    Li, Yuan
    Shi, Si
    Liu, Haitao
    MUTTERSPRACHE, 2021, 131 (03): : 201 - 224
  • [30] A tree-based approach for English-to-Turkish translation
    Bakay, Ozge
    Avar, Begum
    Yildiz, Olcay Taner
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (01) : 437 - 452