Syntax-aware Transformer Encoder for Neural Machine Translation

被引:0
|
作者
Duan, Sufeng [1 ]
Zhao, Hai [1 ]
Zhou, Junru [1 ]
Wang, Rui [2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, MoE Key Lab Artificial Intelligence,AI Inst, Shanghai, Peoples R China
[2] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan
基金
中国国家自然科学基金;
关键词
Neural Machine Translation; dependency parsing; POS Tagging;
D O I
10.1109/ialp48816.2019.9037672
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Syntax has been shown a helpful clue in various natural language processing tasks including previous statistical machine translation and recurrent neural network based machine translation. However, since the state-of-the-art neural machine translation (NMT) has to be built on the Transformer based encoder, few attempts are found on such a syntax enhancement. Thus in this paper, we explore effective ways to introduce syntax into Transformer for better machine translation. We empirically compare two ways, positional encoding and input embedding, to exploit syntactic clues from dependency tree over source sentence. Our proposed methods have a merit keeping the architecture of Transformer unchanged, thus the efficiency of Transformer can be kept. The experimental results on IWSLT' 14 German-to-English and WMT14 English-to-German show that our method can yield advanced results over strong Transformer baselines.
引用
收藏
页码:396 / 401
页数:6
相关论文
共 50 条
  • [21] Syntax-Aware Representation for Aspect Term Extraction
    Zhang, Jingyuan
    Xu, Guangluan
    Wang, Xinyi
    Sun, Xian
    Huang, Tinglei
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 123 - 134
  • [22] Syntax-Aware Mutation for Testing the Solidity Compiler
    Mitropoulos, Charalambos
    Sotiropoulos, Thodoris
    Ioannidis, Sotiris
    Mitropoulos, Dimitris
    COMPUTER SECURITY - ESORICS 2023, PT III, 2024, 14346 : 327 - 347
  • [23] Metapath and syntax-aware heterogeneous subgraph neural networks for spam review detection
    Zhang, Zhiqiang
    Dong, Yuhang
    Wu, Haiyan
    Song, Haiyu
    Deng, Shengchun
    Chen, Yanhong
    APPLIED SOFT COMPUTING, 2022, 128
  • [24] Syntax-aware Multilingual Semantic Role Labeling
    He, Shexia
    Li, Zuchao
    Zhao, Hai
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5350 - 5359
  • [25] Towards Syntax-Aware Editors for Visual Languages
    Costagliola, Gennaro
    Deufemia, Vincenzo
    Polese, Giuseppe
    ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2005, 127 (04) : 107 - 125
  • [26] Syntax-aware on-the-fly code completion
    Takerngsaksiri, Wannita
    Tantithamthavorn, Chakkrit
    Li, Yuan-Fang
    INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 165
  • [27] Building syntax-aware editors for visual languages
    Costagliola, G
    Deufemia, V
    Polese, G
    Risi, M
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2005, 16 (06): : 508 - 540
  • [28] Improving BERT with Syntax-aware Local Attention
    Li, Zhongli
    Zhou, Qingyu
    Li, Chao
    Xu, Ke
    Cao, Yunbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 645 - 653
  • [29] Context- and Sequence-Aware Convolutional Recurrent Encoder for Neural Machine Translation
    Mallick, Ritam
    Susan, Seba
    Agrawal, Vaibhaw
    Garg, Rizul
    Rawal, Prateek
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 853 - 856
  • [30] Syntax-aware Semantic Role Labeling without Parsing
    Cai, Rui
    Lapata, Mirella
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2019, 7 : 343 - 356