Syntax-aware Transformer Encoder for Neural Machine Translation

Cited by: 0
Authors
Duan, Sufeng [1 ]
Zhao, Hai [1 ]
Zhou, Junru [1 ]
Wang, Rui [2 ]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Key Lab Shanghai Educ Commiss Intelligent Interac, MoE Key Lab Artificial Intelligence,AI Inst, Shanghai, Peoples R China
[2] Natl Inst Informat & Commun Technol NICT, Kyoto, Japan
Funding
National Natural Science Foundation of China;
Keywords
Neural Machine Translation; dependency parsing; POS Tagging;
DOI
10.1109/ialp48816.2019.9037672
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Syntax has been shown to be a helpful clue in various natural language processing tasks, including earlier statistical machine translation and recurrent neural network based machine translation. However, since state-of-the-art neural machine translation (NMT) is now built on Transformer-based encoders, few attempts have been made at such syntax enhancement. In this paper, we therefore explore effective ways to introduce syntax into the Transformer for better machine translation. We empirically compare two ways, positional encoding and input embedding, of exploiting syntactic clues from the dependency tree over the source sentence. Our proposed methods have the merit of keeping the Transformer architecture unchanged, so the efficiency of the Transformer is preserved. Experimental results on IWSLT'14 German-to-English and WMT14 English-to-German show that our method yields improvements over strong Transformer baselines.
Pages: 396 - 401
Page count: 6
Related papers
50 in total
  • [41] A Unified Syntax-aware Framework for Semantic Role Labeling
    Li, Zuchao
    He, Shexia
    Cai, Jiaxun
    Zhang, Zhuosheng
    Zhao, Hai
    Liu, Gongshen
    Li, Linlin
    Si, Luo
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2401 - 2411
  • [42] Syntax-Aware Sentence Matching with Graph Convolutional Networks
    Lei, Yangfan
    Hu, Yue
    Wei, Xiangpeng
    Xing, Luxi
    Liu, Quanchao
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 353 - 364
  • [43] A Syntax-Aware Re-ranker for Microblog Retrieval
    Severyn, Aliaksei
    Moschitti, Alessandro
    Tsagkias, Manos
    Berendsen, Richard
    de Rijke, Maarten
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1067 - 1070
  • [44] srcQL: A Syntax-Aware Query Language for Source Code
    Bartman, Brian
    Newman, Christian D.
    Collard, Michael L.
    Maletic, Jonathan I.
    2017 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), 2017, : 467 - 471
  • [45] Hierarchical Heterogeneous Graph Attention Network for Syntax-Aware Summarization
    Song, Zixing
    King, Irwin
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11340 - 11348
  • [46] Scalable Syntax-Aware Language Models Using Knowledge Distillation
    Kuncoro, Adhiguna
    Dyer, Chris
    Rimell, Laura
    Clark, Stephen
    Blunsom, Phil
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3472 - 3484
  • [47] Multi-Channel Encoder for Neural Machine Translation
    Xiong, Hao
    He, Zhongjun
    Hu, Xiaoguang
    Wu, Hua
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4962 - 4969
  • [48] Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation
    Li, Bei
    Liu, Hui
    Wang, Ziyang
    Jiang, Yufan
    Xiao, Tong
    Zhu, Jingbo
    Liu, Tongran
    Li, Changliang
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3512 - 3518
  • [49] Syntax-Informed Interactive Neural Machine Translation
    Gupta, Kamal Kumar
    Haque, Rejwanul
    Ekbal, Asif
    Bhattacharyya, Pushpak
    Way, Andy
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [50] Syntax-aware Natural Language Inference with Graph Matching Networks
    Lin, Yan-Tong
    Wu, Meng-Tse
    Su, Keh-Yih
    2020 25TH INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2020), 2020, : 85 - 90