Enhancing Machine Translation with Dependency-Aware Self-Attention

被引:0
|
作者
Bugliarello, Emanuele [1 ,2 ]
Okazaki, Naoaki [2 ]
机构
[1] Univ Copenhagen, Copenhagen, Denmark
[2] Tokyo Inst Technol, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most neural machine translation models only rely on pairs of parallel sentences, assuming syntactic information is automatically learned by an attention mechanism. In this work, we investigate different approaches to incorporate syntactic knowledge in the Transformer model and also propose a novel, parameter-free, dependency-aware self-attention mechanism that improves its translation quality, especially for long sentences and in low-resource scenarios. We show the efficacy of each approach on WMT English <-> German and English -> Turkish, and WAT English -> Japanese translation tasks.
引用
收藏
页码:1618 / 1627
页数:10
相关论文
共 50 条
  • [31] Dependency-Aware Metamorphic Testing of Datalog Engines
    Mansur, Muhammad Numair
    Wuestholz, Valentin
    Christakis, Maria
    [J]. PROCEEDINGS OF THE 32ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS, ISSTA 2023, 2023, : 236 - 247
  • [32] DABT: A Dependency-aware Bug Triaging Method
    Jahanshahi, Hadi
    Chhabra, Kritika
    Cevik, Mucahit
    Basar, Ayse
    [J]. PROCEEDINGS OF EVALUATION AND ASSESSMENT IN SOFTWARE ENGINEERING (EASE 2021), 2021, : 221 - 230
  • [33] Dependency-aware unequal erasure protection codes
    Bouabdallah A.
    Lacan J.
    [J]. Journal of Zhejiang University-SCIENCE A, 2006, 7 (Suppl 1): : 27 - 33
  • [34] Dependency-Aware Caching for HTTP Adaptive Streaming
    Zhang, Cong
    Liu, Jiangchuan
    Chen, Fei
    Cui, Yong
    Ngai, Edith C. -H.
    [J]. 2016 DIGITAL MEDIA INDUSTRY AND ACADEMIC FORUM (DMIAF), 2016, : 89 - 93
  • [35] ENHANCING HYBRID SELF-ATTENTION STRUCTURE WITH RELATIVE-POSITION-AWARE BIAS FOR SPEECH SYNTHESIS
    Yang, Shan
    Lu, Heng
    Kang, Shiying
    Xie, Lei
    Yu, Dong
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6910 - 6914
  • [36] Task Allocation in Dependency-aware Spatial Crowdsourcing
    Ni, Wangze
    Cheng, Peng
    Chen, Lei
    Lin, Xuemin
    [J]. 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 985 - 996
  • [37] Dependency-aware Maintenance for Dynamic Grid Services
    Jin, Hai
    Qi, Li
    Wu, Song
    Luo, Yaqin
    Dai, Jie
    [J]. 2007 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS (ICPP), 2007, : 532 - 539
  • [38] Dependency-aware unequal erasure protection codes
    BOUABDALLAH Amine
    LACAN Jér?me
    [J]. Journal of Zhejiang University-Science A(Applied Physics & Engineering), 2006, (S1) : 27 - 33
  • [39] Dependency-aware action planning for smart home
    Kim, Jongjin
    Lee, Jaeri
    Yun, Jeongin
    Kang, U.
    [J]. PLOS ONE, 2024, 19 (06):
  • [40] Dependency-Aware Distributed Video Transcoding in the Cloud
    Zakerinasab, Mohammad Reza
    Wang, Mea
    [J]. 40TH ANNUAL IEEE CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2015), 2015, : 245 - 252