Understanding and Improving Hidden Representation for Neural Machine Translation

被引:0
|
作者
Li, Guanlin [1 ]
Liu, Lemao [2 ]
Li, Xintong [3 ]
Zhu, Conghui [1 ]
Zhao, Tiejun [1 ]
Shi, Shuming [2 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] Tencent AI Lab, Bellevue, WA USA
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multilayer architectures are currently the gold standard for large-scale neural machine translation. Existing works have explored some methods for understanding the hidden representations, however, they have not sought to improve the translation quality rationally according to their understanding. Towards understanding for performance improvement, we first artificially construct a sequence of nested relative tasks and measure the feature generalization ability of the learned hidden representation over these tasks. Based on our understanding, we then propose to regularize the layer-wise representations with all treeinduced tasks. To overcome the computational bottleneck resulting from the large number of regularization terms, we design efficient approximation methods by selecting a few coarse-to-fine tasks for regularization. Extensive experiments on two widely-used datasets demonstrate the proposed methods only lead to small extra overheads in training but no additional overheads in testing, and achieve consistent improvements (up to +1.3 BLEU) compared to the state-of-the-art translation model.
引用
收藏
页码:466 / 477
页数:12
相关论文
共 50 条
  • [31] Improving Multilingual Neural Machine Translation with Auxiliary Source Languages
    Xu, Weijia
    Yin, Yuwei
    Ma, Shuming
    Zhang, Dongdong
    Huang, Haoyang
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 3029 - 3041
  • [32] Towards a Better Understanding of Label Smoothing in Neural Machine Translation
    Gao, Yingbo
    Wang, Weiyue
    Herold, Christian
    Yang, Zijian
    Ney, Hermann
    [J]. 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 212 - 223
  • [33] Detecting Source Contextual Barriers for Understanding Neural Machine Translation
    Li, Guanlin
    Liu, Lemao
    Zhu, Conghui
    Wang, Rui
    Zhao, Tiejun
    Shi, Shuming
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3158 - 3169
  • [34] Improving Neural Machine Translation by Efficiently Incorporating Syntactic Templates
    Phuong Nguyen
    Tung Le
    Thanh-Le Ha
    Thai Dang
    Khanh Tran
    Kim Anh Nguyen
    Nguyen Le Minh
    [J]. ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 303 - 314
  • [35] Improving Adversarial Neural Machine Translation for Morphologically Rich Language
    Mi, Chenggang
    Xie, Lei
    Zhang, Yanning
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (04): : 417 - 426
  • [36] Towards Understanding Neural Machine Translation with Attention Heads' Importance
    Zhou, Zijie
    Zhu, Junguo
    Li, Weijiang
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [37] Compositional Representation of Morphologically-Rich Input for Neural Machine Translation
    Ataman, Duygu
    Federico, Marcello
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 305 - 311
  • [38] Non-autoregressive neural machine translation with auxiliary representation fusion
    Du, Quan
    Feng, Kai
    Xu, Chen
    Xiao, Tong
    Zhu, Jingbo
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7229 - 7239
  • [39] UNDERSTANDING MACHINE TRANSLATION
    Varga, Agnes
    [J]. IDIMT-2006, 2006, 19 : 285 - 296
  • [40] Improving Chinese-Vietnamese Neural Machine Translation with Linguistic Differences
    Yu, Zhiqiang
    Yu, Zhengtao
    Xian, Yantuan
    Huang, Yuxin
    Guo, Junjun
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)