Understanding and Improving Hidden Representation for Neural Machine Translation

Cited by: 0
Authors
Li, Guanlin [1]
Liu, Lemao [2]
Li, Xintong [3]
Zhu, Conghui [1]
Zhao, Tiejun [1]
Shi, Shuming [2]
Affiliations
[1] Harbin Inst Technol, Harbin, Peoples R China
[2] Tencent AI Lab, Bellevue, WA USA
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Multilayer architectures are currently the gold standard for large-scale neural machine translation. Existing work has explored methods for understanding the hidden representations; however, it has not sought to improve translation quality in a principled way based on that understanding. Aiming at understanding for the sake of performance improvement, we first artificially construct a sequence of nested relative tasks and measure the feature generalization ability of the learned hidden representations over these tasks. Based on our understanding, we then propose to regularize the layer-wise representations with all tree-induced tasks. To overcome the computational bottleneck resulting from the large number of regularization terms, we design efficient approximation methods that select a few coarse-to-fine tasks for regularization. Extensive experiments on two widely used datasets demonstrate that the proposed methods incur only a small extra overhead in training and no additional overhead in testing, and achieve consistent improvements (up to +1.3 BLEU) over the state-of-the-art translation model.
Pages: 466-477
Number of pages: 12