Modeling Fluency and Faithfulness for Diverse Neural Machine Translation

Authors
Feng, Yang [1 ,2 ]
Xie, Wanying [1 ,3 ]
Gu, Shuhao [1 ,2 ]
Shao, Chenze [1 ,2 ]
Zhang, Wen [4 ]
Yang, Zhengxin [1 ,2 ]
Yu, Dong [3 ]
Affiliations
[1] Chinese Acad Sci ICT CAS, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Beijing Language & Culture Univ, Beijing, Peoples R China
[4] Smart Platform Prod Dept Tencent Inc, Beijing, Peoples R China
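Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract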
Neural machine translation models usually adopt the teacher forcing strategy for training, which requires the predicted sequence to match the ground truth word by word and forces the probability distribution of each prediction to approach a 0-1 distribution. However, this strategy assigns all of the probability mass to the ground-truth word and ignores the other words in the target vocabulary, even when the ground-truth word cannot dominate the distribution. To address this problem of teacher forcing, we propose a method that introduces an evaluation module to guide the distribution of the prediction. The evaluation module assesses each prediction from the perspectives of fluency and faithfulness, encouraging the model to generate a word that connects fluently with its past and future translation and, at the same time, tends to form a translation equivalent in meaning to the source. Experiments on multiple translation tasks show that our method achieves significant improvements over strong baselines.
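
The contrast the abstract draws between a 0-1 (one-hot) training target and a guided target can be illustrated with a small sketch. This is not the authors' evaluation module: the function `guided_target`, the interpolation weight `weight`, and the example scores are all hypothetical, chosen only to show how a target that is not purely one-hot leaves some probability for alternative words.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def one_hot(index, size):
    """The 0-1 target that teacher forcing trains toward."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def cross_entropy(target, predicted):
    """Cross-entropy between a target distribution and a prediction."""
    return -sum(t * math.log(p) for t, p in zip(target, predicted) if t > 0)


def guided_target(gold_index, eval_scores, weight):
    """Hypothetical guided target (an assumption, not the paper's formula):
    interpolate the one-hot target with a distribution derived from
    external evaluation scores over the vocabulary."""
    gold = one_hot(gold_index, len(eval_scores))
    guide = softmax(eval_scores)
    return [(1 - weight) * g + weight * e for g, e in zip(gold, guide)]


# Toy vocabulary of three words; word 0 is the ground truth,
# word 1 is a plausible alternative, word 2 is a poor choice.
probs = softmax([2.0, 1.8, -1.0])

# Teacher forcing: only the ground-truth word contributes to the loss.
tf_loss = cross_entropy(one_hot(0, 3), probs)

# Guided target: the alternative word keeps some target probability.
target = guided_target(0, [1.0, 1.0, -3.0], weight=0.3)
guided_loss = cross_entropy(target, probs)
```

With `weight=0` the guided target reduces to the one-hot target, so the sketch recovers plain teacher forcing as a special case.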
Pages: 59-66
Page count: 8