Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

被引:0
|
作者
Kang, Xiaomian [1 ,2 ]
Zhao, Yang [1 ,2 ]
Zhang, Jiajun [1 ,2 ,3 ]
Zong, Chengqing [1 ,2 ,4 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
[4] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level neural machine translation has yielded attractive improvements. However, majority of existing methods roughly use all context sentences in a fixed scope. They neglect the fact that different source sentences need different sizes of context. To address this problem, we propose an effective approach to select dynamic context so that the document-level translation model can utilize the more useful selected context sentences to produce better translations. Specifically, we introduce a selection module that is independent of the translation module to score each candidate context sentence. Then, we propose two strategies to explicitly select a variable number of context sentences and feed them into the translation module. We train the two modules end-to-end via reinforcement learning. A novel reward is proposed to encourage the selection and utilization of dynamic context sentences. Experiments demonstrate that our approach can select adaptive context sentences for different source sentences, and significantly improves the performance of document-level translation methods.
引用
收藏
页码:2242 / 2254
页数:13
相关论文
共 50 条
  • [31] Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
    Wu, Minghao
    Wang, Yufei
    Foster, George
    Qiu, Lizhen
    Haffari, Gholamreza
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 740 - 752
  • [32] Improving the Transformer Translation Model with Document-Level Context
    Zhang, Jiacheng
    Luan, Huanbo
    Sun, Maosong
    Zhai, FeiFei
    Xu, Jingfang
    Zhang, Min
    Liu, Yang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 533 - 542
  • [33] G-Transformer for Document-level Machine Translation
    Bao, Guangsheng
    Zhang, Yue
    Teng, Zhiyang
    Chen, Boxing
    Luo, Weihua
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3442 - 3455
  • [34] Exploring Discourse Structure in Document-level Machine Translation
    Hu, Xinyu
    Wan, Xiaojun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13889 - 13902
  • [35] Document-Level Machine Translation with Large Language Models
    Wang, Longyue
    Lyu, Chenyang
    Ji, Tianbo
    Zhang, Zhirui
    Yu, Dian
    Shi, Shuming
    Tu, Zhaopeng
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16646 - 16661
  • [36] Better Document-Level Machine Translation with Bayes' Rule
    Yu, Lei
    Sartran, Laurent
    Stokowiec, Wojciech
    Ling, Wang
    Kong, Lingpeng
    Blunsom, Phil
    Dyer, Chris
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 346 - 360
  • [37] Towards Personalised and Document-level Machine Translation of Dialogue
    Vincent, Sebastian T.
    EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 137 - 147
  • [38] Non-Autoregressive Document-Level Machine Translation
    Bao, Guangsheng
    Teng, Zhiyang
    Zhou, Hao
    Yan, Jianhao
    Zhang, Yue
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14791 - 14803
  • [39] Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model
    Sugiyama, Amane
    Yoshinaga, Naoki
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5781 - 5791
  • [40] Multi-Hop Transformer for Document-Level Machine Translation
    Zhang, Long
    Zhang, Tong
    Zhang, Haibo
    Yang, Baosong
    Ye, Wei
    Zhang, Shikun
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3953 - 3963