Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Cited by: 0
Authors
Kang, Xiaomian [1, 2]
Zhao, Yang [1, 2]
Zhang, Jiajun [1, 2, 3]
Zong, Chengqing [1, 2, 4]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
[4] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Document-level neural machine translation has yielded attractive improvements. However, the majority of existing methods indiscriminately use all context sentences within a fixed scope. They neglect the fact that different source sentences need different amounts of context. To address this problem, we propose an effective approach for selecting dynamic context, so that the document-level translation model can use the more useful selected context sentences to produce better translations. Specifically, we introduce a selection module, independent of the translation module, that scores each candidate context sentence. We then propose two strategies to explicitly select a variable number of context sentences and feed them into the translation module. We train the two modules end-to-end via reinforcement learning. A novel reward is proposed to encourage the selection and utilization of dynamic context sentences. Experiments demonstrate that our approach can select adaptive context sentences for different source sentences and that it significantly improves the performance of document-level translation methods.
Pages: 2242-2254
Number of pages: 13
Related papers
  • [1] Routing Based Context Selection for Document-Level Neural Machine Translation
    Fei, Weilun
    Jian, Ping
    Zhu, Xiaoguang
    Lin, Yi
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 77 - 91
  • [2] Document-Level Neural Machine Translation With Recurrent Context States
    Zhao, Yue
    Liu, Hui
    IEEE ACCESS, 2023, 11 : 27519 - 27526
  • [3] CONTEXT-ADAPTIVE DOCUMENT-LEVEL NEURAL MACHINE TRANSLATION
    Zhang, Linlin
    Zhang, Zhirui
    Chen, Boxing
    Luo, Weihua
    Si, Luo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6232 - 6236
  • [4] Document Flattening: Beyond Concatenating Context for Document-Level Neural Machine Translation
    Wu, Minghao
    Foster, George
    Qu, Lizhen
    Haffari, Gholamreza
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 448 - 462
  • [5] Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context
    Tan, Xin
    Zhang, Long-Yin
    Zhou, Guo-Dong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (02) : 295 - 308
  • [6] Toward Understanding Most of the Context in Document-Level Neural Machine Translation
    Choi, Gyu-Hyeon
    Shin, Jong-Hun
    Lee, Yo-Han
    Kim, Young-Kil
    ELECTRONICS, 2022, 11 (15)
  • [7] Document-level Neural Machine Translation Using BERT as Context Encoder
    Guo, Zhiyu
    Minh Le Nguyen
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 94 - 100
  • [8] Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context
    Xin Tan
    Long-Yin Zhang
    Guo-Dong Zhou
    Journal of Computer Science and Technology, 2022, 37 : 295 - 308
  • [9] Hierarchical Modeling of Global Context for Document-Level Neural Machine Translation
    Tan, Xin
    Zhang, Longyin
    Xiong, Deyi
    Zhou, Guodong
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1576 - 1585
  • [10] Rethinking Document-level Neural Machine Translation
    Sun, Zewei
    Wang, Mingxuan
    Zhou, Hao
    Zhao, Chengqi
    Huang, Shujian
    Chen, Jiajun
    Li, Lei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3537 - 3548