Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Authors
Kang, Xiaomian [1 ,2 ]
Zhao, Yang [1 ,2 ]
Zhang, Jiajun [1 ,2 ,3 ]
Zong, Chengqing [1 ,2 ,4 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[3] Beijing Acad Artificial Intelligence, Beijing, Peoples R China
[4] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document-level neural machine translation has yielded attractive improvements. However, the majority of existing methods indiscriminately use all context sentences within a fixed scope, neglecting the fact that different source sentences need different amounts of context. To address this problem, we propose an effective approach that selects context dynamically, so that the document-level translation model can use the most useful context sentences to produce better translations. Specifically, we introduce a selection module, independent of the translation module, that scores each candidate context sentence. We then propose two strategies that explicitly select a variable number of context sentences and feed them into the translation module. The two modules are trained end-to-end via reinforcement learning, with a novel reward that encourages the selection and utilization of dynamic context sentences. Experiments demonstrate that our approach adaptively selects context sentences for different source sentences and significantly improves the performance of document-level translation methods.
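To make the idea of scoring candidates and keeping a variable number of them concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the two selection strategies, the scoring network, or the reward, so the threshold rule, the independent Bernoulli sampling, the REINFORCE-style loss, and all function names below are assumptions chosen purely for illustration.

```python
import math
import random


def sigmoid(x):
    """Map a raw selection score to a keep-probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def select_by_threshold(scores, threshold=0.5):
    """Hypothetical deterministic strategy: keep every candidate context
    sentence whose keep-probability exceeds the threshold, so the number
    of selected sentences varies from one source sentence to the next."""
    return [i for i, s in enumerate(scores) if sigmoid(s) > threshold]


def sample_selection(scores, rng):
    """Hypothetical stochastic strategy: sample an independent keep/drop
    decision per candidate and return the chosen indices together with
    the log-probability of the sampled decisions."""
    chosen, log_prob = [], 0.0
    for i, s in enumerate(scores):
        p = sigmoid(s)
        if rng.random() < p:
            chosen.append(i)
            log_prob += math.log(p)
        else:
            log_prob += math.log(1.0 - p)
    return chosen, log_prob


def reinforce_loss(log_prob, reward, baseline=0.0):
    """REINFORCE-style surrogate loss: minimising it raises the probability
    of selections whose reward exceeds the baseline. The reward would come
    from translation quality; here it is just a number (an assumption)."""
    return -(reward - baseline) * log_prob


# Example: four candidate context sentences with made-up raw scores.
scores = [2.0, -1.0, 0.3, -3.0]
kept = select_by_threshold(scores)
sampled, lp = sample_selection(scores, random.Random(0))
loss = reinforce_loss(lp, reward=1.0, baseline=0.2)
```

In a real system the scores would be produced by a learned selection module and the reward by the translation module's output quality; the sketch only shows how a variable-size selection and a policy-gradient signal can fit together.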
Pages: 2242 - 2254
Page count: 13