Document-Level Neural Machine Translation With Recurrent Context States

被引:1
|
作者
Zhao, Yue [1 ]
Liu, Hui [2 ]
机构
[1] Northeastern Univ, Sch Marxism, Shenyang 110819, Peoples R China
[2] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
关键词
Context modeling; Training; Complexity theory; Decoding; Computational modeling; Machine translation; Transformers; Neural machine translation; document-level translation; speeding up;
D O I
10.1109/ACCESS.2023.3247508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating contextual information into sentence-level neural machine translation (NMT) systems has been proven to be effective in generating fluent and coherent translations. However, taking too much context into account slows down these systems, especially when context-aware models are applied to the decoder side. To improve efficiency, we propose a simple and fast method to encode all sentences in an arbitrary large context window. It makes contextual representations in the process of translating each sentence so that the overhead introduced by the context model is almost negligible. We experiment with our method on three widely used English-German document-level translation datasets, which obtain substantial improvements over the sentence-level baseline with almost no loss in efficiency. Moreover, our method also achieves comparable performance with previous strong context-aware baselines and speeds up the inference by 1.53x. The speed-up is even larger when more contexts are taken into account. On the ContraPro pronoun translation dataset, it significantly outperforms the strong baseline.
引用
收藏
页码:27519 / 27526
页数:8
相关论文
共 50 条
  • [31] Improving the Transformer Translation Model with Document-Level Context
    Zhang, Jiacheng
    Luan, Huanbo
    Sun, Maosong
    Zhai, FeiFei
    Xu, Jingfang
    Zhang, Min
    Liu, Yang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 533 - 542
  • [32] G-Transformer for Document-level Machine Translation
    Bao, Guangsheng
    Zhang, Yue
    Teng, Zhiyang
    Chen, Boxing
    Luo, Weihua
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3442 - 3455
  • [33] Exploring Discourse Structure in Document-level Machine Translation
    Hu, Xinyu
    Wan, Xiaojun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13889 - 13902
  • [34] Document-Level Machine Translation with Large Language Models
    Wang, Longyue
    Lyu, Chenyang
    Ji, Tianbo
    Zhang, Zhirui
    Yu, Dian
    Shi, Shuming
    Tu, Zhaopeng
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16646 - 16661
  • [35] Better Document-Level Machine Translation with Bayes' Rule
    Yu, Lei
    Sartran, Laurent
    Stokowiec, Wojciech
    Ling, Wang
    Kong, Lingpeng
    Blunsom, Phil
    Dyer, Chris
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 346 - 360
  • [36] Towards Personalised and Document-level Machine Translation of Dialogue
    Vincent, Sebastian T.
    EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 137 - 147
  • [37] Non-Autoregressive Document-Level Machine Translation
    Bao, Guangsheng
    Teng, Zhiyang
    Zhou, Hao
    Yan, Jianhao
    Zhang, Yue
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14791 - 14803
  • [38] Context-aware Decoder for Neural Machine Translation using a Target-side Document-Level Language Model
    Sugiyama, Amane
    Yoshinaga, Naoki
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5781 - 5791
  • [39] Multi-Hop Transformer for Document-Level Machine Translation
    Zhang, Long
    Zhang, Tong
    Zhang, Haibo
    Yang, Baosong
    Ye, Wei
    Zhang, Shikun
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 3953 - 3963
  • [40] Target-Side Augmentation for Document-Level Machine Translation
    Bao, Guangsheng
    Teng, Zhiyang
    Zhang, Yue
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10725 - 10740