Document-Level Machine Translation with Large Language Models

被引:0
|
作者
Wang, Longyue [1 ]
Lyu, Chenyang [2 ]
Ji, Tianbo [3 ]
Zhang, Zhirui [1 ]
Yu, Dian [1 ]
Shi, Shuming [1 ]
Tu, Zhaopeng [1 ]
机构
[1] Tencent AI Lab, Shenzhen, Guangdong, Peoples R China
[2] MBZUAI, Abu Dhabi, U Arab Emirates
[3] Dublin City Univ, Dublin, Ireland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking documentlevel machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on impacts of training techniques on discourse modeling. By evaluating on a number of benchmarks, we surprisingly find that LLMs have demonstrated superior performance and show potential to become a new paradigm for document-level translation: 1) leveraging their powerful long-text modeling capabilities, GPT-3.5 and GPT-4 outperform commercial MT systems in terms of human evaluation;(1) 2) GPT-4 demonstrates a stronger ability for probing linguistic knowledge than GPT-3.5. This work highlights the challenges and opportunities of LLMs for MT, which we hope can inspire the future design and evaluation of LLMs.(2)
引用
收藏
页码:16646 / 16661
页数:16
相关论文
共 50 条
  • [31] Document-Level Machine Translation with Effective Batch-Level Context Representation
    Zhong, Kang
    Zhang, Jie
    Guo, Wu
    2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,
  • [32] Routing Based Context Selection for Document-Level Neural Machine Translation
    Fei, Weilun
    Jian, Ping
    Zhu, Xiaoguang
    Lin, Yi
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 77 - 91
  • [33] Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context
    Tan, Xin
    Zhang, Long-Yin
    Zhou, Guo-Dong
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (02) : 295 - 308
  • [34] Addressing the Length Bias Problem in Document-Level Neural Machine Translation
    Zhang, Zhuocheng
    Gu, Shuhao
    Zhang, Min
    Feng, Yang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11545 - 11556
  • [35] Toward Understanding Most of the Context in Document-Level Neural Machine Translation
    Choi, Gyu-Hyeon
    Shin, Jong-Hun
    Lee, Yo-Han
    Kim, Young-Kil
    ELECTRONICS, 2022, 11 (15)
  • [36] Learning Contextualized Sentence Representations for Document-Level Neural Machine Translation
    Zhang, Pei
    Zhang, Xu
    Chen, Wei
    Yu, Jian
    Wang, Yanfeng
    Xiong, Deyi
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2298 - 2305
  • [37] Document-level Neural Machine Translation Using BERT as Context Encoder
    Guo, Zhiyu
    Minh Le Nguyen
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 94 - 100
  • [38] Document-Level Neural Machine Translation with Hierarchical Modeling of Global Context
    Xin Tan
    Long-Yin Zhang
    Guo-Dong Zhou
    Journal of Computer Science and Technology, 2022, 37 : 295 - 308
  • [39] Hierarchical Modeling of Global Context for Document-Level Neural Machine Translation
    Tan, Xin
    Zhang, Longyin
    Xiong, Deyi
    Zhou, Guodong
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1576 - 1585
  • [40] Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation
    Junczys-Dowmunt, Marcin
    FOURTH CONFERENCE ON MACHINE TRANSLATION (WMT 2019), 2019, : 225 - 233