Document-Level Machine Translation with Large Language Models

被引:0
|
作者
Wang, Longyue [1 ]
Lyu, Chenyang [2 ]
Ji, Tianbo [3 ]
Zhang, Zhirui [1 ]
Yu, Dian [1 ]
Shi, Shuming [1 ]
Tu, Zhaopeng [1 ]
机构
[1] Tencent AI Lab, Shenzhen, Guangdong, Peoples R China
[2] MBZUAI, Abu Dhabi, U Arab Emirates
[3] Dublin City Univ, Dublin, Ireland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking documentlevel machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on impacts of training techniques on discourse modeling. By evaluating on a number of benchmarks, we surprisingly find that LLMs have demonstrated superior performance and show potential to become a new paradigm for document-level translation: 1) leveraging their powerful long-text modeling capabilities, GPT-3.5 and GPT-4 outperform commercial MT systems in terms of human evaluation;(1) 2) GPT-4 demonstrates a stronger ability for probing linguistic knowledge than GPT-3.5. This work highlights the challenges and opportunities of LLMs for MT, which we hope can inspire the future design and evaluation of LLMs.(2)
引用
收藏
页码:16646 / 16661
页数:16
相关论文
共 50 条
  • [1] AutoRE: Document-Level Relation Extraction with Large Language Models
    Xue, Lilong
    Zhang, Dan
    Dong, Yuxiao
    Tang, Jie
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 3: SYSTEM DEMONSTRATIONS, 2024, : 211 - 220
  • [2] TANDO: A Corpus for Document-level Machine Translation
    Gete, Harritxu
    Etchegoyhen, Thierry
    Ponce, David
    Labaka, Gorka
    Aranberri, Nora
    Corral, Ander
    Saralegi, Xabier
    Santos, Igor Ellakuria
    Martin, Maite
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3026 - 3037
  • [3] Rethinking Document-level Neural Machine Translation
    Sun, Zewei
    Wang, Mingxuan
    Zhou, Hao
    Zhao, Chengqi
    Huang, Shujian
    Chen, Jiajun
    Li, Lei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3537 - 3548
  • [4] Document-Level Adaptation for Neural Machine Translation
    Kothur, Sachith Sri Ram
    Knowles, Rebecca
    Koehn, Philipp
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 64 - 73
  • [5] Corpora for Document-Level Neural Machine Translation
    Liu, Siyou
    Zhang, Xiaojun
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3775 - 3781
  • [6] Large language models effectively leverage document-level context for literary translation, but critical errors persist
    Karpinska, Marzena
    Iyyer, Mohit
    arXiv, 2023,
  • [7] Document-Level Machine Translation as a Re-translation Process
    Martinez Garcia, Eva
    Espana-Bonet, Cristina
    Marquez, Lluis
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2014, (53): : 103 - 110
  • [8] On Search Strategies for Document-Level Neural Machine Translation
    Herold, Christian
    Ney, Hermann
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 12827 - 12836
  • [9] G-Transformer for Document-level Machine Translation
    Bao, Guangsheng
    Zhang, Yue
    Teng, Zhiyang
    Chen, Boxing
    Luo, Weihua
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 3442 - 3455
  • [10] Exploring Discourse Structure in Document-level Machine Translation
    Hu, Xinyu
    Wan, Xiaojun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13889 - 13902