Context-Aware Neural Machine Translation Learns Anaphora Resolution

被引:0
|
作者
Voita, Elena [1 ,2 ]
Serdyukov, Pavel [1 ]
Sennrich, Rico [3 ,4 ]
Titov, Ivan [2 ,3 ]
机构
[1] Yandex, Moscow, Russia
[2] Univ Amsterdam, Amsterdam, Netherlands
[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[4] Univ Zurich, Zurich, Switzerland
基金
美国国家科学基金会; 瑞士国家科学基金会; 欧洲研究理事会;
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence. We introduce a context-aware neural machine translation model designed in such way that the flow of information from the extended context to the translation model can be controlled and analyzed. We experiment with an English-Russian subtitles dataset, and observe that much of what is captured by our model deals with improving pronoun translation. We measure correspondences between induced attention distributions and coreference relations and observe that the model implicitly captures anaphora. It is consistent with gains for sentences where pronouns need to be gendered in translation. Beside improvements in anaphoric cases, the model also improves in overall BLEU, both over its context-agnostic version (+0.7) and over simple concatenation of the context and source sentences (+0.6).
引用
收藏
页码:1264 / 1274
页数:11
相关论文
共 50 条
  • [1] A study of BERT for context-aware neural machine translation
    Xueqing Wu
    Yingce Xia
    Jinhua Zhu
    Lijun Wu
    Shufang Xie
    Tao Qin
    [J]. Machine Learning, 2022, 111 : 917 - 935
  • [2] A Context-Aware Recurrent Encoder for Neural Machine Translation
    Zhang, Biao
    Xiong, Deyi
    Su, Jinsong
    Duan, Hong
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (12) : 2424 - 2432
  • [3] Context-Aware Monolingual Repair for Neural Machine Translation
    Voita, Elena
    Sennrich, Rico
    Titov, Ivan
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 877 - 886
  • [4] A study of BERT for context-aware neural machine translation
    Wu, Xueqing
    Xia, Yingce
    Zhu, Jinhua
    Wu, Lijun
    Xie, Shufang
    Qin, Tao
    [J]. MACHINE LEARNING, 2022, 111 (03) : 917 - 935
  • [5] Selective Attention for Context-aware Neural Machine Translation
    Maruf, Sameen
    Martins, Andre F. T.
    Haffari, Gholamreza
    [J]. 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3092 - 3102
  • [6] Context-Aware Neural Machine Translation for Korean Honorific Expressions
    Hwang, Yongkeun
    Kim, Yanghoon
    Jung, Kyomin
    [J]. ELECTRONICS, 2021, 10 (13)
  • [7] One Type Context Is Not Enough: Global Context-aware Neural Machine Translation
    Chen, Linqing
    Li, Junhui
    Gong, Zhengxian
    Zhang, Min
    Zhou, Guodong
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (06)
  • [8] Context-Aware Linguistic Steganography Model Based on Neural Machine Translation
    Ding, Changhao
    Fu, Zhangjie
    Yang, Zhongliang
    Yu, Qi
    Li, Daqiu
    Huang, Yongfeng
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (868-878) : 868 - 878
  • [9] Context-aware Neural Machine Translation with Mini-batch Embedding
    Morishita, Makoto
    Suzuki, Jun
    Iwata, Tomoharu
    Nagata, Masaaki
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2513 - 2521
  • [10] Context-Aware Phrase Representation for Statistical Machine Translation
    Ruan, Zhiwei
    Su, Jinsong
    Xiong, Deyi
    Ji, Rongrong
    [J]. PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 137 - 149