Creating a Corpus for Russian Data-to-Text Generation Using Neural Machine Translation and Post-Editing

Authors
Shimorina, Anastasia [1, 2]
Khasanova, Elena [1]
Gardent, Claire [2, 3]
Affiliations
[1] Lorraine Univ, Nancy, France
[2] LORIA, Nancy, France
[3] CNRS, Nancy, France
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we propose an approach for semi-automatically creating a data-to-text (D2T) corpus for Russian that can be used to learn a D2T natural language generation model. An error analysis of the output of an English-to-Russian neural machine translation system shows that 80% of the automatically translated sentences contain an error and that 53% of all translation errors bear on named entities (NE). We therefore focus on named entities and introduce two post-editing techniques for correcting wrongly translated NEs.
Pages: 44-49
Page count: 6