Improve example-based machine translation quality for low-resource language using ontology

Cited by: 1
Authors
Khan Md Anwarus K.M.A. [1]
Yamada S. [2]
Tetsuro N. [3]
Affiliations
[1] IBM Research Tokyo, 19-21 Nihonbashi, Hakozaki-cho, Chuo-ku, Tokyo
[2] NTT Corporation, NTT Hibiya Building, 1-1-6 Uchisaiwai-cho, Chiyoda-ku, Tokyo
[3] University of Electro-Communications, Graduate School of Informatics and Engineering, 1-5-1 Chofugaoka, Chofu, Tokyo
Keywords
Example-based machine translation; Knowledge engineering; WordNet
DOI
10.2991/ijndc.2017.5.3.6
Abstract
In this research we propose using an ontology to improve the performance of an example-based machine translation (EBMT) system for a low-resource language pair. The EBMT architecture uses chunk-string templates (CSTs) and an unknown-word translation mechanism. A CST consists of a chunk in the source language, a string in the target language, and word alignment information. For unknown-word translation, we used the WordNet hypernym tree and an English-Bengali dictionary. CSTs improved wide coverage by 57 points and quality by 48.81 points in human evaluation. With CSTs, 64.29% of the system's test-set translations were acceptable. Combining CSTs with the unknown-word mechanism produced 67.85% acceptable translations on the test set, so the unknown-word mechanism improved translation quality by 3.56 points in human evaluation. Copyright © 2017, the Authors.
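The unknown-word idea in the abstract (fall back along a hypernym tree until a dictionary entry is found) can be illustrated with a minimal sketch. This is not the authors' implementation: the `HYPERNYM` map is a hand-made stand-in for WordNet's hypernym tree, the `EN_BN` entries are a toy English-Bengali dictionary, and `translate_unknown` and its `max_steps` parameter are hypothetical names chosen for illustration.

```python
from typing import Optional

# Toy hypernym links (child -> parent), standing in for WordNet's hypernym tree.
# These entries are illustrative assumptions, not data from the paper.
HYPERNYM = {
    "dove": "pigeon",
    "pigeon": "bird",
    "bird": "animal",
}

# Toy English-Bengali dictionary; entries are illustrative assumptions.
EN_BN = {
    "bird": "পাখি",
    "animal": "প্রাণী",
}


def translate_unknown(word: str, max_steps: int = 5) -> Optional[str]:
    """Translate `word` directly, or via its nearest hypernym that has a dictionary entry.

    Climbs at most `max_steps` hypernym links; returns None if nothing is found.
    """
    current = word
    for _ in range(max_steps + 1):
        if current in EN_BN:
            return EN_BN[current]
        if current not in HYPERNYM:
            return None  # reached the top of the toy tree without a match
        current = HYPERNYM[current]
    return None


if __name__ == "__main__":
    # "dove" has no dictionary entry, so the lookup climbs dove -> pigeon -> bird.
    print(translate_unknown("dove"))
```

A more general term is a lossy substitute for the original word, so limiting the climb with `max_steps` is one way to keep the fallback from drifting to an overly generic translation.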
Pages: 176-191
Page count: 15