Improve example-based machine translation quality for low-resource language using ontology

Authors
Khan Md Anwarus K.M.A. [1]
Yamada S. [2]
Tetsuro N. [3]
Affiliations
[1] IBM Research Tokyo, 19-21 Nihonbashi, Hakozaki-cho, Chuo-ku, Tokyo
[2] NTT Corporation, NTT Hibiya Building, 1-1-6 Uchisaiwai-cho, Chiyoda-ku, Tokyo
[3] University of Electro-Communications, Graduate School of Informatics and Engineering, 1-5-1 Chofugaoka, Chofu, Tokyo
Keywords
Example-based machine translation; Knowledge engineering; WordNet
DOI
10.2991/ijndc.2017.5.3.6
Abstract
In this research, we propose using an ontology to improve the performance of an example-based machine translation (EBMT) system for a low-resource language pair. The EBMT architecture uses chunk-string templates (CSTs) and an unknown-word translation mechanism. A CST consists of a chunk in the source language, a string in the target language, and word alignment information. For unknown-word translation, we used the WordNet hypernym tree and an English-Bengali dictionary. CSTs improved wide coverage by 57 points and quality by 48.81 points in human evaluation. With CSTs, 64.29% of the system's test-set translations were acceptable. Combining CSTs with the unknown-word mechanism produced 67.85% acceptable translations on the test set, so the unknown-word mechanism improved translation quality by 3.56 points in human evaluation. Copyright © 2017, the Authors.
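The unknown-word idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy hypernym map in place of the WordNet hypernym tree and a tiny dictionary with illustrative romanized Bengali glosses. The function name `translate_unknown` and all data below are hypothetical.

```python
from typing import Dict, Optional

# Hypothetical child -> parent links standing in for a WordNet hypernym tree.
HYPERNYMS: Dict[str, str] = {
    "sparrow": "bird",
    "bird": "animal",
    "animal": "organism",
}

# Hypothetical English-Bengali dictionary (romanized glosses, illustration only).
DICTIONARY: Dict[str, str] = {
    "bird": "pakhi",
    "animal": "prani",
}


def translate_unknown(
    word: str,
    hypernyms: Dict[str, str] = HYPERNYMS,
    dictionary: Dict[str, str] = DICTIONARY,
    max_depth: int = 10,
) -> Optional[str]:
    """Walk up the hypernym chain until some ancestor has a dictionary entry.

    Returns the translation of the nearest translatable ancestor (or of the
    word itself), or None if the chain ends or max_depth is exceeded.
    """
    current = word
    for _ in range(max_depth + 1):
        if current in dictionary:
            return dictionary[current]
        if current not in hypernyms:
            return None  # reached the top of the toy tree without a match
        current = hypernyms[current]
    return None


print(translate_unknown("sparrow"))
print(translate_unknown("quark"))
```

Here "sparrow" has no dictionary entry, so the lookup falls back to its hypernym "bird". A word with no entry and no hypernym link yields `None`. The depth limit guards against cycles in a malformed hypernym map.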
Pages: 176-191
Number of pages: 15