Improving English-to-Indian Language Neural Machine Translation Systems

Cited by: 1
Authors
Kandimalla, Akshara [1]
Lohar, Pintu [2]
Maji, Souvik Kumar [1]
Way, Andy [2]
Affiliations
[1] Dublin City Univ, Sch Comp, Dublin D09 E432, Ireland
[2] Dublin City Univ, ADAPT Ctr, Dublin D09 Y074, Ireland
Funding
Science Foundation Ireland
Keywords
machine translation; back-translation; parallel data
DOI
10.3390/info13050245
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this study, we build English-to-Indian language Neural Machine Translation (NMT) systems using the state-of-the-art transformer architecture. We also investigate the utility of back-translation and its effect on system performance. Our experimental evaluation shows that back-translation improves the BLEU scores of both the English-to-Hindi and English-to-Bengali NMT systems, and that it is more useful for improving the quality of weaker baseline MT systems. In addition, we perform a manual evaluation of the translation outputs and observe that the BLEU metric cannot always assess MT quality as well as human evaluators can. Our analysis shows that the MT outputs for the English-Bengali pair are actually better than their BLEU scores suggest.
Pages: 11