Extremely Low-resource Multilingual Neural Machine Translation for Indic Mizo Language

被引:0
|
作者
Lalrempuii C. [1 ]
Soni B. [1 ]
机构
[1] Department of Computer Science and Engineering, National Institute of Technology, Silchar, Assam, Silcha
关键词
BLEU; Flores; Indic; Machine Translation; Mizo; Multilingual; NMT;
D O I
10.1007/s41870-023-01480-8
中图分类号
学科分类号
摘要
Machine translation requires a vast amount of parallel data in order to generate high-quality translations. Since many Indian languages lack sufficient resources, enhancing translation performance for these language pairs can have a significant impact. This study aims to address the issue of low-resource neural machine translation between Mizo and English by utilizing other Indian languages in a multilingual framework. We explore the use of multilingual techniques for 13 pairs of Indic languages in both many-to-one and one-to-many setups, proposing a method for Multilingual Neural Machine Translation to enhance the translation quality of the low-resource Mizo language. We assess the effectiveness of ensemble decoding and transliteration into the Roman script through qualitative and quantitative approaches. The empirical findings demonstrate that incorporating transliteration and utilizing ensemble decoding with checkpoint ensembles leads to improved translation quality. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:4275 / 4282
页数:7
相关论文
共 50 条
  • [1] Extremely low-resource neural machine translation for Asian languages
    Rubino, Raphael
    Marie, Benjamin
    Dabre, Raj
    Fujita, Atushi
    Utiyama, Masao
    Sumita, Eiichiro
    [J]. MACHINE TRANSLATION, 2020, 34 (04) : 347 - 382
  • [2] Multilingual Neural Machine Translation for Indic to Indic Languages
    Das, Sudhansu Bala
    Panda, Divyajyoti
    Mishra, Tapas Kumar
    Patra, Bidyut Kr.
    Ekbal, Asif
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (05)
  • [3] An Analysis of Massively Multilingual Neural Machine Translation for Low-Resource Languages
    Mueller, Aaron
    Nicolai, Garrett
    McCarthy, Arya D.
    Lewis, Dylan
    Wu, Winston
    Yarowsky, David
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3710 - 3718
  • [4] Language Model Prior for Low-Resource Neural Machine Translation
    Baziotis, Christos
    Haddow, Barry
    Birch, Alexandra
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7622 - 7634
  • [5] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Singh, Salam Michael
    Singh, Thoudam Doren
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14823 - 14844
  • [6] An empirical study of low-resource neural machine translation of manipuri in multilingual settings
    Salam Michael Singh
    Thoudam Doren Singh
    [J]. Neural Computing and Applications, 2022, 34 : 14823 - 14844
  • [7] Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation
    Liu, Zihan
    Winata, Genta Indra
    Fung, Pascale
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2706 - 2718
  • [8] Machine Translation into Low-resource Language Varieties
    Kumar, Sachin
    Anastasopoulos, Antonios
    Wintner, Shuly
    Tsvetkov, Yulia
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 110 - 121
  • [9] A Survey on Low-Resource Neural Machine Translation
    Wang, Rui
    Tan, Xu
    Luo, Renqian
    Qin, Tao
    Liu, Tie-Yan
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4636 - 4643
  • [10] Transformers for Low-resource Neural Machine Translation
    Gezmu, Andargachew Mekonnen
    Nuernberger, Andreas
    [J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 1, 2022, : 459 - 466