Neural Machine Translation for Low-Resource Languages from a Chinese-centric Perspective: A Survey

被引:1
|
作者
Zhang, Jinyi [1 ,2 ]
Su, Ke [1 ]
Li, Haowei [1 ]
Mao, Jiannan [2 ]
Tian, Ye
Wen, Feng [1 ]
Guo, Chong [1 ]
Matsumoto, Tadahiro [2 ]
机构
[1] Shenyang Ligong Univ, Sch Informat Sci & Engn, Shenyang, Liaoning, Peoples R China
[2] Gifu Univ, Fac Engn, Gifu, Gifu, Japan
关键词
Low-resource languages; neural machine translation; unsupervised learning; transfer learning; multilingual translation; large language models; Chinese-centric languages; TRANSFORMER; BERT;
D O I
10.1145/3665244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine translation-the automatic transformation of one natural language (source language) into another (target language) through computational means-occupies a central role in computational linguistics and stands as a cornerstone of research within the field of Natural Language Processing (NLP). In recent years, the prominence of NeuralMachine Translation (NMT) has grown exponentially, offering an advanced framework formachine translation research. It is noted for its superior translation performance, especially when tackling the challenges posed by low-resource language pairs that suffer from a limited corpus of data resources. This article offers an exhaustive exploration of the historical trajectory and advancements inNMT, accompanied by an analysis of the underlying foundational concepts. It subsequently provides a concise demarcation of the unique characteristics associated with low-resource languages and presents a succinct review of pertinent translation models and their applications, specifically within the context of languages with low-resources. Moreover, this article delves deeply into machine translation techniques, highlighting approaches tailored for Chinese-centric low-resource languages. Ultimately, it anticipates upcoming research directions in the realm of low-resource language translation.
引用
收藏
页数:60
相关论文
共 50 条
  • [31] Revisiting Low-Resource Neural Machine Translation: A Case Study
    Sennrich, Rico
    Zhang, Biao
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 211 - 221
  • [32] Simple measures of bridging lexical divergence help unsupervised neural machine translation for low-resource languages
    Khatri, Jyotsana
    Murthy, Rudra
    Banerjee, Tamali
    Bhattacharyya, Pushpak
    MACHINE TRANSLATION, 2021, 35 (04) : 711 - 744
  • [33] Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
    Choi, Gyu-Hyeon
    Shin, Jong-Hun
    Kim, Young-Kil
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 900 - 904
  • [34] Enabling Medical Translation for Low-Resource Languages
    Musleh, Ahmad
    Durrani, Nadir
    Temnikova, Irina
    Nakov, Preslav
    Vogel, Stephan
    Alsaad, Osama
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II, 2018, 9624 : 3 - 16
  • [35] Incident-Driven Machine Translation and Name Tagging for Low-resource Languages
    Hermjakob, Ulf
    Li, Qiang
    Marcu, Daniel
    May, Jonathan
    Mielke, Sebastian J.
    Pourdamghani, Nima
    Pust, Michael
    Shi, Xing
    Knight, Kevin
    Levinboim, Tomer
    Murray, Kenton
    Chiang, David
    Zhang, Boliang
    Pan, Xiaoman
    Lu, Di
    Lin, Ying
    Ji, Heng
    MACHINE TRANSLATION, 2018, 32 (1-2) : 59 - 89
  • [36] Improving neural machine translation for low-resource Indian languages using rule-based feature extraction
    Muskaan Singh
    Ravinder Kumar
    Inderveer Chana
    Neural Computing and Applications, 2021, 33 : 1103 - 1122
  • [37] Improving neural machine translation for low-resource Indian languages using rule-based feature extraction
    Singh, Muskaan
    Kumar, Ravinder
    Chana, Inderveer
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (04): : 1103 - 1122
  • [38] A Diverse Data Augmentation Strategy for Low-Resource Neural Machine Translation
    Li, Yu
    Li, Xiao
    Yang, Yating
    Dong, Rui
    INFORMATION, 2020, 11 (05)
  • [39] Semantic Perception-Oriented Low-Resource Neural Machine Translation
    Wu, Nier
    Hou, Hongxu
    Li, Haoran
    Chang, Xin
    Jia, Xiaoning
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 51 - 62
  • [40] Rethinking the Exploitation of Monolingual Data for Low-Resource Neural Machine Translation
    Pang, Jianhui
    Yang, Baosong
    Wong, Derek Fai
    Wan, Yu
    Liu, Dayiheng
    Chao, Lidia Sam
    Xie, Jun
    COMPUTATIONAL LINGUISTICS, 2023, 50 (01) : 25 - 47