LOST IN MACHINE TRANSLATION: CONTEXTUAL LINGUISTIC UNCERTAINTY

被引:0
|
作者
Sukhoverkhov, Anton, V [1 ]
DeWitt, Dorothy [2 ]
Manasidi, Ioannis I. [3 ]
Nitta, Keiko [4 ]
Krstic, Vladimir [5 ]
机构
[1] Kuban State Agr Univ, Dept Philosophy, Kalinina St 13, Krasnodar 350044, Russia
[2] Univ Malaya, Dept Curriculum & Instruct Technol, Kuala Lumpur 50603, Malaysia
[3] Kuban State Agr Univ, Fac Appl Informat, Kalinina St 13, Krasnodar 350044, Russia
[4] Rikkyo Univ, Coll Arts, Dept Letters, 3-34-1 Nishi Ikebukuro, Toshima City, Tokyo 1718501, Japan
[5] Univ Auckland, Dept Philosophy, Private Bag 92019, Auckland 1142, New Zealand
来源
VESTNIK VOLGOGRADSKOGO GOSUDARSTVENNOGO UNIVERSITETA-SERIYA 2-YAZYKOZNANIE | 2019年 / 18卷 / 04期
关键词
machine translation; untranslatability; contextual translation; linguistic relativity; lexical ambiguity; syntactic ambiguity; TRENDS;
D O I
10.15688/jvolsu2.2019.4.10
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The article considers the issues related to the semantic, grammatical, stylistic and technical difficulties currently present in machine translation and compares its four main approaches: Rule-based (RBMT), Corpora-based (CBMT), Neural (NMT), and Hybrid ( HMT). It also examines some "open systems", which allow the correction or augmentation of content by the users themselves ("crowdsourced translation"). The authors of the article, native speakers presenting different countries (Russia, Greece, Malaysia, Japan and Serbia), tested the translation quality of the most representative phrases from the English, Russian, Greek, Malay and Japanese languages by using different machine translation systems: PROMT ( RBMT), Yandex. Translate (HMT) and Google Translate (NMT). The test results presented by the authors show low "comprehension level" of semantic, linguistic and pragmatic contexts of translated texts, mistranslations of rare and culture-specific words, unnecessary translation of proper names, as well as a low rate of idiomatic phrase and metaphor recognition. It is argued that the development of machine translation requires incorporation of literal, conceptual, and contentand-contextual forms of meaning processing into text translation expansion of metaphor corpora and contextological dictionaries, and implementation of different types and styles of translation, which take into account gender peculiarities, specific dialects and idiolects of users. The problem of untranslatability (`linguistic relativity') of the concepts, unique to a particular culture, has been reviewed from the perspective of machine translation. It has also been shown, that the translation of booming Internet slang, where national languages merge with English, is almost impossible without human correction.
引用
收藏
页码:129 / 144
页数:16
相关论文
共 50 条
  • [31] On the Linguistic Representational Power of Neural Machine Translation Models
    Belinkov, Yonatan
    Durrani, Nadir
    Dalvi, Fahim
    Sajjad, Hassan
    Glass, James
    COMPUTATIONAL LINGUISTICS, 2020, 46 (01) : 1 - 52
  • [32] Lost in machine translation: The promises and pitfalls of machine translation for multilingual group work in global health education
    David C. Hill
    Christy Gombay
    Otto Sanchez
    Bethel Woappi
    Andrea S. Romero Vélez
    Stuart Davidson
    Emma Z. L. Richardson
    Discover Education, 1 (1):
  • [33] BERTology for Machine Translation: What BERT Knows about Linguistic Difficulties for Translation
    Dai, Yuqian
    De Kamps, Marc
    Sharoff, Serge
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6674 - 6690
  • [34] Detecting Source Contextual Barriers for Understanding Neural Machine Translation
    Li, Guanlin
    Liu, Lemao
    Zhu, Conghui
    Wang, Rui
    Zhao, Tiejun
    Shi, Shuming
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3158 - 3169
  • [35] Multilingual Neural Machine Translation: Can Linguistic Hierarchies Help?
    Saleh, Fahimeh
    Buntine, Wray
    Haffari, Gholamreza
    Du, Lan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1313 - 1330
  • [36] Linguistic knowledge-based vocabularies for Neural Machine Translation
    Casas, Noe
    Costa-jussa, Marta R.
    Fonollosa, Jose A. R.
    Alonso, Juan A.
    Fanlo, Ramon
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (04) : 485 - 506
  • [37] Error Detection for Statistical Machine Translation Using Linguistic Features
    Xiong, Deyi
    Zhang, Min
    Li, Haizhou
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 604 - 611
  • [38] Linguistic Factors in Statistical Machine Translation Involving Arabic Language
    Youssef, Islam
    Sakr, Mohamed
    Kouta, Mohamed
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (11): : 154 - 159
  • [39] Statistical Machine Translation Enhancements through Linguistic Levels: A Survey
    Costa-Jussa, Marta R.
    Farrus, Mireia
    ACM COMPUTING SURVEYS, 2014, 46 (03)
  • [40] Improving statistical machine translation using shallow linguistic knowledge
    Hwang, Young-Sook
    Finch, Andrew
    Sasaki, Yutaka
    COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02): : 350 - 372