Linguistic Divergence of Sinhala and Tamil languages in Machine Translation

被引:0
|
作者
Dilshani, W. S. N. [1 ]
Yashothara, S. [1 ]
Uthayasanker, R. T. [1 ]
Jayasena, S. [1 ]
机构
[1] Univ Moratuwa, Dept Comp Sci & Engn, Moratuwa, Sri Lanka
关键词
Language Divergence; Sinhala; Tamil; Dorr's classification; NLP; translation challenges;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a study of the lexical-semantic divergence between Sinhala and Tamil languages. Study of divergence is critical as differences in linguistic and extra-linguistic features in languages play pivotal roles in translation. This research the first study of the divergence between Sinhala and Tamil languages and is based on Dorr's classification. We propose a computer-assisted divergence study procedure using statistical machine translation, which is easy and gives good performance compared to traditional approaches. Accordingly, this research has the twin aims of revisiting classification of divergence types as outlined by Dorr and outlining some of the new divergence patterns specific to Sinhala and Tamil languages. This study proposes a rule-based algorithm to classify a divergence.
引用
收藏
页码:13 / 18
页数:6
相关论文
共 50 条
  • [22] Addressing Word-order Divergence in Multilingual Neural Machine Translation for Extremely Low Resource Languages
    Murthy, Rudra, V
    Kunchukuttan, Anoop
    Bhattacharyya, Pushpak
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 3868 - 3873
  • [23] A Review on Machine Translation in Indian Languages
    Chopra, Deepti
    Joshi, Nisheeth
    Mathur, Iti
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2018, 8 (05) : 3475 - 3478
  • [24] Machine translation of very close languages
    Hajic, J
    Hric, J
    Kubon, V
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 7 - 12
  • [25] Neural machine translation of Indian languages
    Revanuru, Karthik
    Turlapaty, Kaushik
    Rao, Shrisha
    ACM International Conference Proceeding Series, 2017, : 11 - 20
  • [26] Neural Machine Translation of Indian Languages
    Revanuru, Karthik
    Turlapaty, Kaushik
    Rao, Shrisha
    COMPUTE'17: PROCEEDINGS OF THE 10TH ANNUAL ACM INDIA COMPUTE CONFERENCE, 2017, : 11 - 20
  • [27] Neural Machine Translation for Indian Languages
    Pathak, Amarnath
    Pakray, Partha
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 465 - 477
  • [28] SINHALA-TAMIL OR CENTER-PERIPHERY
    不详
    ECONOMIC AND POLITICAL WEEKLY, 1983, 18 (34) : 1472 - 1472
  • [29] RECENT DEVELOPMENTS IN SINHALA-TAMIL RELATIONS
    SIRIWEERA, WI
    ASIAN SURVEY, 1980, 20 (09) : 903 - 913
  • [30] LINGUISTIC MATERIALS FOR THE MACHINE TRANSLATION SYSTEMS
    Vicic, Jernej
    ANNALES-ANALI ZA ISTRSKE IN MEDITERANSKE STUDIJE-SERIES HISTORIA ET SOCIOLOGIA, 2016, 26 (04): : 751 - 766