GENERATING SEMANTIC SIMILARITY ATLAS FOR NATURAL LANGUAGES

被引:0
|
作者
Senel, Lutfi Kerem [1 ,2 ,3 ]
Utlu, Ihsan [1 ,2 ]
Yucesoy, Veysel [1 ]
Koc, Aykut [1 ]
Cukur, Tolga [2 ,3 ,4 ]
机构
[1] ASELSAN Res Ctr, Ankara, Turkey
[2] Bilkent Univ, Dept Elect & Elect Engn, Ankara, Turkey
[3] Bilkent Univ, UMRAM, Sabuncu Brain Res Ctr, Ankara, Turkey
[4] Bilkent Univ, Neurosci Program, Ankara, Turkey
来源
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018) | 2018年
关键词
cross-lingual semantic similarity; natural language processing; semantic similarity; word embedding; computational linguistics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-lingual studies attract a growing interest in natural language processing (NLP) research, and several studies showed that similar languages are more advantageous to work with than fundamentally different languages in transferring knowledge. Different similarity measures for the languages are proposed by researchers from different domains. However, a similarity measure focusing on semantic structures of languages can be useful for selecting pairs or groups of languages to work with, especially for the tasks requiring semantic knowledge such as sentiment analysis or word sense disambiguation. For this purpose, in this work, we leverage a recently proposed word embedding based method to generate a language similarity atlas for 76 different languages around the world. This atlas can help researchers select similar language pairs or groups in cross-lingual applications. Our findings suggest that semantic similarity between two languages is strongly correlated with the geographic proximity of the countries in which they are used.
引用
收藏
页码:795 / 799
页数:5
相关论文
共 50 条
  • [31] Meta lingua: A Language to Mediate Communication with Semantic Web in Natural Languages
    Drugus, Ioachim
    ADVANCED INFORMATION TECHNOLOGY IN EDUCATION, 2012, 126 : 109 - 115
  • [32] Ontology-based Semantic Similarity in Generating Context-aware Collaborator Recommendations
    Li, Siying
    Abel, Marie-Helene
    Negre, Elsa
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 751 - 756
  • [33] Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding
    Li, Xiaochen
    Jiang, He
    Kamei, Yasutaka
    Chen, Xin
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (10) : 1081 - 1097
  • [34] The Entropy of Morphological Systems in Natural Languages Is Modulated by Functional and Semantic Properties
    Franzon, Francesca
    Zanini, Chiara
    JOURNAL OF QUANTITATIVE LINGUISTICS, 2023, 30 (01) : 42 - 66
  • [35] Turkic languages in the Atlas of the Languages of Iran (ALI)
    Anonby, Erik
    Taheri-Ardali, Mortaza
    Schreiber, Laurentia
    Bulut, Christiane
    Haig, Geoffrey
    Dehkordi, Peiman Pishyar
    Talebi-Dastenaei, Mahnaz
    Shahverdi, Fatemeh
    Zarajabad, Hossein Hashemi
    Mohammadirad, Masoud
    Jamaleddin, Faranak
    Rahnema, Zohreh
    Mohammadi, Mohammad
    Izady, Elham
    Meshkinfam, Mehrdad
    Opengin, Ergin
    Nemati, Fatemeh
    TURKIC LANGUAGES, 2020, 24 (02): : 290 - 308
  • [36] Similarity in languages and programs
    Cui, Cewei
    Dang, Zhe
    Fischer, Thomas R.
    Ibarra, Oscar H.
    THEORETICAL COMPUTER SCIENCE, 2013, 498 : 58 - 75
  • [37] A Grammar-Based Semantic Similarity Algorithm for Natural Language Sentences
    Lee, Ming Che
    Chang, Jia Wei
    Hsieh, Tung Cheng
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [38] A METHOD FOR THE COMPUTATION OF THE SEMANTIC SIMILARITY AND RELATEDNESS BETWEEN NATURAL LANGUAGE WORDS
    Anisimov, A. V.
    Marchenko, O. O.
    Kysenko, V. K.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2011, 47 (04) : 515 - 522
  • [39] Atlas of the languages of Suriname.
    Launey, M
    LANGUAGE IN SOCIETY, 2005, 34 (01) : 151 - 155
  • [40] Atlas of the languages of Suriname.
    Goury, L
    JOURNAL OF PIDGIN AND CREOLE LANGUAGES, 2005, 20 (02) : 364 - 369