Semantic similarity is not enough: A novel NLP-based semantic similarity measure in context

被引:0
|
作者
Abbasi, Omid Reza [1 ]
Alesheikh, Ali Asghar [1 ]
Lotfata, Aynaz [2 ]
机构
[1] KN Toosi Univ Technol, Dept Geospatial Informat Syst, Tehran, Iran
[2] Univ Calif Davis, Sch Vet Med, Dept Pathol Microbiol & Immunol, Davis, CA 95616 USA
关键词
Computer science; Geographical information science; Machine learning;
D O I
10.1016/j.isci.2024.109883
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, we addressed two primary challenges: firstly, the issue of domain shift, which pertains changes in data characteristics or context that can impact model performance, and secondly, the discrepancy between semantic similarity and geographical distance. We employed topic modeling in conjunction with the BERT architecture. Our model was crafted to enhance similarity computations applied to geospatial text, aiming to integrate both semantic similarity and geographical proximity. We tested the model on two datasets, Persian Wikipedia articles and rental property advertisements. The findings demonstrate that the model effectively improved the correlation between semantic similarity and geographical distance. Furthermore, evaluation by real -world users within a recommender system context revealed notable increase in user satisfaction by approximately 22% for Wikipedia articles and 56% for advertisements.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Semantic similarity measure of polish nouns based on linguistic features
    Piasecki, Maciej
    Broda, Bartosz
    BUSINESS INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4439 : 381 - +
  • [42] TopoICSim: a new semantic similarity measure based on gene ontology
    Rezvan Ehsani
    Finn Drabløs
    BMC Bioinformatics, 17
  • [43] A relation based measure of semantic similarity for Gene Ontology annotations
    Brendan Sheehan
    Aaron Quigley
    Benoit Gaudin
    Simon Dobson
    BMC Bioinformatics, 9
  • [44] Semantic Similarity Based on Taxonomies
    De Nicola, Antonio
    Formica, Anna
    Mele, Ida
    Taglino, Francesco
    APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [45] Based on semantic web similarity
    Li Ruijie
    Yang Weidong
    Jiang Haowei
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 9 (ICCSIT 2010), 2010, : 327 - 330
  • [46] FAST: A Fuzzy Semantic Sentence Similarity Measure
    Chandran, David
    Crockett, Keeley
    Mclean, David
    Bandar, Zuhair
    2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [47] An efficient method to measure the semantic similarity of ontologies
    Wang, James
    Ali, Farha
    Srimani, Pradip
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2010, 6 (01) : 88 - +
  • [48] Exploiting Synonymy to Measure Semantic Similarity of Sentences
    Shin, Youhyun
    Ahn, Yeonchan
    Kim, Hyuntak
    Lee, Sang-goo
    ACM IMCOM 2015, PROCEEDINGS, 2015,
  • [49] A Semantic and Syntactic Similarity Measure for Political Tweets
    Little, Claire
    Mclean, David
    Crockett, Keeley
    Edmonds, Bruce
    IEEE ACCESS, 2020, 8 : 154095 - 154113
  • [50] On fuzzy semantic similarity measure for DNA coding
    Ahmad, Muneer
    Jung, Low Tang
    Bhuiyan, Md Al-Amin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2016, 69 : 144 - 151