Semantic similarity is not enough: A novel NLP-based semantic similarity measure in context

被引:0
|
作者
Abbasi, Omid Reza [1 ]
Alesheikh, Ali Asghar [1 ]
Lotfata, Aynaz [2 ]
机构
[1] KN Toosi Univ Technol, Dept Geospatial Informat Syst, Tehran, Iran
[2] Univ Calif Davis, Sch Vet Med, Dept Pathol Microbiol & Immunol, Davis, CA 95616 USA
关键词
Computer science; Geographical information science; Machine learning;
D O I
10.1016/j.isci.2024.109883
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, we addressed two primary challenges: firstly, the issue of domain shift, which pertains changes in data characteristics or context that can impact model performance, and secondly, the discrepancy between semantic similarity and geographical distance. We employed topic modeling in conjunction with the BERT architecture. Our model was crafted to enhance similarity computations applied to geospatial text, aiming to integrate both semantic similarity and geographical proximity. We tested the model on two datasets, Persian Wikipedia articles and rental property advertisements. The findings demonstrate that the model effectively improved the correlation between semantic similarity and geographical distance. Furthermore, evaluation by real -world users within a recommender system context revealed notable increase in user satisfaction by approximately 22% for Wikipedia articles and 56% for advertisements.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Ontology-based Measure of Semantic Similarity between Concepts
    Shi Bin
    Fang Liying
    Yan Jianzhuo
    Wang Pu
    Zhao Zhongcheng
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 109 - 112
  • [32] GOntoSim: a semantic similarity measure based on LCA and common descendants
    Amna Binte Kamran
    Hammad Naveed
    Scientific Reports, 12
  • [33] A GRAPH-BASED SEMANTIC SIMILARITY MEASURE FOR THE GENE ONTOLOGY
    Alvarez, Marco A.
    Yan, Changhui
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2011, 9 (06) : 681 - 695
  • [34] A relation based measure of semantic similarity for Gene Ontology annotations
    Sheehan, Brendan
    Quigley, Aaron
    Gaudin, Benoit
    Dobson, Simon
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [35] Word Embedding based Textual Semantic Similarity Measure in Bengali
    Iqbal, Md Asif
    Sharif, Omar
    Hoque, Mohammed Moshiul
    Sarker, Iqbal H.
    10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 92 - 101
  • [36] A Taxonomy based Semantic Similarity of Documents using the Cosine Measure
    Madylova, Ainura
    Oguducu, Sule Guenduez
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 129 - 134
  • [37] A semantic similarity measure based on information distance for ontology alignment
    Jiang, Yong
    Wang, Xinmin
    Zheng, Hai-Tao
    INFORMATION SCIENCES, 2014, 278 : 76 - 87
  • [38] Ontology-based Semantic Similarity Measure with Concept Lattice
    Song, Huazhu
    Xiao, Cong
    Xu, Lu
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 177 - 181
  • [39] GOntoSim: a semantic similarity measure based on LCA and common descendants
    Kamran, Amna Binte
    Naveed, Hammad
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [40] TopoICSim: a new semantic similarity measure based on gene ontology
    Ehsani, Rezvan
    Drablos, Finn
    BMC BIOINFORMATICS, 2016, 17