Semantic similarity is not enough: A novel NLP-based semantic similarity measure in context

被引:0
|
作者
Abbasi, Omid Reza [1 ]
Alesheikh, Ali Asghar [1 ]
Lotfata, Aynaz [2 ]
机构
[1] KN Toosi Univ Technol, Dept Geospatial Informat Syst, Tehran, Iran
[2] Univ Calif Davis, Sch Vet Med, Dept Pathol Microbiol & Immunol, Davis, CA 95616 USA
关键词
Computer science; Geographical information science; Machine learning;
D O I
10.1016/j.isci.2024.109883
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this study, we addressed two primary challenges: firstly, the issue of domain shift, which pertains changes in data characteristics or context that can impact model performance, and secondly, the discrepancy between semantic similarity and geographical distance. We employed topic modeling in conjunction with the BERT architecture. Our model was crafted to enhance similarity computations applied to geospatial text, aiming to integrate both semantic similarity and geographical proximity. We tested the model on two datasets, Persian Wikipedia articles and rental property advertisements. The findings demonstrate that the model effectively improved the correlation between semantic similarity and geographical distance. Furthermore, evaluation by real -world users within a recommender system context revealed notable increase in user satisfaction by approximately 22% for Wikipedia articles and 56% for advertisements.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A New Hybrid Semantic Similarity Measure Based on WordNet
    Meng, Lingling
    Gu, Junzhong
    Zhou, Zili
    NETWORK COMPUTING AND INFORMATION SECURITY, 2012, 345 : 739 - +
  • [22] Weighting-based semantic similarity measure based on topological parameters in semantic taxonomy
    Saif, Abdulgabbar
    Zainodin, Ummi Zakiah
    Omar, Nazlia
    Ghareb, Abdullah Saeed
    NATURAL LANGUAGE ENGINEERING, 2018, 24 (06) : 861 - 886
  • [23] Novel Approach to Find Semantic Similarity Measure between Words
    Sahni, Lakshay
    Sehgal, Anubhav
    Kochar, Shaivi
    Ahmad, Faiyaz
    Ahmad, Tanvir
    PROCEEDINGS OF 2014 2ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2014, : 89 - 92
  • [24] Chinese Sentence Similarity based on Word Context and Semantic
    Gu, Tianjiao
    Ren, Fuji
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 535 - 539
  • [25] The effect of context on semantic similarity measurement
    Kessler, Carsten
    Raubal, Martin
    Janowicz, Krzysztof
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2007: OTM 2007 WORKSHOPS, PT 2, PROCEEDINGS, 2007, 4806 : 1274 - +
  • [26] A measure of semantic similarity between gene ontology terms based on semantic pathway covering
    LI Rong
    Shanghai Center for Bioin-formation Technology
    School of Life Sciences
    Bioinfor-rnation Center of Shanghai Institute for Biological Sciences
    ProgressinNaturalScience, 2006, (07) : 721 - 726
  • [27] From Ontology to Semantic Similarity: Calculation of Ontology-Based Semantic Similarity
    Gan, Mingxin
    Dou, Xue
    Jiang, Rui
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [28] A measure of semantic similarity between gene ontology terms based on semantic pathway covering
    Li Rong
    Cao Shunliang
    Li Yuanyuan
    Tan Hao
    Zhu Yangyong
    Zhong Yang
    Li Yixue
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2006, 16 (07) : 721 - 726
  • [29] A Laplacian Eigenmaps Based Semantic Similarity Measure between Words
    Wu, Yuming
    Cao, Cungen
    Wang, Shi
    Wang, Dongsheng
    INTELLIGENT INFORMATION PROCESSING V, 2010, 340 : 291 - 296
  • [30] An ontology-based measure to compute semantic similarity in biomedicine
    Batet, Montserrat
    Sanchez, David
    Valls, Aida
    JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (01) : 118 - 125