A Fast and Efficient Semantic Short Text Similarity Metric

Cited by: 0
Authors
Croft, David [1 ]
Coupland, Simon [2 ]
Shell, Jethro [1 ]
Brown, Stephen [1 ]
Affiliations
[1] De Montfort Univ, Knowledge Media Design, Leicester LE1 9BH, Leics, England
[2] De Montfort Univ, Ctr Computat Intelligence, Leicester LE1 9BH, Leics, England
Funding
Arts and Humanities Research Council (UK);
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The semantic comparison of short sections of text is an emerging aspect of Natural Language Processing (NLP). In this paper we present a novel Short Text Semantic Similarity (STSS) method, Lightweight Semantic Similarity (LSS), to address the issues that arise with sparse text representation. The proposed approach captures the semantic information contained when comparing text to process the similarity. The methodology combines semantic term similarities with a vector similarity method used within statistical analysis. A modification of the term vectors using synset similarity values addresses issues that are encountered with sparse text. LSS is shown to be comparable to current semantic similarity approaches, LSA and STASIS, whilst having a lower computational footprint.
Pages: 221 - 227
Page count: 7
Related papers
50 in total
  • [1] Benchmarking short text semantic similarity
    O'Shea, James
    Bandar, Zuhair
    Crockett, Keeley
    McLean, David
    [J]. International Journal of Intelligent Information and Database Systems, 2010, 4 (02) : 103 - 120
  • [2] Semantic similarity metric and its application in text classification
    Zhang, Pei-ying
    [J]. PROGRESS IN CIVIL ENGINEERING, PTS 1-4, 2012, 170-173 : 3711 - 3714
  • [3] Short Text Semantic Similarity Measurement Approach Based on Semantic Network
    Hameed, Naamah Hussien
    Alimi, Adel M.
    Sadiq, Ahmed T.
    [J]. BAGHDAD SCIENCE JOURNAL, 2022, 19 (06) : 1581 - 1591
  • [4] An algorithm for semantic similarity of short text based on WordNet
    Zhai, Yan-Dong
    Wang, Kang-Ping
    Zhang, Dong-Na
    Huang, Lan
    Zhou, Chun-Guang
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2012, 40 (03) : 617 - 620
  • [5] NGram Approach for Semantic Similarity on Arabic Short Text
    Al-Mahmoud, Rana Husni
    Sharieh, Ahmad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 857 - 866
  • [6] Short Text Similarity Calculation Using Semantic Information
    Pu, Haoyu
    Fei, Gaolei
    Zhao, Hailin
    Hu, Guangmin
    Jiao, Chengbo
    Xu, Zhoujun
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 144 - 150
  • [7] A comparative study of two short text semantic similarity measures
    O'Shea, James
    Bandar, Zuhair
    Crockett, Keeley
    McLean, David
    [J]. AGENT AND MULTI-AGENT SYSTEMS: TECHNOLOGIES AND APPLICATIONS, PROCEEDINGS, 2008, 4953 : 172 - 181
  • [8] A survey on the techniques, applications, and performance of short text semantic similarity
    Han, Mengting
    Zhang, Xuan
    Yuan, Xin
    Jiang, Jiahao
    Yun, Wei
    Gao, Chen
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [9] MEASURING SHORT TEXT SEMANTIC SIMILARITY USING MULTIPLE MEASUREMENTS
    Zhu, Tian-Tian
    Lan, Man
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 808 - 813
  • [10] Short Text Similarity Calculation Based on Jaccard and Semantic Mixture
    Wu, Shushu
    Liu, Fang
    Zhang, Kai
    [J]. Communications in Computer and Information Science, 2021, 1363 CCIS : 37 - 45