EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS

被引:0
|
作者
Cui, Qing-jun [1 ]
Zhang, Hui [1 ]
Liu, Rui [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
关键词
Representative Features; semantic relatedness; Wikipedia; Concept Interpreting Network;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In order to evaluate semantic relatedness of natural language concepts automatically, we propose Representative Features Analysis (RFA), a novel approach that represents the meaning of concepts in a high-dimensional space of representative features as a semantic-surrounding concept vector. The vector elements are weighted by the combination of TF-IDF scheme and the link status of Concept Interpreting Network in which nodes represent the concepts and edges represent the interpreting relation between concepts. Assessing the relatedness amounts to comparing the corresponding vectors using conventional metrics. Compared with the previous state of the art, using RFA results in substantial improvements in correlation of computed relatedness scores with human judgments: from r = 0.75 to 0.78 for concepts and performs better in recalling the top n relevant concepts than ESA method. Importantly, the RFA model could evaluate semantic similarity for concepts with low occurrence in Wikipeida articles and eliminate the negative effect caused by the meaningless occurrence of words in the Wikipedia articles, which the approach of ESA neglects.
引用
收藏
页码:467 / 472
页数:6
相关论文
共 50 条
  • [31] Semantic Relatedness for Named Entity Disambiguation Using a Small Wikipedia
    Fernandez, Izaskun
    Alegria, Inaki
    Ezeiza, Nerea
    TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 276 - 283
  • [32] Semantic Relatedness Estimation using the Layout Information of Wikipedia Articles
    Chan, Patrick
    Hijikata, Yoshinori
    Kuramochi, Toshiya
    Nishida, Shogo
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2013, 7 (02) : 30 - 48
  • [33] Semantic concept model using Wikipedia semantic features
    Saif, Abdulgabbar
    Omar, Nazlia
    Ab Aziz, Mohd Juzaiddin
    Zainodin, Ummi Zakiah
    Salim, Naomie
    JOURNAL OF INFORMATION SCIENCE, 2018, 44 (04) : 526 - 551
  • [34] Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning
    Kim, Han-joon
    Kim, Jiyun
    Kim, Jinseog
    Lim, Pureum
    NEUROCOMPUTING, 2018, 315 : 128 - 134
  • [35] Efficient feature integration with Wikipedia-based semantic feature extraction for Turkish text summarization
    Guran, Aysun
    Bayazit, Nilgun Guler
    Gurbuz, Mustafa Zahid
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2013, 21 (05) : 1411 - 1425
  • [36] An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links
    Zhu, Xinhua
    Guo, Qingsong
    Zhang, Bo
    Li, Fei
    APPLIED INTELLIGENCE, 2019, 49 (10) : 3708 - 3730
  • [37] Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia
    Granada, Roger
    Trojahn, Cassia
    Vieira, Renata
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, 2014, 8775 : 170 - 175
  • [38] An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links
    Xinhua Zhu
    Qingsong Guo
    Bo Zhang
    Fei Li
    Applied Intelligence, 2019, 49 : 3708 - 3730
  • [39] Measuring Semantic Relatedness using Wikipedia Revision Information in a Signed Network
    Yang, Wen-Teng
    Kao, Hung-Yu
    2011 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2011), 2011, : 69 - 74
  • [40] Using Wikipedia-based conceptual contexts to calculate document similarity
    Kaiser, Fabian
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL SOCIETY: ICDS 2009, PROCEEDINGS, 2009, : 322 - 327