EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS

被引:0
|
作者
Cui, Qing-jun [1 ]
Zhang, Hui [1 ]
Liu, Rui [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
关键词
Representative Features; semantic relatedness; Wikipedia; Concept Interpreting Network;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In order to evaluate semantic relatedness of natural language concepts automatically, we propose Representative Features Analysis (RFA), a novel approach that represents the meaning of concepts in a high-dimensional space of representative features as a semantic-surrounding concept vector. The vector elements are weighted by the combination of TF-IDF scheme and the link status of Concept Interpreting Network in which nodes represent the concepts and edges represent the interpreting relation between concepts. Assessing the relatedness amounts to comparing the corresponding vectors using conventional metrics. Compared with the previous state of the art, using RFA results in substantial improvements in correlation of computed relatedness scores with human judgments: from r = 0.75 to 0.78 for concepts and performs better in recalling the top n relevant concepts than ESA method. Importantly, the RFA model could evaluate semantic similarity for concepts with low occurrence in Wikipeida articles and eliminate the negative effect caused by the meaningless occurrence of words in the Wikipedia articles, which the approach of ESA neglects.
引用
收藏
页码:467 / 472
页数:6
相关论文
共 50 条
  • [1] Computing Semantic Relatedness using Wikipedia-based Explicit Semantic Analysis
    Gabrilovich, Evgeniy
    Markovitch, Shaul
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1606 - 1611
  • [2] A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features
    Jabeen, Shahida
    Gao, Xiaoying
    Andreae, Peter
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 523 - 533
  • [3] A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features
    Jabeen, Shahida
    Gao, Xiaoying
    Andreae, Peter
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8786 : 523 - 533
  • [4] Computing semantic relatedness using Wikipedia features
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    KNOWLEDGE-BASED SYSTEMS, 2013, 50 : 260 - 278
  • [5] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    Qureshi, M. Atif
    Younus, Arjumand
    O'Riordan, Colm
    Pasi, Gabriella
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2018, 9 (05) : 1403 - 1413
  • [6] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    M. Atif Qureshi
    Arjumand Younus
    Colm O’Riordan
    Gabriella Pasi
    Journal of Ambient Intelligence and Humanized Computing, 2018, 9 : 1403 - 1413
  • [7] Exploiting Wikipedia for Evaluating Semantic Relatedness Mechanisms
    Ferrara, Felice
    Tasso, Carlo
    BRIDGING BETWEEN CULTURAL HERITAGE INSTITUTIONS, 2014, 385 : 105 - 117
  • [8] A WIKIPEDIA-BASED FRAMEWORK FOR COLLABORATIVE SEMANTIC ANNOTATION
    Fernandez, N.
    Fisteus, J. A.
    Fuentes, D.
    Sanchez, L.
    Luque, V.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2011, 20 (05) : 847 - 886
  • [9] A Wikipedia-based Semantic Model for Text Clustering
    Zhou, Jing-min
    Cui, Qing-jun
    Zhang, Hui
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 2, 2011, : 413 - 416
  • [10] Wikipedia-based Semantic Interpretation for Natural Language Processing
    Gabrilovich, Evgeniy
    Markovitch, Shaul
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 443 - 498