EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS

被引:0
|
作者
Cui, Qing-jun [1 ]
Zhang, Hui [1 ]
Liu, Rui [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
关键词
Representative Features; semantic relatedness; Wikipedia; Concept Interpreting Network;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In order to evaluate semantic relatedness of natural language concepts automatically, we propose Representative Features Analysis (RFA), a novel approach that represents the meaning of concepts in a high-dimensional space of representative features as a semantic-surrounding concept vector. The vector elements are weighted by the combination of TF-IDF scheme and the link status of Concept Interpreting Network in which nodes represent the concepts and edges represent the interpreting relation between concepts. Assessing the relatedness amounts to comparing the corresponding vectors using conventional metrics. Compared with the previous state of the art, using RFA results in substantial improvements in correlation of computed relatedness scores with human judgments: from r = 0.75 to 0.78 for concepts and performs better in recalling the top n relevant concepts than ESA method. Importantly, the RFA model could evaluate semantic similarity for concepts with low occurrence in Wikipeida articles and eliminate the negative effect caused by the meaningless occurrence of words in the Wikipedia articles, which the approach of ESA neglects.
引用
收藏
页码:467 / 472
页数:6
相关论文
共 50 条
  • [21] Measuring Semantic Relatedness using Wikipedia Signed Network
    Yang, Wen-Teng
    Kao, Hung-Yu
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 615 - 630
  • [22] Measuring semantic relatedness using wikipedia signed network
    1600, Institute of Information Science (29):
  • [23] A Novel Semantic Tagging Technique Exploiting Wikipedia-based Associated Words
    Hong, Hyun-Ki
    Park, Kyung-Wook
    Lee, Dong-Ho
    IEEE 39TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC 2015), VOL 3, 2015, : 648 - 649
  • [24] WSR: A semantic relatedness measure based on Wikipedia structure
    Sun, C.-C. (bigchansuns@163.com), 1600, Science Press (35):
  • [25] Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval
    Tu, Xinhui
    He, Tingting
    Chen, Long
    Luo, Jing
    Zhang, Maoyuan
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 370 - +
  • [26] A Self-Adaptive Explicit Semantic Analysis Method for Computing Semantic Relatedness using Wikipedia
    Wang, Weiping
    Chen, Peng
    Liu, Bowen
    2008 INTERNATIONAL SEMINAR ON FUTURE INFORMATION TECHNOLOGY AND MANAGEMENT ENGINEERING, PROCEEDINGS, 2008, : 3 - 6
  • [27] Clustering Documents Using a Wikipedia-Based Concept Representation
    Huang, Anna
    Milne, David
    Frank, Eibe
    Witten, Ian H.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2009, 5476 : 628 - 636
  • [28] Semantic relatedness measurement based on Wikipedia link co-occurrence analysis
    Ito, Masahiro
    Nakayama, Kotaro
    Hara, Takahiro
    Nishio, Shojiro
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2011, 7 (01) : 44 - +
  • [29] Wikipedia-Based Semantic Similarity Measurements for Noisy Short Texts Using Extended Naive Bayes
    Shirakawa, Masumi
    Nakayama, Kotaro
    Hara, Takahiro
    Nishio, Shojiro
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2015, 3 (02) : 205 - 219
  • [30] Learning to Compute Semantic Relatedness Using Knowledge from Wikipedia
    Zheng, Chen
    Wang, Zhichun
    Bie, Rongfang
    Zhou, Mingquan
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 236 - 246