Knowledge derived from Wikipedia for computing semantic relatedness

Cited by: 114
Authors
Ponzetto, Simone Paolo [1]
Strube, Michael [1]
Affiliation
[1] EML Res gGmbH, Nat Language Proc Grp, D-69118 Heidelberg, Germany
Source
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH | 2007 / Vol. 30 / pp. 181-212
Keywords
Multimedia systems;
DOI
10.1613/jair.2308
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Wikipedia provides a semantic network for computing semantic relatedness in a more structured fashion than a search engine and with more coverage than WordNet. We present experiments on using Wikipedia for computing semantic relatedness and compare it to WordNet on various benchmarking datasets. Existing relatedness measures perform better using Wikipedia than a baseline given by Google counts, and we show that Wikipedia outperforms WordNet on some datasets. We also address the question of whether and how Wikipedia can be integrated into NLP applications as a knowledge base. Including Wikipedia improves the performance of a machine-learning-based coreference resolution system, indicating that it represents a valuable resource for NLP applications. Finally, we show that our method can be easily used for languages other than English by computing semantic relatedness for a German dataset.
Pages: 181-212
Page count: 32
Related papers
50 in total
  • [41] EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS
    Cui, Qing-jun
    Zhang, Hui
    Liu, Rui
    2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT, 2011, : 467 - 472
  • [42] A Wikipedia Two-way Link Vector Model for Measuring Semantic Relatedness
    Zhu, Xinhua
    Guo, Qingsong
    Zhang, Bo
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 323 - 330
  • [43] The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures
    Zesch, Torsten
    Gurevych, Iryna
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1374 - 1380
  • [44] Semantic relatedness measurement based on Wikipedia link co-occurrence analysis
    Ito, Masahiro
    Nakayama, Kotaro
    Hara, Takahiro
    Nishio, Shojiro
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2011, 7 (01) : 44 - +
  • [45] A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features
    Jabeen, Shahida
    Gao, Xiaoying
    Andreae, Peter
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8786 : 523 - 533
  • [46] A Hybrid Model for Learning Semantic Relatedness Using Wikipedia-Based Features
    Jabeen, Shahida
    Gao, Xiaoying
    Andreae, Peter
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2014, PT I, 2014, 8786 : 523 - 533
  • [47] Computing semantic similarity based on novel models of semantic representation using Wikipedia
    Qu, Rong
    Fang, Yongyi
    Bai, Wen
    Jiang, Yuncheng
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) : 1002 - 1021
  • [48] Measuring Semantic Relatedness with Knowledge Association Network
    Li, Jiapeng
    Chen, Wei
    Gu, Binbin
    Fang, Junhua
    Li, Zhixu
    Zhao, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 676 - 691
  • [49] Using the structure of a conceptual network in computing semantic relatedness
    Gurevych, I
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 767 - 778
  • [50] Constructing Semantic Knowledge Base based on Wikipedia automation
    Niu, Wanpeng
    Chen, Junting
    Chen, Meilin
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING AND INFORMATION TECHNOLOGY APPLICATIONS (MEITA 2016), 2017, 107 : 202 - 209