Evaluating Entity Linking with Wikipedia

Cited by: 135
Authors
Hachey, Ben [1]
Radford, Will [2, 3]
Nothman, Joel [2, 3]
Honnibal, Matthew [4]
Curran, James R. [2, 3]
Affiliations
[1] Thomson Reuters Corp, Res & Dev, St Paul, MN 55123 USA
[2] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
[3] Capital Markets CRC, Sydney, NSW 2000, Australia
[4] Macquarie Univ, Dept Comp, N Ryde, NSW 2109, Australia
Keywords
Named Entity Linking; Disambiguation; Information extraction; Wikipedia; Semi-structured resources; WEB;
DOI
10.1016/j.artint.2012.04.005
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Named Entity Linking (NEL) grounds entity mentions to their corresponding node in a Knowledge Base (KB). Recently, a number of systems have been proposed for linking entity mentions in text to Wikipedia pages. Such systems typically search for candidate entities and then disambiguate them, returning either the best candidate or NIL. However, comparison has focused on disambiguation accuracy, making it difficult to determine how search impacts performance. Furthermore, important approaches from the literature have not been systematically compared on standard data sets. We reimplement three seminal NEL systems and present a detailed evaluation of search strategies. Our experiments find that coreference and acronym handling lead to substantial improvement, and search strategies account for much of the variation between systems. This is an interesting finding, because these aspects of the problem have often been neglected in the literature, which has focused largely on complex candidate ranking algorithms. (C) 2012 Elsevier B.V. All rights reserved.
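The search-then-disambiguate pipeline described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy `KB` and `ALIASES` tables, the function names `search`, `disambiguate`, and `link`, and the word-overlap score with its `min_overlap` threshold. The systems evaluated in the paper use considerably richer search and ranking strategies.

```python
from typing import List, Optional

# Hypothetical toy knowledge base: entity title -> short description text.
KB = {
    "Apple Inc.": "technology company that makes the iphone and mac computers",
    "Apple (fruit)": "edible fruit that grows on an apple tree",
}

# Hypothetical alias index for the search step: lowercased name -> candidate titles.
ALIASES = {
    "apple": ["Apple Inc.", "Apple (fruit)"],
}


def search(mention: str) -> List[str]:
    """Search step: return candidate KB titles for a mention string."""
    return ALIASES.get(mention.lower(), [])


def disambiguate(candidates: List[str], context: str,
                 min_overlap: int = 1) -> Optional[str]:
    """Disambiguation step: return the best candidate, or None for NIL.

    Candidates are scored by the number of words shared between the
    mention's context and the candidate's KB description (an assumed,
    deliberately simple scoring function).
    """
    context_tokens = set(context.lower().split())
    best, best_score = None, 0
    for title in candidates:
        score = len(context_tokens & set(KB[title].split()))
        if score > best_score:
            best, best_score = title, score
    # NIL when search found nothing or no candidate is supported by the context.
    return best if best_score >= min_overlap else None


def link(mention: str, context: str) -> Optional[str]:
    """Full pipeline: search for candidates, then disambiguate."""
    return disambiguate(search(mention), context)
```

In this sketch a mention that the search step cannot match never reaches disambiguation and is returned as NIL (`None`), which shows in miniature why search quality bounds overall linking accuracy.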
Pages: 130-150
Page count: 21