Entity-Linking via Graph-Distance Minimization

被引:2
|
作者
Blanco, Roi [1 ]
Boldi, Paolo [2 ]
Marino, Andrea [2 ]
机构
[1] Yahoo Res, Barcelona, Spain
[2] Univ Studi Milano, Dipartimento Informat, Milan, Italy
关键词
D O I
10.4204/EPTCS.159.4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Entity-linking is a natural-language-processing task that consists in identifying the entities mentioned in a piece of text, linking each to an appropriate item in some knowledge base; when the knowledge base is Wikipedia, the problem comes to be known as wikification (in this case, items are wikipedia articles). One instance of entity-linking can be formalized as an optimization problem on the underlying concept graph, where the quantity to be optimized is the average distance between chosen items. Inspired by this application, we define a new graph problem which is a natural variant of the Maximum Capacity Representative Set. We prove that our problem is NP-hard for general graphs; nonetheless, under some restrictive assumptions, it turns out to be solvable in linear time. For the general case, we propose two heuristics: one tries to enforce the above assumptions and another one is based on the notion of hitting distance; we show experimentally how these approaches perform with respect to some baselines on a real-world dataset.
引用
收藏
页码:30 / 43
页数:14
相关论文
共 50 条
  • [1] Expressive power of entity-linking frameworks
    Burdick, Douglas
    Fagin, Ronald
    Kolaitis, Phokion G.
    Popa, Lucian
    Tan, Wang-Chiew
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2019, 100 (44-69) : 44 - 69
  • [2] SMAPH: A Piggyback Approach for Entity-Linking in Web Queries
    Cornolti, Marco
    Ferragina, Paolo
    Ciaramita, Massimiliano
    Rued, Stefan
    Schuetze, Hinrich
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2019, 37 (01)
  • [3] Fast and Accurate Entity Linking via Graph Embedding
    Parravicini, Alberto
    Patra, Rhicheek
    Bartolini, Davide B.
    Santambrogio, Marco D.
    PROCEEDINGS OF THE 2ND ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2019, 2019,
  • [4] Graph-distance distribution of the Boltzmann ensemble of RNA secondary structures
    Jing Qin
    Markus Fricke
    Manja Marz
    Peter F Stadler
    Rolf Backofen
    Algorithms for Molecular Biology, 9
  • [5] Entity-Linking Interfaces in User-Contributed Content: Preference and Performance
    Dong, Xiao
    Harper, F. Maxwell
    Konstan, Joseph A.
    29TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2011, : 2187 - 2196
  • [6] Graph-distance convergence and uniform local boundedness of monotone mappings
    Pennanen, T
    Revalski, JP
    Théra, M
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 2003, 131 (12) : 3721 - 3729
  • [7] Graph-distance distribution of the Boltzmann ensemble of RNA secondary structures
    Qin, Jing
    Fricke, Markus
    Marz, Manja
    Stadler, Peter F.
    Backofen, Rolf
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2014, 9
  • [8] An Experimental Analysis of Graph-Distance Algorithms for Comparing API Usages
    Nielebock, Sebastian
    Blockhaus, Paul
    Kruger, Jacob
    Ortmeier, Frank
    IEEE 21ST INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM 2021), 2021, : 214 - 225
  • [9] Entity Linking on Graph Data
    Yu, Minghe
    Feng, Jianhua
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 21 - 25
  • [10] Improving Entity Linking with Graph Networks
    Deng, Ziheng
    Li, Zhixu
    Yang, Qiang
    Liu, Qingsheng
    Chen, Zhigang
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 343 - 354