Random walk-based entity representation learning and re-ranking for entity search

被引:0
|
作者
Takahiro Komamizu
机构
[1] Nagoya University,
来源
关键词
Linked Data; Graph analysis; Entity representation learning; PageRank-based re-ranking; Random walk with restart; Entity search;
D O I
暂无
中图分类号
学科分类号
摘要
Linked Data (LD) has become a valuable source of factual records, and entity search is a fundamental task in LD. The task is, given a query consisting of a set of keywords, to retrieve a set of relevant entities in LD. The state-of-the-art approaches for entity search are based on information retrieval techniques. This paper first examines these approaches with a traditional evaluation metric, recall@k, to reveal their potential for improvement. To obtain evidence for the potentials, an investigation is carried out on the relationship between queries and answer entities in terms of path lengths on a graph of LD. On the basis of the investigation, learning representations of entities are dealt with. The existing methods of entity search are based on heuristics that determine relevant fields (i.e., predicates and related entities) to constitute entity representations. Since the heuristics require burdensome human decisions, this paper is aimed at removing the burden with a graph proximity measurement. To this end, in this paper, RWRDoc is proposed. It is an RWR (random walk with restart)-based representation learning method that learns representations of entities by using weighted combinations of representations of reachable entities w.r.t. RWR. RWRDoc is mainly designed to improve recall scores; therefore, as shown in experiments, it lacks capability in ranking. In order to improve the ranking qualities, this paper proposes a personalized PageRank-based re-ranking method, PPRSD (Personalized PageRank-based Score Distribution), for the retrieved results. PPRSD distributes relevance scores calculated by text-based entity search methods in a personalized PageRank manner. Experimental evaluations showcase that RWRDoc can improve search qualities in terms of recall@1000 and PPRSD can compensate for RWRDoc’s insufficient ranking capability, and the evaluations confirmed this compensation.
引用
收藏
页码:2989 / 3013
页数:24
相关论文
共 50 条
  • [2] Collective entity linking: a random walk-based perspective
    Liu, Ming
    Zhao, Yanyan
    Qin, Bing
    Liu, Ting
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (03) : 1611 - 1643
  • [3] Collective entity linking: a random walk-based perspective
    Ming Liu
    Yanyan Zhao
    Bing Qin
    Ting Liu
    [J]. Knowledge and Information Systems, 2019, 60 : 1611 - 1643
  • [4] A Hybrid Re-ranking Method for Entity Recognition and Linking in Search Queries
    Tang, Gongbo
    Guo, Yuting
    Yu, Dong
    Xun, Endong
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 598 - 605
  • [5] Named Entity Based Document Similarity with SVM-Based Re-ranking for Entity Linking
    Alhelbawy, Ayman
    Gaizauskas, Rob
    [J]. ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, 2012, 322 : 379 - 388
  • [6] Iterative Entity Alignment via Re-Ranking
    Zeng W.
    Zhao X.
    Tang J.
    Tan Z.
    Wang W.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (07): : 1460 - 1471
  • [7] Representation Learning for Entity Type Ranking
    Rahman, Md Mostafizur
    Takasu, Atsuhiro
    Demartini, Gianluca
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 2049 - 2056
  • [8] A Walk-based Model on Entity Graphs for Relation Extraction
    Christopoulou, Fenia
    Miwa, Makoto
    Ananiadou, Sophia
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 81 - 88
  • [9] Re-ranking for Joint Named-Entity Recognition and Linking
    Sil, Avirup
    Yates, Alexander
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2369 - 2374
  • [10] DREQ: Document Re-ranking Using Entity-Based Query Understanding
    Chatterjee, Shubham
    Mackie, Iain
    Dalton, Jeff
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 210 - 229