Predicting Links on Wikipedia with Anchor Text Information

被引:0
|
作者
Brochier, Robin [1 ]
Bechet, Frederic [1 ]
机构
[1] Aix Marseille Univ, Univ Toulon, CNRS, LIS, Marseille, France
关键词
Wikipedia; link prediction; evaluation; hyperlinks; NETWORKS;
D O I
10.1145/3404835.3462994
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wikipedia, the largest open-collaborative online encyclopedia, is a corpus of documents bound together by internal hyperlinks. These links form the building blocks of a large network whose structure contains important information on the concepts covered in this encyclopedia. The presence of a link between two articles, materialised by an anchor text in the source page pointing to the target page, can increase readers' understanding of a topic. However, the process of linking follows specific editorial rules to avoid both under-linking and over-linking. In this paper, we study the transductive and the inductive tasks of link prediction on several subsets of the English Wikipedia and identify some key challenges behind automatic linking based on anchor text information. We propose an appropriate evaluation sampling methodology and compare several algorithms. Moreover, we propose baseline models that provide a good estimation of the overall difficulty of the tasks.
引用
收藏
页码:1758 / 1762
页数:5
相关论文
共 50 条
  • [1] Relevancy between Anchor Text and Wikipedia: A Web Search Framework
    Al-akashi, Falah
    Inkpen, Diana
    [J]. JOURNAL OF INFORMATION AND ORGANIZATIONAL SCIENCES, 2024, 48 (01) : 1 - 17
  • [2] Improving complex interactive question answering with Wikipedia anchor text
    MacKinnon, Ian
    Vechtomova, Olga
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 438 - +
  • [3] Predicting Anchor Links between Heterogeneous Social Networks
    Sajadmanesh, Sina
    Rabiee, Hamid R.
    Khodadadi, Ali
    [J]. PROCEEDINGS OF THE 2016 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING ASONAM 2016, 2016, : 158 - 163
  • [4] Wikipedia as Text
    Pataki, Mate
    Vajna, Miklos
    Marosi, Attila Csaba
    [J]. ERCIM NEWS, 2012, (89): : 48 - 49
  • [5] PAAE: A UNIFIED FRAMEWORK FOR PREDICTING ANCHOR LINKS WITH ADVERSARIAL EMBEDDING
    Shang, Yanmin
    Kang, Zhezhou
    Cao, Yanan
    Zhang, Dongjie
    Li, Yangxi
    Li, Yang
    Liu, Yanbing
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 682 - 687
  • [6] Using Wikipedia as a reference for extracting semantic information from a text
    Prato, Andrea
    Ronchetti, Marco
    [J]. 2009 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2009, : 56 - 61
  • [7] Using and Detecting Links in Wikipedia
    Fachry, Khairun Nisa
    Kamps, Jaap
    Koolen, Marijn
    Mang, Junte
    [J]. FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 388 - 403
  • [8] Predicting missing links via local information
    Zhou, Tao
    Lu, Linyuan
    Zhang, Yi-Cheng
    [J]. EUROPEAN PHYSICAL JOURNAL B, 2009, 71 (04): : 623 - 630
  • [9] Predicting missing links via local information
    Tao Zhou
    Linyuan Lü
    Yi-Cheng Zhang
    [J]. The European Physical Journal B, 2009, 71 : 623 - 630
  • [10] User Identity Linkage with Accumulated Information from Neighbouring Anchor Links
    Li, Xiang
    Su, Yijun
    Tang, Wei
    Gao, Neng
    Xiang, Ji
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2018, PT II, 2018, 11234 : 335 - 344