Error Link Detection and Correction in Wikipedia

被引:6
|
作者
Wang, Chengyu [1 ]
Zhang, Rong [1 ]
He, Xiaofeng [1 ]
Zhou, Aoying [1 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Software Engn, Shanghai, Peoples R China
关键词
error link; Wikipedia; LinkRank; pairwise learning; LARGE-SCALE;
D O I
10.1145/2983323.2983705
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The hyperlink structure of Wikipedia forms a rich semantic network connecting entities and concepts, enabling it as a valuable source for knowledge harvesting. Wikipedia, as crowd-sourced data, faces various data quality issues which significantly impacts knowledge systems depending on it as the information source. One such issue occurs when an anchor text in a Wikipage links to a wrong Wikipage, causing the error link problem. While much of previous work has focused on leveraging Wikipedia for entity linking, little has been done to detect error links. In this paper, we address the error link problem, and propose algorithms to detect and correct error links. We introduce an efficient method to generate candidate error links based on iterative ranking in an Anchor Text Semantic Network. This greatly reduces the problem space. A more accurate pairwise learning model was used to detect error links from the reduced candidate error link set, while suggesting correct links in the same time. This approach is effective when data sparsity is a challenging issue. The experiments on both English and Chinese Wikipedia illustrate the effectiveness of our approach. We also provide a preliminary analysis on possible causes of error links in English and Chinese Wikipedia.
引用
收藏
页码:307 / 316
页数:10
相关论文
共 50 条
  • [21] Error Detection and Correction in Communication Networks
    Shangguan, Chong
    Tamo, Itzhak
    2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 96 - 101
  • [22] ERROR-DETECTION AND CORRECTION IN SPELLING
    LYDIATT, S
    ACADEMIC THERAPY, 1984, 20 (01): : 33 - 40
  • [23] ERROR CORRECTION AND DETECTION, A GEOMETRIC APPROACH
    WARD, RK
    TABANDEH, M
    COMPUTER JOURNAL, 1984, 27 (03): : 246 - 253
  • [24] Error detection/correction in collaborative writing
    Pilotti, Maura
    Chodorow, Martin
    READING AND WRITING, 2009, 22 (03) : 245 - 260
  • [25] USING CODES FOR ERROR CORRECTION AND DETECTION
    KLOVE, T
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1984, 30 (06) : 868 - 870
  • [26] ERROR DETECTION AND CORRECTION IN FORMAL LANGUAGES
    IWAMOTO, K
    SAWANO, A
    NEC RESEARCH & DEVELOPMENT, 1973, (30): : 64 - 71
  • [27] An Error Detection and Correction Framework for Connectomics
    Zung, Jonathan
    Tartavull, Ignacio
    Lee, Kisuk
    Seung, H. Sebastian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [28] DATA ERROR-DETECTION AND CORRECTION
    WATKINSON, JR
    WIRELESS WORLD, 1983, 89 (1565): : 44 - 48
  • [29] ERROR-DETECTION AND CORRECTION PRIMER
    不详
    HEWLETT-PACKARD JOURNAL, 1990, 41 (06): : 46 - 47
  • [30] Error Detection and Correction in Data Collection
    Challinor, Julia
    INTERNATIONAL JOURNAL OF ANTIMICROBIAL AGENTS, 2005, 26 : S63 - S63