Joint Open Knowledge Base Canonicalization and Linking

Cited by: 9
Authors
Liu, Yinan [1]
Shen, Wei [1]
Wang, Yuanfei [1]
Wang, Jianyong [2, 3]
Yang, Zhenglu [1]
Yuan, Xiaojie [1]
Affiliations
[1] Nankai Univ, Coll Comp Sci, TKLNDST, Tianjin 300071, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[3] Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou 221009, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Open Knowledge Base Canonicalization; Open Knowledge Base Linking; Factor Graph Model; Entity Linking
DOI
10.1145/3448016.3452776
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Open Information Extraction (OIE) methods extract a large number of OIE triples (noun phrase, relation phrase, noun phrase) from text, which make up large Open Knowledge Bases (OKBs). However, noun phrases (NPs) and relation phrases (RPs) in OKBs are not canonicalized and often appear in different paraphrased textual variants, which leads to redundant and ambiguous facts. Two related tasks address this problem: OKB canonicalization (i.e., converting NPs and RPs to a canonical form) and OKB linking (i.e., linking NPs and RPs to their corresponding entities and relations in a curated Knowledge Base such as DBpedia). These two tasks are tightly coupled, and each can benefit significantly from the other. However, they have been studied in isolation so far. In this paper, we explore the task of joint OKB canonicalization and linking for the first time, and propose a novel framework, JOCL, based on a factor graph model, so that the two tasks reinforce each other. JOCL is flexible enough to combine different signals from both tasks and can be extended to accommodate new signals. A thorough experimental study over two large-scale OIE triple data sets shows that our framework outperforms all baseline methods on OKB canonicalization in terms of average F1 and on OKB linking in terms of accuracy.
Pages: 2253-2261
Page count: 9
Related papers
50 in total
  • [1] Towards Practical Open Knowledge Base Canonicalization
    Wu, Tien-Hsuan
    Wu, Zhiyong
    Kao, Ben
    Yin, Pengcheng
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 883 - 892
  • [2] Open knowledge base canonicalization with multi-task learning
    Liu, Bingchen
    Peng, Huang
    Zeng, Weixin
    Zhao, Xiang
    Liu, Shijun
    Pan, Li
    Li, Xin
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
  • [3] Multi-View Clustering for Open Knowledge Base Canonicalization
    Shen, Wei
    Yang, Yang
    Liu, Yinan
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1578 - 1588
  • [4] Multi-level feature interaction for open knowledge base canonicalization
    Sui, Xuhui
    Zhang, Ying
    Song, Kehui
    Zhou, Baohang
    Yuan, Xiaojie
    KNOWLEDGE-BASED SYSTEMS, 2024, 303
  • [5] Open Knowledge Graphs Canonicalization using Variational Autoencoders
    Dash, Sarthak
    Rossiello, Gaetano
    Bagchi, Sugato
    Mihindukulasooriya, Nandana
    Gliozzo, Alfio
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 10379 - 10394
  • [6] Relation Canonicalization in Open Knowledge Graphs: A Quantitative Analysis
    Lomaeva, Maria
    Jain, Nitisha
    SEMANTIC WEB: ESWC 2022 SATELLITE EVENTS, 2022, 13384 : 21 - 25
  • [7] CMVC+: A Multi-View Clustering Framework for Open Knowledge Base Canonicalization Via Contrastive Learning
    Yang, Yang
    Shen, Wei
    Shu, Junfeng
    Liu, Yinan
    Curry, Edward
    Li, Guoliang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (05) : 2296 - 2310
  • [8] A Probabilistic Method for Linking BI Provenances to Open Knowledge Base
    Wang, Jing
    Yu, Yongchuan
    Yan, Jianzhuo
    Chen, Jianhui
    Zhao, Zhongcheng
    Wang, Dongsheng
    BRAIN INFORMATICS AND HEALTH, 2016, 9919 : 367 - 376
  • [9] MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases
    Wu, Tien-Hsuan
    Kao, Ben
    Wu, Zhiyong
    Feng, Xiyang
    Song, Qianli
    Chen, Cheng
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT I, 2020, 12342 : 315 - 327
  • [10] Canonicalization of Open Knowledge Bases with Side Information from the Source Text
    Lin, Xueling
    Chen, Lei
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 950 - 961