CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases

被引:198
|
作者
Ren, Xiang [1 ]
Wu, Zeqiu [1 ]
He, Wenqi [1 ]
Qu, Meng [1 ]
Voss, Clare R. [2 ]
Ji, Heng [3 ]
Abdelzaher, Tarek F. [1 ]
Han, Jiawei [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Army Res Lab, Computat & Informat Sci Directorate, Adelphi, MD USA
[3] Rensselaer Polytech Inst, Comp Sci Dept, Troy, NY USA
基金
美国国家科学基金会;
关键词
D O I
10.1145/3038912.3052708
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting entities and relations for types of interest from text is important for understanding massive text corpora. Traditionally, systems of entity relation extraction have relied on human-annotated corpora for training and adopted an incremental pipeline. Such systems require additional human expertise to be ported to a new domain, and are vulnerable to errors cascading down the pipeline. In this paper, we investigate joint extraction of typed entities and relations with labeled data heuristically obtained from knowledge bases (i.e., distant supervision). As our algorithm for type labeling via distant supervision is context-agnostic, noisy training data poses unique challenges for the task. We propose a novel domain-independent framework, called COTYPE, that runs a data-driven text segmentation algorithm to extract entity mentions, and jointly embeds entity mentions, relation mentions, text features and type labels into two low-dimensional spaces (for entity and relation mentions respectively), where, in each space, objects whose types are close will also have similar representations. COTYPE, then using these learned embeddings, estimates the types of test (unlinkable) mentions. We formulate a joint optimization problem to learn embeddings from text corpora and knowledge bases, adopting a novel partial-label loss function for noisy labeled data and introducing an object "translation" function to capture the cross-constraints of entities and relations on each other. Experiments on three public datasets demonstrate the effectiveness of COTYPE across different domains (e.g., news, biomedical), with an average of 25% improvement in F1 score compared to the next best method.
引用
收藏
页码:1015 / 1024
页数:10
相关论文
共 50 条
  • [21] Joint extraction of entities and overlapping relations by improved graph convolutional networks
    Sun, Qi
    Zhang, Kun
    Lv, Laishui
    Li, Xun
    Huang, Kun
    Zhang, Ting
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5212 - 5224
  • [22] Joint extraction of entities and overlapping relations by improved graph convolutional networks
    Qi Sun
    Kun Zhang
    Laishui Lv
    Xun Li
    Kun Huang
    Ting Zhang
    Applied Intelligence, 2022, 52 : 5212 - 5224
  • [23] Joint Extraction of Multiple Relations and Entities from Building Code Clauses
    Li, Fulin
    Song, Yuanbin
    Shan, Yongwei
    APPLIED SCIENCES-BASEL, 2020, 10 (20): : 1 - 18
  • [24] Joint Extraction of Multiple Relations and Entities by Using a Hybrid Neural Network
    Zhou, Peng
    Zheng, Suncong
    Xu, Jiaming
    Qi, Zhenyu
    Bao, Hongyun
    Xu, Bo
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, CCL 2017, 2017, 10565 : 135 - 146
  • [25] TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations
    Li, Xianming
    Luo, Xiaotian
    Dong, Chenghao
    Yang, Daichuan
    Luan, Beidi
    He, Zhen
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8055 - 8064
  • [26] Joint Extraction of Entities and Relations Using Reinforcement Learning and Deep Learning
    Feng, Yuntian
    Zhang, Hongjun
    Hao, Wenning
    Chen, Gang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [27] Assessing the Completeness of Entities in Knowledge Bases
    Ahmeti, Albin
    Razniewski, Simon
    Polleres, Axel
    SEMANTIC WEB: ESWC 2017 SATELLITE EVENTS, 2017, 10577 : 7 - 11
  • [28] SEPC: Improving Joint Extraction of Entities and Relations by Strengthening Entity Pairs Connection
    Zhao, Jiapeng
    Zhang, Panpan
    Liu, Tingwen
    Shi, Jinqiao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 817 - 828
  • [29] Joint extraction of entities and relations via an entity correlated attention neural model
    Li, Ren
    Li, Dong
    Yang, Jianxi
    Xiang, Fangyue
    Ren, Hao
    Jiang, Shixin
    Zhang, Luyi
    INFORMATION SCIENCES, 2021, 581 (581) : 179 - 193
  • [30] A Unified Multi-Task Learning Framework for Joint Extraction of Entities and Relations
    Zhao, Tianyang
    Yan, Zhao
    Cao, Yunbo
    Li, Zhoujun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14524 - 14531