Joint Entity Resolution

被引:11
|
作者
Whang, Steven Euijong [1 ]
Garcia-Molina, Hector [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1109/ICDE.2012.119
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Entity resolution (ER) is the problem of identifying which records in a database represent the same entity. Often, records of different types are involved (e.g., authors, publications, institutions, venues), and resolving records of one type can impact the resolution of other types of records. In this paper we propose a flexible, modular resolution framework where existing ER algorithms developed for a given record type can be plugged in and used in concert with other ER algorithms. Our approach also makes it possible to run ER on subsets of similar records at a time, important when the full data is too large to resolve together. We study the scheduling and coordination of the individual ER algorithms in order to resolve the full data set. We then evaluate our joint ER techniques on synthetic and real data and show the scalability of our approach.
引用
收藏
页码:294 / 305
页数:12
相关论文
共 50 条
  • [1] Joint entity resolution on multiple datasets
    Whang, Steven Euijong
    Garcia-Molina, Hector
    VLDB JOURNAL, 2013, 22 (06): : 773 - 795
  • [2] Joint entity resolution on multiple datasets
    Steven Euijong Whang
    Hector Garcia-Molina
    The VLDB Journal, 2013, 22 : 773 - 795
  • [3] A related data oriented joint entity resolution approach
    Sun, Chen-Chen
    Shen, De-Rong
    Kou, Yue
    Nie, Tie-Zheng
    Yu, Ge
    Jisuanji Xuebao/Chinese Journal of Computers, 2015, 38 (09): : 1740 - 1754
  • [4] Global joint models for coreference resolution and named entity classification
    Denis, Pascal
    Baldridge, Jason
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (42): : 87 - 96
  • [5] Towards Consistent Document-level Entity Linking: Joint Models for Entity Linking and Coreference Resolution
    Zaporojets, Klim
    Deleu, Johannes
    Jiang, Yiwei
    Demeester, Thomas
    Develder, Chris
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 778 - 784
  • [6] Revisiting Joint Modeling of Cross-document Entity and Event Coreference Resolution
    Barhom, Shany
    Shwartz, Vered
    Eirew, Alon
    Bugert, Michael
    Reimers, Nils
    Dagan, Ido
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4179 - 4189
  • [7] GB-JER: A Graph-Based Model for Joint Entity Resolution
    Sun, Chenchen
    Shen, Derong
    Kou, Yue
    Nie, Tiezheng
    Yu, Ge
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT1, 2015, 9049 : 458 - 473
  • [8] Joint Multi-field Siamese Recurrent Neural Network for Entity Resolution
    Lv, Yang
    Qi, Lei
    Huo, Jing
    Wang, Hao
    Gao, Yang
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 482 - 490
  • [9] A Joint Model of Named Entity Recognition and Coreference Resolution Based on Hybrid Neural Network
    Gao C.-S.
    Zhang J.-F.
    Li W.-P.
    Zhao W.
    Zhang S.-K.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (03): : 442 - 448
  • [10] Provenance for Entity Resolution
    Oppold, Sarah
    Herschel, Melanie
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018, 2018, 11017 : 226 - 230