Joint Entity Resolution

被引:11
|
作者
Whang, Steven Euijong [1 ]
Garcia-Molina, Hector [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1109/ICDE.2012.119
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Entity resolution (ER) is the problem of identifying which records in a database represent the same entity. Often, records of different types are involved (e.g., authors, publications, institutions, venues), and resolving records of one type can impact the resolution of other types of records. In this paper we propose a flexible, modular resolution framework where existing ER algorithms developed for a given record type can be plugged in and used in concert with other ER algorithms. Our approach also makes it possible to run ER on subsets of similar records at a time, important when the full data is too large to resolve together. We study the scheduling and coordination of the individual ER algorithms in order to resolve the full data set. We then evaluate our joint ER techniques on synthetic and real data and show the scalability of our approach.
引用
收藏
页码:294 / 305
页数:12
相关论文
共 50 条
  • [31] Crowdsourcing Algorithms for Entity Resolution
    Vesdapunt, Norases
    Bellare, Kedar
    Dalvi, Nilesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (12): : 1071 - 1082
  • [32] Entity Resolution with Iterative Blocking
    Whang, Steven Euijong
    Menestrina, David
    Koutrika, Georgia
    Theobald, Martin
    Garcia-Molina, Hector
    ACM SIGMOD/PODS 2009 CONFERENCE, 2009, : 219 - 231
  • [33] ENTITY RESOLUTION AND BLOCKING: A REVIEW
    Vidhya, K. A.
    Geetha, T. V.
    PROCEEDINGS OF THE 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC 2019), 2019, : 133 - 140
  • [34] Entity Resolution On-Demand
    Simonini, Giovanni
    Zecchini, Luca
    Bergamaschi, Sonia
    Naumann, Felix
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (07): : 1506 - 1518
  • [35] Disinformation Techniques for Entity Resolution
    Whang, Steven Euijong
    Garcia-Molina, Hector
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 715 - 720
  • [36] Entity Resolution in the Web of Data
    Stefanidis, Kostas
    Efthymiou, Vasilis
    Herschel, Melanie
    Christophides, Vassilis
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 203 - 203
  • [37] Entity Resolution in the Web of Data
    Department of Computer Science, University of Crete, Greece
    不详
    不详
    Synth. lect. semant. web : theory technol., 3 (1-124):
  • [38] Entity Resolution in Dissimilarity Spaces
    Verykios, Vassilios S.
    Karapiperis, Dimitrios
    25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 413 - 418
  • [39] Parallel Entity Resolution with Dedoop
    Lars Kolb
    Erhard Rahm
    Kolb, Lars (kolb@informatik.uni-leipzig.de), 1600, Springer Medizin (13): : 23 - 32
  • [40] Bilinear joint learning of word and entity embeddings for Entity Linking
    Chen, Hui
    Wei, Baogang
    Liu, Yonghuai
    Li, Yiming
    Yu, Jifang
    Zhu, Wenhao
    NEUROCOMPUTING, 2018, 294 : 12 - 18