A Confidence-based Entity Resolution Approach with Incomplete Information

Cited by: 0
Authors
Gu, Qi [1, 2]
Zhang, Yan [1]
Cao, Jian [1]
Xu, Guandong [3]
Cuzzocrea, Alfredo [4, 5]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Nantong Univ, Sch Comp Sci & Technol, Nantong, Peoples R China
[3] Univ Technol Sydney, Sydney, NSW, Australia
[4] ICAR CNR, Cosenza, Italy
[5] Univ Calabria, Cosenza, Italy
Keywords
Entity Resolution; Cover Rate; Confidence; Accuracy;
DOI
Not available
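Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;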
Abstract
Entity resolution identifies records from different data sources that refer to the same real-world entity, and it is an important prerequisite for integrating data from multiple sources. Entity resolution mainly relies on similarity measures over data records. Unfortunately, the quality of data sources is often poor in practice. Web data sources in particular often provide only incomplete information, which makes it difficult to apply similarity measures directly to identify matching entities. To address this problem, the concept of confidence is introduced to measure the trustworthiness of the similarity calculation. An adaptive rule-based approach is used to calculate the similarity between records, and the confidence of that similarity is also derived. The similarity and confidence are then propagated on the entity relational graph until a fixed point is reached. Finally, any pair of records can be classified as matched or unmatched based on a threshold. We performed a series of experiments on real data sets, and the experimental results show that our approach performs better than the approaches it was compared with.
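The pipeline described in the abstract (pairwise similarity with a confidence value, propagation over a relational graph until a fixed point, then a threshold decision) can be illustrated with a minimal Python sketch. This is not the authors' method: the abstract does not give the adaptive rules, the confidence formula, or the propagation update, so everything concrete here is an assumption made for illustration. In particular, similarity is taken as the fraction of agreeing attributes among those present in both records, confidence as the fraction of attributes that could be compared at all, and the names `similarity_and_confidence`, `propagate`, `is_match`, and the parameters `alpha` and `threshold` are hypothetical.

```python
def similarity_and_confidence(a, b, fields):
    """Return (similarity, confidence) for two records given as dicts.

    Assumption: similarity is the share of exactly equal values among fields
    present in both records; confidence is the share of fields that were
    present in both records (missing values are None or absent).
    """
    shared = [f for f in fields if a.get(f) is not None and b.get(f) is not None]
    if not shared:
        return 0.0, 0.0
    agree = sum(1 for f in shared if a[f] == b[f])
    return agree / len(shared), len(shared) / len(fields)


def propagate(initial, neighbors, alpha=0.5, tol=1e-9, max_iter=100):
    """Iterate (similarity, confidence) scores over a graph of record pairs.

    initial:   {pair: (similarity, confidence)}
    neighbors: {pair: [related pairs]}, a hypothetical relational graph.
    Assumption: a pair with low initial confidence borrows more from its
    neighbors; neighbor similarities are weighted by their initial confidence.
    """
    current = dict(initial)
    for _ in range(max_iter):
        updated = {}
        for pair, (sim0, conf0) in initial.items():
            nbrs = [n for n in neighbors.get(pair, []) if n in initial]
            weight_sum = sum(initial[n][1] for n in nbrs)
            if weight_sum == 0:
                updated[pair] = (sim0, conf0)  # nothing trustworthy to borrow
                continue
            nb_sim = sum(current[n][0] * initial[n][1] for n in nbrs) / weight_sum
            nb_conf = sum(current[n][1] for n in nbrs) / len(nbrs)
            w = alpha * (1.0 - conf0)  # mixing weight, at most alpha
            updated[pair] = ((1 - w) * sim0 + w * nb_sim,
                             (1 - w) * conf0 + w * nb_conf)
        delta = max((abs(updated[p][i] - current[p][i])
                     for p in initial for i in (0, 1)), default=0.0)
        current = updated
        if delta < tol:  # scores stopped changing: fixed point reached
            break
    return current


def is_match(sim, conf, threshold=0.6):
    """Assumed decision rule: confidence-discounted similarity vs. a threshold."""
    return sim * conf >= threshold


# Usage: pair p has a missing attribute; its related pair q is fully observed.
fields = ["name", "city"]
p = similarity_and_confidence({"name": "a", "city": None},
                              {"name": "a", "city": "x"}, fields)
q = similarity_and_confidence({"name": "b", "city": "y"},
                              {"name": "b", "city": "y"}, fields)
scores = propagate({"p": p, "q": q}, {"p": ["q"], "q": ["p"]})
print(is_match(*p), is_match(*scores["p"]))
```

In this toy setup the incomplete pair falls below the assumed threshold on its own and passes it only after borrowing confidence from its fully observed neighbor, which is the kind of effect the abstract attributes to propagation.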
Pages: 97 - 103
Page count: 7