Inconsistency-driven approach for human-in-the-loop entity matching

被引:0
|
作者
Ito, Hiroyoshi [1 ]
Koizumi, Takahiro [2 ]
Yoshimoto, Ryuji [3 ]
Fukushima, Yukihiro [4 ]
Harada, Takashi [5 ]
Morishima, Atsuyuki [1 ]
机构
[1] Univ Tsukuba, Inst Lib Informat & Media Sci, Tsukuba, Japan
[2] Univ Tsukuba, Grad Sch Comprehens Human Sci, Tsukuba, Japan
[3] CARLIL Inc, Tokyo, Japan
[4] Keio Univ, Fac Pharm, Keio, Japan
[5] Doshisha Univ, Ctr License & Qualificat, Kyoto, Japan
关键词
D O I
10.47989/ir30iConf47140
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Introduction. Entity matching is a fundamental operation in a wide range of information management applications and a tremendous number of methods have been proposed to address the problem. Human-in-the-loop entity matching is a human-AI collaborative approach which is effective when the data for entity matching is incomplete or requires domain knowledge. A typical human-in-the- loop approach is to allow a machine-learning-based matcher to ask humans to match entities when it cannot match them with high confidence. However, ML- based matchers cannot avoid the unknown-unknown problem, i.e., they can resolve the entities incorrectly with high confidence. Method. This paper addresses an inconsistency-based method to deal with this problem. The method asks humans to resolve the entities when we find inconsistency in the transitivity property behind entity matching. For example, if a matcher returns a positive result only for two combinations among three entities, the result is inconsistent. Analysis. This paper shows an implementation of our idea in similarity-based blocking method and Bayesian inference and explains the result of an extensive set of experiments that reveals how and when the method is effective. Results. The result showed that the inconsistency-based sampling selects very different entity pairs compared to other sampling strategies and that a simple hybrid strategy performs well in many practical situations. Conclusion. The results indicate our approach complements any existing matcher that can cause the unknown-unknown problem in entity matching.
引用
收藏
页码:1024 / 1038
页数:15
相关论文
共 50 条
  • [31] Human-in-the-Loop Insulin Dosing
    Bequette, B. Wayne
    JOURNAL OF DIABETES SCIENCE AND TECHNOLOGY, 2021, 15 (03): : 699 - 704
  • [32] A fuzzy system hazard analysis approach for human-in-the-loop systems
    Zahabi, Maryam
    Kaber, David
    SAFETY SCIENCE, 2019, 120 : 922 - 931
  • [33] Adaptive Task Assignment in Spatial Crowdsourcing: A Human-in-The-Loop Approach
    Wu, Qingshun
    Li, Yafei
    Yan, Jinxing
    Zhang, Mei
    Xu, Jianliang
    Xu, Mingliang
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (04) : 2726 - 2739
  • [34] A Human-in-the-Loop Approach based on Explainability to Improve NTL Detection
    Coma-Puig, Bernat
    Carmona, Josep
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 943 - 950
  • [35] Human-in-the-loop Data Integration
    Li, Guoliang
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 2006 - 2017
  • [36] Human-in-the-loop Augmented Mapping
    Sidaoui, Abbas
    Elhajj, Imad H.
    Asmar, Daniel
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 3190 - 3195
  • [37] Digital Human-in-the-Loop Framework
    Demirel, H. Onan
    DIGITAL HUMAN MODELING AND APPLICATIONS IN HEALTH, SAFETY, ERGONOMICS AND RISK MANAGEMENT. POSTURE, MOTION AND HEALTH, DHM 2020, PT I, 2020, 12198 : 18 - 32
  • [38] Human-in-the-loop Reinforcement Learning
    Liang, Huanghuang
    Yang, Lu
    Cheng, Hong
    Tu, Wenzhe
    Xu, Mengjie
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4511 - 4518
  • [39] Human-in-the-loop issues for demining
    Herman, H
    Iglesias, D
    DETECTION AND REMEDIATION TECHNOLOGIES FOR MINES AND MINELIKE TARGETS IV, PTS 1 AND 2, 1999, 3710 : 797 - 805
  • [40] Human-in-the-Loop Feature Selection
    Correia, Alvaro H. C.
    Lecue, Freddy
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 2438 - 2445