Modeling Topic-Based Human Expertise for Crowd Entity Resolution

被引:4
|
作者
Gong, Sai-Sai [1 ]
Hu, Wei [1 ]
Ge, Wei-Yi [2 ]
Qu, Yu-Zhong [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
[2] Sci & Technol Informat Syst Engn Lab, Nanjing 210007, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
entity resolution; crowdsourcing; human expertise; topic modeling; task similarity;
D O I
10.1007/s11390-018-1882-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Entity resolution (ER) aims to identify whether two entities in an ER task refer to the same real-world thing. Crowd ER uses humans, in addition to machine algorithms, to obtain the truths of ER tasks. However, inaccurate or erroneous results are likely to be generated when humans give unreliable judgments. Previous studies have found that correctly estimating human accuracy or expertise in crowd ER is crucial to truth inference. However, a large number of them assume that humans have consistent expertise over all the tasks, and ignore the fact that humans may have varied expertise on different topics (e.g., music versus sport). In this paper, we deal with crowd ER in the Semantic Web area. We identify multiple topics of ER tasks and model human expertise on different topics. Furthermore, we leverage similar task clustering to enhance the topic modeling and expertise estimation. We propose a probabilistic graphical model that computes ER task similarity, estimates human expertise, and infers the task truths in a unified framework. Our evaluation results on real-world and synthetic datasets show that, compared with several state-of-the-art approaches, our proposed model achieves higher accuracy on the task truth inference and is more consistent with the human real expertise.
引用
收藏
页码:1204 / 1218
页数:15
相关论文
共 50 条
  • [31] Collaborative topic regression for predicting topic-based social influence
    Hamzehei, Asso
    Wong, Raymond K.
    Koutra, Danai
    Chen, Fang
    [J]. MACHINE LEARNING, 2019, 108 (10) : 1831 - 1850
  • [32] Collaborative topic regression for predicting topic-based social influence
    Asso Hamzehei
    Raymond K. Wong
    Danai Koutra
    Fang Chen
    [J]. Machine Learning, 2019, 108 : 1831 - 1850
  • [33] Content Patterns in Topic-Based Overlapping Communities
    Rios, Sebastian A.
    Munoz, Ricardo
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [34] Topic-based Defect Prediction (NIER Track)
    Tung Thanh Nguyen
    Nguyen, Tien N.
    Tu Minh Phuong
    [J]. 2011 33RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2011, : 932 - 935
  • [35] A topic-based browser for large online resources
    Stuckenschmidt, H
    de Waard, A
    Bhogal, R
    Fluit, C
    Kampman, A
    van Buel, J
    van Mulligen, E
    Broekstra, J
    Crowlesmith, I
    van Harmelen, F
    Scerri, T
    [J]. ENGINEERING KNOWLEDGE IN THE AGE OF THE SEMANTIC WEB, PROCEEDINGS, 2004, 3257 : 433 - 448
  • [36] Automatic image annotation based on topic-based smoothing
    Zhou, XD
    Ye, JY
    Chen, L
    Zhang, L
    Shi, BL
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2005, PROCEEDINGS, 2005, 3578 : 86 - 93
  • [37] Towards Topic-Based Trust in Social Networks
    Knap, Tomas
    Mlynkova, Irena
    [J]. UBIQUITOUS INTELLIGENCE AND COMPUTING, 2010, 6406 : 635 - 649
  • [38] Topic-based influential user detection: a survey
    Rrubaa Panchendrarajan
    Akrati Saxena
    [J]. Applied Intelligence, 2023, 53 : 5998 - 6024
  • [39] Topic-based Classification through Unigram Unmasking
    HaCohen-Kerner, Yaakov
    Rosenfeld, Avi
    Sabag, Asaf
    Tzidkani, Maor
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 69 - 76
  • [40] TCPM: Topic-based Clinical Pathway Mining
    Xu, Xiao
    Jin, Tao
    Wei, Zhijie
    Lv, Cheng
    Wang, Jianmin
    [J]. 2016 IEEE FIRST INTERNATIONAL CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES (CHASE), 2016, : 292 - 301