Modeling Topic-Based Human Expertise for Crowd Entity Resolution

被引:4
|
作者
Gong, Sai-Sai [1 ]
Hu, Wei [1 ]
Ge, Wei-Yi [2 ]
Qu, Yu-Zhong [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
[2] Sci & Technol Informat Syst Engn Lab, Nanjing 210007, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
entity resolution; crowdsourcing; human expertise; topic modeling; task similarity;
D O I
10.1007/s11390-018-1882-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Entity resolution (ER) aims to identify whether two entities in an ER task refer to the same real-world thing. Crowd ER uses humans, in addition to machine algorithms, to obtain the truths of ER tasks. However, inaccurate or erroneous results are likely to be generated when humans give unreliable judgments. Previous studies have found that correctly estimating human accuracy or expertise in crowd ER is crucial to truth inference. However, a large number of them assume that humans have consistent expertise over all the tasks, and ignore the fact that humans may have varied expertise on different topics (e.g., music versus sport). In this paper, we deal with crowd ER in the Semantic Web area. We identify multiple topics of ER tasks and model human expertise on different topics. Furthermore, we leverage similar task clustering to enhance the topic modeling and expertise estimation. We propose a probabilistic graphical model that computes ER task similarity, estimates human expertise, and infers the task truths in a unified framework. Our evaluation results on real-world and synthetic datasets show that, compared with several state-of-the-art approaches, our proposed model achieves higher accuracy on the task truth inference and is more consistent with the human real expertise.
引用
收藏
页码:1204 / 1218
页数:15
相关论文
共 50 条
  • [1] Modeling Topic-Based Human Expertise for Crowd Entity Resolution
    Sai-Sai Gong
    Wei Hu
    Wei-Yi Ge
    Yu-Zhong Qu
    [J]. Journal of Computer Science and Technology, 2018, 33 : 1204 - 1218
  • [2] Evaluation of topic-based adaptation and student modeling in QuizGuide
    Sergey Sosnovsky
    Peter Brusilovsky
    [J]. User Modeling and User-Adapted Interaction, 2015, 25 : 371 - 424
  • [3] Evaluation of topic-based adaptation and student modeling in QuizGuide
    Sosnovsky, Sergey
    Brusilovsky, Peter
    [J]. USER MODELING AND USER-ADAPTED INTERACTION, 2015, 25 (04) : 371 - 424
  • [4] Topic-based coherence modeling for statistical machine translation
    Institute for Infocomm Research, Singapore
    138632, Singapore
    不详
    215006, China
    [J]. IEEE Trans. Audio Speech Lang. Process., 3 (483-493):
  • [5] Topic-Based Language Modeling with Dynamic Bayesian Networks
    Wiggers, Pascal
    Rothkrantz, Leon J. M.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1866 - 1869
  • [6] Topic-Based Coherence Modeling for Statistical Machine Translation
    Xiong, Deyi
    Zhang, Min
    Wang, Xing
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (03) : 483 - 493
  • [7] Latent topic-based super-resolution for remote sensing
    Fernandez-Beltran, Ruben
    Latorre-Carmona, Pedro
    Pla, Filiberto
    [J]. REMOTE SENSING LETTERS, 2017, 8 (06) : 498 - 507
  • [8] Attribute-based Crowd Entity Resolution
    Khan, Asif R.
    Garcia-Molina, Hector
    [J]. CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 549 - 558
  • [9] Waldo: An Adaptive Human Interface for Crowd Entity Resolution
    Verroios, Vasilis
    Garcia-Molina, Hector
    Papakonstantinou, Yannis
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 1133 - 1148
  • [10] Modeling Flickr Communities Through Probabilistic Topic-Based Analysis
    Negoescu, Radu-Andrei
    Gatica-Perez, Daniel
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (05) : 399 - 416