Domain-Aware Multi-Truth Discovery from Conflicting Sources

Cited by: 27
Authors
Lin, Xueling [1]
Chen, Lei [1]
Affiliation
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2018, Vol. 11, No. 5
DOI
10.1145/3177732.3177739
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In the Big Data era, truth discovery has served as a promising technique to resolve conflicts among the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high-quality sources. However, existing works assume that a data source has the same reliability for every kind of entity, ignoring the possibility that a source may vary in reliability across domains. To capture the influence of various levels of expertise in different domains, we integrate domain expertise knowledge to achieve a more precise estimation of source reliability. We propose to infer the domain expertise of a data source based on its data richness in different domains. We also study the mutual influence between domains, which will affect the inference of domain expertise. By leveraging the unique feature of the multi-truth problem that sources may provide partially correct values of a data item, we assign more reasonable confidence scores to value sets. We propose an integrated Bayesian approach to incorporate the domain expertise of data sources and confidence scores of value sets, aiming to find multiple possible truths without any supervision. Experimental results on two real-world datasets demonstrate the feasibility, efficiency and effectiveness of our approach.
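The idea of weighting a source by its domain expertise can be illustrated with a small sketch. This is not the paper's integrated Bayesian model: it is a simplified weighted vote, written under the assumption that "data richness" can be approximated by the share of a source's claims that fall in each domain. The function names `domain_expertise` and `select_truths`, the tuple layout of `claims`, and the `threshold` parameter are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def domain_expertise(claims):
    """Approximate each source's expertise per domain by data richness.

    claims: iterable of (source, domain, item, value) tuples.
    Returns {source: {domain: share of that source's distinct claims}}.
    Assumption: richness is a plain share of claims, not the paper's estimator.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for source, domain, _item, _value in set(claims):
        counts[source][domain] += 1
    expertise = {}
    for source, per_domain in counts.items():
        total = sum(per_domain.values())
        expertise[source] = {d: n / total for d, n in per_domain.items()}
    return expertise


def select_truths(claims, threshold=0.5):
    """Pick possibly several values per item by expertise-weighted voting.

    A value is kept when the expertise-weighted support of the sources that
    claim it reaches `threshold` of the total weight of all sources that
    said anything about the item. Several values may pass (multi-truth).
    """
    claims = set(claims)  # ignore verbatim duplicate claims
    expertise = domain_expertise(claims)
    support = defaultdict(lambda: defaultdict(float))
    weight_total = defaultdict(float)
    counted = set()
    for source, domain, item, value in claims:
        weight = expertise[source][domain]
        support[item][value] += weight
        if (source, item) not in counted:  # count each source once per item
            counted.add((source, item))
            weight_total[item] += weight
    return {
        item: {v for v, s in values.items() if s / weight_total[item] >= threshold}
        for item, values in support.items()
    }


# Hypothetical toy data: s1 is richer in the "movie" domain than s2.
claims = [
    ("s1", "movie", "Inception.director", "Nolan"),
    ("s1", "movie", "Inception.cast", "DiCaprio"),
    ("s1", "movie", "Inception.cast", "Hardy"),
    ("s1", "book", "Dune.author", "Herbert"),
    ("s2", "book", "Dune.author", "Herbert"),
    ("s2", "book", "Dune.author", "Anderson"),
    ("s2", "movie", "Inception.cast", "DiCaprio"),
    ("s2", "movie", "Inception.cast", "Pitt"),
]
truths = select_truths(claims)
```

In this toy data, a value claimed only by the source with the lower movie-domain share can fall below the threshold, while a value claimed only by the richer source can still pass, which is the intuition behind treating reliability as domain-dependent rather than global.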
Pages: 635-647
Page count: 13