Domain-Aware Multi-Truth Discovery from Conflicting Sources

被引:27
|
作者
Lin, Xueling [1 ]
Chen, Lei [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2018年 / 11卷 / 05期
关键词
D O I
10.1145/3177732.3177739
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the Big Data era, truth discovery has served as a promising technique to solve conflicts in the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high quality sources. However, existing works assume that one data source has the same reliability on any kinds of entity, ignoring the possibility that a source may vary in reliability on different domains. To capture the influence of various levels of expertise in different domains, we integrate domain expertise knowledge to achieve a more precise estimation of source reliability. We propose to infer the domain expertise of a data source based on its data richness in different domains. We also study the mutual influence between domains, which will affect the inference of domain expertise. Through leveraging the unique features of the multi-truth problem that sources may provide partially correct values of a data item, we assign more reasonable confidence scores to value sets. We propose an integrated Bayesian approach to incorporate the domain expertise of data sources and confidence scores of value sets, aiming to find multiple possible truths without any supervision. Experimental results on two real-world datasets demonstrate the feasibility, efficiency and effectiveness of our approach.
引用
收藏
页码:635 / 647
页数:13
相关论文
共 36 条
  • [31] Edge-aware resource discovery and fair intelligent admission control scheme over multi-domain differentiated services networks
    Li, M
    Hoang, DB
    2004 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-7, 2004, : 2065 - 2070
  • [32] Spatially Aware Domain Adaptation Enables Cell Type Deconvolution from Multi-Modal Spatially Resolved Transcriptomics
    Wang, Lequn
    Bai, Xiaosheng
    Zhang, Chuanchao
    Shi, Qianqian
    Chen, Luonan
    SMALL METHODS, 2024,
  • [33] Semantics-aware Multi-modal Domain Translation: From LiDAR Point Clouds to Panoramic Color Images
    Cortinhal, Tiago
    Kurnaz, Fatih
    Aksoy, Eren Erdal
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3032 - 3041
  • [34] Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data
    Hajiramezanali, Ehsan
    Dadaneh, Siamak Zamani
    Karbalayghareh, Alireza
    Zhou, Mingyuan
    Qian, Xiaoning
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [35] Uncertainty-aware multi-criteria decision analysis for evaluation of explainable artificial intelligence methods: A use case from the healthcare domain
    Aliyeva, Kamala
    Mehdiyev, Nijat
    INFORMATION SCIENCES, 2024, 657
  • [36] Depth- and semantics-aware multi-modal domain translation: Generating 3D panoramic color images from LiDAR point clouds
    Cortinhal, Tiago
    Aksoy, Eren Erdal
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2024, 171