Domain-Aware Multi-Truth Discovery from Conflicting Sources

Cited by: 27
Authors
Lin, Xueling [1]
Chen, Lei [1]
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
Source
Proceedings of the VLDB Endowment | 2018 / Vol. 11 / No. 5
DOI
10.1145/3177732.3177739
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In the Big Data era, truth discovery has served as a promising technique to resolve conflicts in the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high-quality sources. However, existing works assume that a data source has the same reliability on all kinds of entities, ignoring the possibility that a source may vary in reliability across domains. To capture the influence of varying levels of expertise in different domains, we integrate domain expertise knowledge to achieve a more precise estimation of source reliability. We propose to infer the domain expertise of a data source based on its data richness in different domains. We also study the mutual influence between domains, which affects the inference of domain expertise. By leveraging the unique feature of the multi-truth problem that sources may provide partially correct values for a data item, we assign more reasonable confidence scores to value sets. We propose an integrated Bayesian approach that incorporates the domain expertise of data sources and the confidence scores of value sets, aiming to find multiple possible truths without any supervision. Experimental results on two real-world datasets demonstrate the feasibility, efficiency, and effectiveness of our approach.
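As a rough illustration of the idea that domain expertise can be inferred from data richness, the sketch below scores each source by the fraction of a domain's items it covers. This is an assumption made for illustration only: the abstract does not give the paper's actual formula, and the function name `domain_expertise`, the `(source, domain, item)` claim layout, and the sample data are all hypothetical.

```python
from collections import defaultdict


def domain_expertise(claims, domain_sizes):
    """Coverage-based proxy for domain expertise (illustrative assumption,
    not the formula from the paper).

    claims: iterable of (source, domain, item) triples.
    domain_sizes: dict mapping each domain to its total number of items.
    Returns: dict source -> dict domain -> coverage ratio in [0, 1].
    """
    # Collect the distinct items each source covers in each domain.
    covered = defaultdict(lambda: defaultdict(set))
    for source, domain, item in claims:
        covered[source][domain].add(item)

    # Richness proxy: share of the domain's items the source provides.
    return {
        source: {
            domain: len(items) / domain_sizes[domain]
            for domain, items in domains.items()
        }
        for source, domains in covered.items()
    }


# Hypothetical sample data: two sources, two domains.
claims = [
    ("siteA", "books", "b1"),
    ("siteA", "books", "b2"),
    ("siteA", "movies", "m1"),
    ("siteB", "movies", "m1"),
    ("siteB", "movies", "m2"),
]
scores = domain_expertise(claims, {"books": 2, "movies": 4})
```

Under this proxy, a source that covers most of one domain but little of another receives different scores per domain, which is the kind of per-domain reliability signal the abstract argues a single global score misses.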
Pages: 635-647
Page count: 13