Domain-Aware Multi-Truth Discovery from Conflicting Sources

被引：27

作者：

Lin, Xueling ^{[1
]}

Chen, Lei ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China

来源：

PROCEEDINGS OF THE VLDB ENDOWMENT | 2018年 / 11卷 / 05期

关键词：

D O I：

10.1145/3177732.3177739

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the Big Data era, truth discovery has served as a promising technique to solve conflicts in the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high quality sources. However, existing works assume that one data source has the same reliability on any kinds of entity, ignoring the possibility that a source may vary in reliability on different domains. To capture the influence of various levels of expertise in different domains, we integrate domain expertise knowledge to achieve a more precise estimation of source reliability. We propose to infer the domain expertise of a data source based on its data richness in different domains. We also study the mutual influence between domains, which will affect the inference of domain expertise. Through leveraging the unique features of the multi-truth problem that sources may provide partially correct values of a data item, we assign more reasonable confidence scores to value sets. We propose an integrated Bayesian approach to incorporate the domain expertise of data sources and confidence scores of value sets, aiming to find multiple possible truths without any supervision. Experimental results on two real-world datasets demonstrate the feasibility, efficiency and effectiveness of our approach.

引用

页码：635 / 647

页数：13

共 36 条

[1] Enhancing domain-aware multi-truth data fusion using copy-based source authority and value similarity
Fabio Azzalini
Davide Piantella
Emanuele Rabosio
Letizia Tanca
The VLDB Journal, 2023, 32 : 475 - 500
[2] Enhancing domain-aware multi-truth data fusion using copy-based source authority and value similarity
Azzalini, Fabio
Piantella, Davide
Rabosio, Emanuele
Tanca, Letizia
VLDB JOURNAL, 2023, 32 (03): : 475 - 500
[3] Empowering Truth Discovery with Multi-Truth Prediction
Wang, Xianzhi
Sheng, Quan Z.
Yao, Lina
Li, Xue
Fang, Xiu Susie
Xu, Xiaofei
Benatallah, Boualem
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 881 - 890
[4] Multi-Truth Discovery While Being Aware of Unbalanced Data Distribution
Fang, Xiu Susie
Sheng, Quan Z.
Sun, Guohao
Chang, Shan
Wang, Hongya
Yang, Jian
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[5] Generalizing truth discovery by incorporating multi-truth features
Fang, Xiu Susie
Wang, Xianzhi
Sheng, Quan Z.
Yao, Lina
COMPUTING, 2024, 106 (05) : 1557 - 1583
[6] Multi-Truth Discovery Method Based on Attribute Fusion
Haolin, Yang
Yongquan, Dong
Huafeng, Chen
Guoxi, Zhang
Data Analysis and Knowledge Discovery, 2022, 6 (11) : 52 - 60
[7] Truth Discovery from Conflicting Multi-Valued Objects
Fang, Xiu Susie
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 711 - 715
[8] Domain-Aware Dialogue State Tracker for Multi-Domain Dialogue Systems
Balaraman, Vevake
Magnini, Bernardo
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 866 - 873
[9] Multi-Domain Sentiment Classification Based on Domain-Aware Embedding and Attention
Cai, Yitao
Wan, Xiaojun
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4904 - 4910
[10] Domain-Aware Contrastive Knowledge Transfer for Multi-domain Imbalanced Data
Ke, Zixuan
Kachuee, Mohammad
Lee, Sungjin
PROCEEDINGS OF THE 12TH WORKSHOP ON COMPUTATIONAL APPROACHES TO SUBJECTIVITY, SENTIMENT & SOCIAL MEDIA ANALYSIS, 2022, : 25 - 36

← 1 2 3 4 →