From Appearance to Essence: Comparing Truth Discovery Methods without Using Ground Truth

被引:4
|
作者
Fang, Xiu Susie [1 ]
Sheng, Quan Z. [2 ]
Wang, Xianzhi [3 ]
Zhang, Wei Emma [4 ]
Ngu, Anne H. H. [5 ]
Yang, Jian [2 ]
机构
[1] Donghua Univ, Shanghai, Peoples R China
[2] Macquarie Univ, N Ryde, NSW, Australia
[3] Univ Technol Sydney, Sydney, NSW, Australia
[4] Univ Adelaide, Adelaide, SA, Australia
[5] Texas State Univ, San Marcos, TX USA
基金
澳大利亚研究理事会;
关键词
Web search; truth discovery methods; sparse ground truth; performance evaluation; single-valued objects; multi-valued objects;
D O I
10.1145/3411749
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Truth discovery has been widely studied in recent years as a fundamental means for resolving the conflicts in multi-source data. Although many truth discovery methods have been proposed based on different considerations and intuitions, investigations show that no single method consistently outperforms the others. To select the right truth discovery method for a specific application scenario, it becomes essential to evaluate and compare the performance of different methods. A drawback of current research efforts is that they commonly assume the availability of certain ground truth for the evaluation of methods. However, the ground truth may be very limited or even impossible to obtain, rendering the evaluation biased. In this article, we present CompTruthHyp, a generic approach for comparing the performance of truth discovery methods without using ground truth. In particular, our approach calculates the probability of observations in a dataset based on the output of different methods. The probability is then ranked to reflect the performance of these methods. We review and compare 12 representative truth discovery methods and consider both single-valued and multi-valued objects. The empirical studies on both real-world and synthetic datasets demonstrate the effectiveness of our approach for comparing truth discovery methods.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Evaluation Without Ground Truth in Social Media Research
    Zafarani, Reza
    Liu, Huan
    COMMUNICATIONS OF THE ACM, 2015, 58 (06) : 54 - 60
  • [22] On evaluating brain tissue classifiers without a ground truth
    Bouix, Sylvain
    Martin-Fernandez, Marcos
    Ungar, Lida
    Nakamura, Motoaki
    Koo, Min-Seong
    McCarley, Robert W.
    Shenton, Martha E.
    NEUROIMAGE, 2007, 36 (04) : 1207 - 1224
  • [23] SHAPE INITIALIZATION WITHOUT GROUND TRUTH FOR FACE ALIGNMENT
    Qin, Rizhen
    Zhang, Ting
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1278 - 1282
  • [24] Evaluating and Improving Disparity Maps Without Ground Truth
    Pocol, Andreea
    Istead, Lesley
    Kaplan, Craig S.
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2024, 2024, 1065 : 153 - 177
  • [25] Finding Ground Truth from Above
    不详
    AMERICAN SCIENTIST, 2012, 100 (01) : 66 - 67
  • [26] Data Veracity Estimation with Ensembling Truth Discovery Methods
    Berti-Equille, Laure
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2628 - 2636
  • [27] A Survey on Truth Discovery: Concepts, Methods, Applications, and Opportunities
    Wang, Shuang
    Zhang, He
    Sheng, Quan Z.
    Li, Xiaoping
    Sun, Zhu
    Cai, Taotao
    Zhang, Wei Emma
    Yang, Jian
    Gao, Qing
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (02) : 314 - 332
  • [28] Evaluating Anomaly Explanations Using Ground Truth
    Friedman, Liat Antwarg
    Galed, Chen
    Rokach, Lior
    Shapira, Bracha
    AI, 2024, 5 (04) : 2375 - 2392
  • [29] Data Fusion without Knowledge of the Ground Truth Using Tseltin-like Automata
    Yazidi, Anis
    Sandnes, Frode Eika
    2016 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2016, : 4501 - 4506
  • [30] Synthetic microbleeds generation for classifier training without ground truth
    Momeni, Saba
    Fazlollahi, Amir
    Yates, Paul
    Rowe, Christopher
    Gao, Yongsheng
    Liew, Alan Wee-Chung
    Salvado, Olivier
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 207