Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Cited by: 0
Authors
Platanios, Emmanouil Antonios [1]
Dubey, Avinava [1]
Mitchell, Tom [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
DISTRIBUTIONS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies and combining multiple classifier outputs.
Pages: 10
Related papers
50 in total
  • [31] Efficient heuristics for learning scalable Bayesian network classifier from labeled and unlabeled data
    Limin Wang
    Junjie Wang
    Lu Guo
    Qilong Li
    Applied Intelligence, 2024, 54 : 1957 - 1979
  • [32] A Bayesian method for estimating the accuracy of recalled depression
    Rutter, CM
    Simon, G
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2004, 53 : 341 - 353
  • [33] Estimating the distribution of sensorimotor synchronization data: A Bayesian hierarchical modeling approach
    Baath, Rasmus
    BEHAVIOR RESEARCH METHODS, 2016, 48 (02) : 463 - 474
  • [34] Estimating the demand for health care with panel data: a semiparametric Bayesian approach
    Jochmann, M
    León-González, R
    HEALTH ECONOMICS, 2004, 13 (10) : 1003 - 1014
  • [35] A Bayesian Approach for Estimating Dynamic Functional Network Connectivity in fMRI Data
    Warnick, Ryan
    Guindani, Michele
    Erhardt, Erik
    Allen, Elena
    Calhoun, Vince
    Vannucci, Marina
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 134 - 151
  • [36] Estimating the distribution of sensorimotor synchronization data: A Bayesian hierarchical modeling approach
    Rasmus Bååth
    Behavior Research Methods, 2016, 48 : 463 - 474
  • [37] Estimating the accuracy of vectors derived from open data
    Nikolakopoulos, Konstantinos G.
    Dimitropoulos, George
    EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS VIII, 2017, 10428
  • [38] Nonparametric targeted Bayesian estimation of class proportions in unlabeled data
    Diaz, Ivan
    Savenkov, Oleksander
    Kamel, Hooman
    BIOSTATISTICS, 2022, 23 (01) : 274 - 293
  • [39] A Dynamic Centroid Text Classification Approach by Learning from Unlabeled Data
    Jiang, Cuicui
    Zhu, Dingju
    Jiang, Qingshan
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON MULTIMEDIA TECHNOLOGY (ICMT-13), 2013, 84 : 1420 - 1429
  • [40] Classifier Invariant Approach to Learn from Positive-Unlabeled Data
    Dhurandhar, Amit
    Gurumoorthy, Karthik S.
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 102 - 111