A Bayesian method for comparing and combining binary classifiers in the absence of a gold standard

被引:3
|
作者
Keith, Jonathan M. [1 ]
Davey, Christian M. [1 ]
Boyd, Sarah E. [1 ]
机构
[1] Monash Univ, Sch Math Sci, Clayton, Vic 3800, Australia
来源
BMC BIOINFORMATICS | 2012年 / 13卷
基金
澳大利亚研究理事会;
关键词
Binary classifier; Bayesian methods; Protein sub-cellular localisation; Diagnostic tests; Genome wide association studies; WINBUGS; TESTS;
D O I
10.1186/1471-2105-13-179
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Many problems in bioinformatics involve classification based on features such as sequence, structure or morphology. Given multiple classifiers, two crucial questions arise: how does their performance compare, and how can they best be combined to produce a better classifier? A classifier can be evaluated in terms of sensitivity and specificity using benchmark, or gold standard, data, that is, data for which the true classification is known. However, a gold standard is not always available. Here we demonstrate that a Bayesian model for comparing medical diagnostics without a gold standard can be successfully applied in the bioinformatics domain, to genomic scale data sets. We present a new implementation, which unlike previous implementations is applicable to any number of classifiers. We apply this model, for the first time, to the problem of finding the globally optimal logical combination of classifiers. Results: We compared three classifiers of protein subcellular localisation, and evaluated our estimates of sensitivity and specificity against estimates obtained using a gold standard. The method overestimated sensitivity and specificity with only a small discrepancy, and correctly ranked the classifiers. Diagnostic tests for swine flu were then compared on a small data set. Lastly, classifiers for a genome-wide association study of macular degeneration with 541094 SNPs were analysed. In all cases, run times were feasible, and results precise. The optimal logical combination of classifiers was also determined for all three data sets. Code and data are available from http://bioinformatics.monash.edu.au/downloads/. Conclusions: The examples demonstrate the methods are suitable for both small and large data sets, applicable to the wide range of bioinformatics classification problems, and robust to dependence between classifiers. In all three test cases, the globally optimal logical combination of the classifiers was found to be their union, according to three out of four ranking criteria. We propose as a general rule of thumb that the union of classifiers will be close to optimal.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Combining Binary Classifiers for a Multiclass Problem with Differential Privacy
    Sazonova, Vera
    Matwin, Stan
    [J]. TRANSACTIONS ON DATA PRIVACY, 2014, 7 (01) : 51 - 70
  • [32] BAYESIAN-ESTIMATION OF DISEASE PREVALENCE AND THE PARAMETERS OF DIAGNOSTIC-TESTS IN THE ABSENCE OF A GOLD STANDARD
    JOSEPH, L
    GYORKOS, TW
    COUPAL, L
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1995, 141 (03) : 263 - 272
  • [33] Comparison of four confirmatory tests for the diagnosis of primary aldosteronism: Bayesian analysis in the absence of a gold standard
    Shen, Hang
    Luo, Wenjin
    Hu, Jinbo
    Yang, Jun
    Song, Ying
    Chen, Xiangjun
    Yang, Yi
    Ma, Linqiang
    Cheng, Qingfeng
    Wang, Zhihong
    Li, Qifu
    Yang, Shumin
    [J]. ENDOCRINE, 2024, 85 (03) : 1417 - 1424
  • [34] Bayesian Meta-Analysis of the Accuracy of a Test for Tuberculous Pleuritis in the Absence of a Gold Standard Reference
    Dendukuri, Nandini
    Schiller, Ian
    Joseph, Lawrence
    Pai, Madhukar
    [J]. BIOMETRICS, 2012, 68 (04) : 1285 - 1293
  • [35] Interferon release assay for the diagnosis of uveitis associated with tuberculosis: a Bayesian evaluation in the absence of a gold standard
    Ang, Marcus
    Wong, Wan Ling
    Li, Xiang
    Chee, Soon-Phaik
    [J]. BRITISH JOURNAL OF OPHTHALMOLOGY, 2013, 97 (08) : 1062 - 1067
  • [36] Handling numeric attributes when comparing Bayesian network classifiers: does the discretization method matter?
    M. Julia Flores
    José A. Gámez
    Ana M. Martínez
    José M. Puerta
    [J]. Applied Intelligence, 2011, 34 : 372 - 385
  • [37] Handling numeric attributes when comparing Bayesian network classifiers: does the discretization method matter?
    Flores, M. Julia
    Gamez, Jose A.
    Martinez, Ana M.
    Puerta, Jose M.
    [J]. APPLIED INTELLIGENCE, 2011, 34 (03) : 372 - 385
  • [38] Bayesian sample size determination for prevalence and diagnostic test studies in the absence of a gold standard test
    Dendukuri, N
    Rahme, E
    Bélisle, P
    Joseph, L
    [J]. BIOMETRICS, 2004, 60 (02) : 388 - 397
  • [39] Measuring obesity in the absence of a gold standard
    O'Neill, Donal
    [J]. ECONOMICS & HUMAN BIOLOGY, 2015, 17 : 116 - 128
  • [40] COMPARING BAYESIAN MODELS IN THE ABSENCE OF GROUND TRUTH
    Pereyra, Marcelo
    McLaughlin, Steve
    [J]. 2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 528 - 532