Performance evaluation of classifier ensembles in terms of diversity and performance of individual systems

Cited by: 4
Authors
Chung, Yun-Sheng [1]
Hsu, D. [2]
Liu, Chun-Yi [1]
Tang, Chuan-Yi [3]
Affiliations
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Fordham Univ, Dept Comp & Informat Sci, New York, NY 10023 USA
[3] Providence Univ, Dept Comp & Informat Engn, Taichung, Taiwan
Keywords
Classification schemes; Performance appraisal; Performance criteria
DOI
10.1108/17427371011097604
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Purpose - Multiple classifier systems (MCS) are widely used in computing, communications, and informatics, and combining multiple classifier systems has been shown to outperform a single classifier system. It has been demonstrated that the improvement in ensemble performance depends on either the diversity among the individual systems or their performance. A variety of diversity measures and ensemble methods have been proposed and studied. However, estimating ensemble performance in terms of the performance of, and the diversity among, individual systems remains a challenging problem. The purpose of this paper is to study the general problem of estimating ensemble performance for various combination methods using the concept of a performance distribution pattern (PDP).

Design/methodology/approach - In particular, the paper establishes upper and lower bounds for three cases: majority voting ensemble performance with the disagreement diversity measure Dis; weighted majority voting performance in terms of weighted average performance and weighted disagreement diversity; and plurality voting ensemble performance with the entropy diversity measure $\bar{D}$.

Findings - The bounds for these three cases are shown to be tight using the PDP for the input set.

Originality/value - As a consequence of the authors' previous results on diversity equivalence, the results on majority voting ensemble performance can be extended to several other diversity measures. Moreover, for majority voting, the paper shows that when the average performance of the individual systems, $\bar{P}$, is sufficiently large, the ensemble performance $P_m$ resulting from a maximum (information-theoretic) entropy PDP is an increasing function of the disagreement diversity Dis. Eight experiments using data sets from various application domains are conducted to demonstrate the complexity, richness, and diversity of the problem of estimating ensemble performance.
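
The three quantities the abstract relates for majority voting (average individual performance, disagreement diversity, and ensemble performance) can be illustrated with a small sketch. It is not the paper's method: it assumes the commonly used pairwise definition of disagreement (the fraction of samples on which exactly one of two classifiers is correct, averaged over all pairs) and treats the ensemble as correct on a sample when more than half of the classifiers are correct. The function names and the toy `correct` matrix are hypothetical.

```python
from itertools import combinations

# correct[i][j] is 1 when classifier i is right on sample j, else 0.
# Toy data (hypothetical): three classifiers, four samples.
correct = [
    [1, 1, 0, 1],
    [1, 0, 1, 1],
    [0, 1, 1, 1],
]

def average_performance(correct):
    """Mean accuracy over all classifiers."""
    return sum(sum(row) / len(row) for row in correct) / len(correct)

def disagreement(correct):
    """Pairwise disagreement averaged over all classifier pairs (assumed definition)."""
    pairs = list(combinations(correct, 2))
    total = 0.0
    for a, b in pairs:
        # Fraction of samples on which exactly one of the two classifiers is correct.
        total += sum(x != y for x, y in zip(a, b)) / len(a)
    return total / len(pairs)

def majority_vote_performance(correct):
    """Fraction of samples on which more than half of the classifiers are correct."""
    n_classifiers = len(correct)
    n_samples = len(correct[0])
    wins = sum(
        1
        for j in range(n_samples)
        if 2 * sum(row[j] for row in correct) > n_classifiers
    )
    return wins / n_samples

print(average_performance(correct))        # 0.75
print(disagreement(correct))               # 0.5
print(majority_vote_performance(correct))  # 1.0
```

In this toy case every classifier is 75% accurate, yet their errors fall on different samples, so the majority vote is correct on all four samples.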
Pages: 373 / +
Number of pages: 32