Relationships between Diversity of Classification Ensembles and Single-Class Performance Measures

被引:87
|
作者
Wang, Shuo [1 ]
Yao, Xin [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
Class imbalance learning; ensemble learning; diversity; single-class performance measures; data mining; STATISTICS; ACCURACY;
D O I
10.1109/TKDE.2011.207
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In class imbalance learning problems, how to better recognize examples from the minority class is the key focus, since it is usually more important and expensive than the majority class. Quite a few ensemble solutions have been proposed in the literature with varying degrees of success. It is generally believed that diversity in an ensemble could help to improve the performance of class imbalance learning. However, no study has actually investigated diversity in depth in terms of its definitions and effects in the context of class imbalance learning. It is unclear whether diversity will have a similar or different impact on the performance of minority and majority classes. In this paper, we aim to gain a deeper understanding of if and when ensemble diversity has a positive impact on the classification of imbalanced data sets. First, we explain when and why diversity measured by Q-statistic can bring improved overall accuracy based on two classification patterns proposed by Kuncheva et al. We define and give insights into good and bad patterns in imbalanced scenarios. Then, the pattern analysis is extended to single-class performance measures, including recall, precision, and F-measure, which are widely used in class imbalance learning. Six different situations of diversity's impact on these measures are obtained through theoretical analysis. Finally, to further understand how diversity affects the single class performance and overall performance in class imbalance problems, we carry out extensive experimental studies on both artificial data sets and real-world benchmarks with highly skewed class distributions. We find strong correlations between diversity and discussed performance measures. Diversity shows a positive impact on the minority class in general. It is also beneficial to the overall performance in terms of AUC and G-mean.
引用
收藏
页码:206 / 219
页数:14
相关论文
共 50 条
  • [41] Relationships between continuous performance task scores and other cognitive measures: Causality or commonality?
    Aylward, GP
    Gordon, M
    Verhulst, SJ
    ASSESSMENT, 1997, 4 (04) : 325 - 336
  • [42] High intensity exercise. assessment: Relationships between laboratory and field measures of performance
    Baker, JS
    Davies, B
    JOURNAL OF SCIENCE AND MEDICINE IN SPORT, 2002, 5 (04) : 341 - 347
  • [43] Relationships between the BASFI questionnaire and performance measures of physical functioning in patients with ankylosing spondylitis
    van Weely, S. F. E.
    van Denderen, J. C.
    Steultjens, M. P. M.
    Nurmohamed, M. T.
    Dijkmans, B. A. C.
    Dekker, J.
    van der Horst-Bruinsma, I. E.
    CLINICAL AND EXPERIMENTAL RHEUMATOLOGY, 2008, 26 (04) : 751 - 751
  • [44] Relationships between Resisted Sprint Performance and Different Strength and Power Measures in Rugby Players
    Zabaloy, Santiago
    Carlos-Vivas, Jorge
    Freitas, Tomas T.
    Pareja-Blanco, Fernando
    Pereira, Lucas
    Loturco, Irineu
    Comyns, Thomas
    Galvez-Gonzalez, Javier
    Alcaraz, Pedro E.
    SPORTS, 2020, 8 (03)
  • [45] THE RELATIONSHIPS BETWEEN PERFORMANCE MEASURES AND EMPLOYEE OUTCOMES: THE MEDIATING ROLES OF PROCEDURAL FAIRNESS AND TRUST
    Chia, Debbie P. S.
    Lau, Chong M.
    Tan, Sharon L. C.
    PERFORMANCE MEASUREMENT AND MANAGEMENT CONTROL: BEHAVIORAL IMPLICATIONS AND HUMAN ACTIONS, 2014, 28 : 203 - 232
  • [46] An Empirical Investigation of the Relationships between Modes and Degree of Expatriate Adjustment and Multiple Measures of Performance
    Shay, Jeffrey P.
    Baack, Sally
    INTERNATIONAL JOURNAL OF CROSS CULTURAL MANAGEMENT, 2006, 6 (03) : 275 - 294
  • [47] RELATIONSHIPS BETWEEN DIVERSITY OF INTERESTS, AGE, JOB-SATISFACTION AND JOB-PERFORMANCE
    ARVEY, RD
    DEWHIRST, HD
    JOURNAL OF OCCUPATIONAL PSYCHOLOGY, 1979, 52 (01): : 17 - 23
  • [48] Scheduling of multi-class single-server queues under nontraditional performance measures
    Ayhan, H
    Olsen, TL
    OPERATIONS RESEARCH, 2000, 48 (03) : 482 - 489
  • [49] RELATIONSHIPS BETWEEN MINNESOTA MULTIPHASIC PERSONALITY INVENTORY SCORES AND JOB PERFORMANCE MEASURES OF FIRE FIGHTERS
    ARVEY, RD
    PAYNE, G
    MUSSIO, SJ
    PSYCHOLOGICAL REPORTS, 1972, 31 (01) : 199 - &
  • [50] Modeling relationships between traditional preadmission measures and clinical skills performance on a medical licensure examination
    Roberts, William L.
    Pugliano, Gina
    Langenau, Erik
    Boulet, John R.
    ADVANCES IN HEALTH SCIENCES EDUCATION, 2012, 17 (03) : 403 - 417