Exploiting diversity in ensembles: Improving the performance on unbalanced datasets

被引:0
|
作者
Chawla, Nitesh V. [1 ]
Sylvester, Jared [2 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensembles are often capable of greater predictive performance than any of their individual classifiers. Despite the need for classifiers to make different kinds of errors, the majority voting scheme, typically used, treats each classifier as though it contributed equally to the group's performance. This can be particularly limiting, on unbalanced datasets, as one is more interested in complementing classifiers that can assist in improving the true positive rate without signicantly increasing the false positive rate. Therefore, we implement a genetic algorithm based framework to weight the contribution of each classifier by an appropriate fitness function, such that the classifiers that complement each other on the unbalanced dataset are preferred, resulting in significantly improved performances. The proposed framework can be built on top of any collection of classifiers with different fitness functions.
引用
收藏
页码:397 / +
页数:2
相关论文
共 50 条
  • [1] Exploiting diversity of neural ensembles with speciated evolution
    Lee, SI
    Ahn, JH
    Cho, SB
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 808 - 813
  • [2] Ordering-based pruning for improving the performance of ensembles of classifiers in the framework of imbalanced datasets
    Galar, Mikel
    Fernandez, Alberto
    Barrenechea, Edurne
    Bustince, Humberto
    Herrera, Francisco
    INFORMATION SCIENCES, 2016, 354 : 178 - 196
  • [3] Improving Diversity in Concept Drift Ensembles
    Martinez Perez, Jose Luis
    Palomino Marino, Laura Maria
    Maior de Barros, Roberto Souto
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [4] Exploiting Performance Estimates for Augmenting Recommendation Ensembles
    Penha, Gustavo
    Santos, Rodrygo L. T.
    RECSYS 2020: 14TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, 2020, : 111 - 119
  • [5] Robust Document Clustering by Exploiting Feature Diversity in Cluster Ensembles
    Sevillano, Xavier
    Cobo, German
    Alias, Francesc
    Claudi Socoro, Joan
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2006, (37): : 169 - 176
  • [6] Exploiting the noise: improving biomarkers with ensembles of data analysis methodologies
    Maud HW Starmans
    Melania Pintilie
    Thomas John
    Sandy D Der
    Frances A Shepherd
    Igor Jurisica
    Philippe Lambin
    Ming-Sound Tsao
    Paul C Boutros
    Genome Medicine, 4
  • [7] Improving Robustness and Calibration in Ensembles with Diversity Regularization
    Mehrtens, Hendrik Alexander
    Gonzalez, Camila
    Mukhopadhyay, Anirban
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 36 - 50
  • [8] Exploiting the noise: improving biomarkers with ensembles of data analysis methodologies
    Starmans, Maud H. W.
    Pintilie, Melania
    John, Thomas
    Der, Sandy D.
    Shepherd, Frances A.
    Jurisica, Igor
    Lambin, Philippe
    Tsao, Ming-Sound
    Boutros, Paul C.
    GENOME MEDICINE, 2012, 4
  • [9] EXPLOITING DIVERSITY OF NEURAL NETWORK ENSEMBLES BASED ON EXTREME LEARNING MACHINE
    Garcia-Laencina, Pedro J.
    Roca-Gonzalez, Jose-Luis
    Bueno-Crespo, Andres
    Sancho-Gomez, Jose-Luis
    NEURAL NETWORK WORLD, 2013, 23 (05) : 395 - 409
  • [10] Improving peptide-MHC class I binding prediction for unbalanced datasets
    Ana Paula Sales
    Georgia D Tomaras
    Thomas B Kepler
    BMC Bioinformatics, 9