Exploiting diversity in ensembles: Improving the performance on unbalanced datasets

被引:0
|
作者
Chawla, Nitesh V. [1 ]
Sylvester, Jared [2 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensembles are often capable of greater predictive performance than any of their individual classifiers. Despite the need for classifiers to make different kinds of errors, the majority voting scheme, typically used, treats each classifier as though it contributed equally to the group's performance. This can be particularly limiting, on unbalanced datasets, as one is more interested in complementing classifiers that can assist in improving the true positive rate without signicantly increasing the false positive rate. Therefore, we implement a genetic algorithm based framework to weight the contribution of each classifier by an appropriate fitness function, such that the classifiers that complement each other on the unbalanced dataset are preferred, resulting in significantly improved performances. The proposed framework can be built on top of any collection of classifiers with different fitness functions.
引用
收藏
页码:397 / +
页数:2
相关论文
共 50 条
  • [41] Improving neural network performance on the classification of complex geographic datasets
    Gahegan M.
    German G.
    West G.
    Journal of Geographical Systems, 1999, 1 (1) : 3 - 22
  • [42] RSMOTE: improving classification performance over imbalanced medical datasets
    Naseriparsa, Mehdi
    Al-Shammari, Ahmed
    Sheng, Ming
    Zhang, Yong
    Zhou, Rui
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2020, 8 (01)
  • [43] PFCG: Improving the Restore Performance of Package Datasets in Deduplication Systems
    Zuo, Chunxue
    Fang, Wang
    Huang, Ping
    Hu, Yuchong
    Feng, Dan
    Zhang, Yucheng
    2018 IEEE 36TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2018, : 553 - 560
  • [44] Bounding XCS's parameters for unbalanced datasets
    Orriols-Puig, Albert
    Bernado-Mansilla, Ester
    GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 1561 - +
  • [45] Performance and diversity evaluation in hybrid and non-hybrid structures of ensembles
    Canuto, AMP
    Oliveira, LDM
    Xavier, JC
    Santos, ADM
    Abreu, MCC
    HIS 2005: 5TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, : 285 - 290
  • [46] Improving the adversarial robustness of quantized neural networks via exploiting the feature diversity
    Chu, Tianshu
    Fang, Kun
    Yang, Jie
    Huang, Xiaolin
    PATTERN RECOGNITION LETTERS, 2023, 176 : 117 - 122
  • [47] On the design and performance of scheduling policies exploiting spatial diversity for URLLC
    Chagdali, A.
    Elayoubi, S. E.
    Masucci, A. M.
    Simonian, A.
    COMPUTER COMMUNICATIONS, 2023, 212 : 275 - 283
  • [48] Exploiting diversity and correlation to improve the performance of intrusion detection systems
    Coppolino, L.
    D'Antonio, S.
    Esposito, M.
    Romano, L.
    2009 INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE SECURITY, 2009, : 167 - +
  • [49] Improving the SSD Performance by Exploiting Request Characteristics and Internal Parallelism
    Mao, Bo
    Wu, Suzhen
    Duan, Lide
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (02) : 472 - 484
  • [50] Improving Plasmon Sensing Performance by Exploiting the Spatially Confined Field
    Liu, Zhengqi
    Liu, Guiqiang
    Liu, Xiaoshan
    Huang, Shan
    Pan, Pingping
    Wang, Yan
    Zou, Chengwu
    Gu, Gang
    PLASMONICS, 2016, 11 (01) : 29 - 36