Exploiting diversity in ensembles: Improving the performance on unbalanced datasets

被引:0
|
作者
Chawla, Nitesh V. [1 ]
Sylvester, Jared [2 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensembles are often capable of greater predictive performance than any of their individual classifiers. Despite the need for classifiers to make different kinds of errors, the majority voting scheme, typically used, treats each classifier as though it contributed equally to the group's performance. This can be particularly limiting, on unbalanced datasets, as one is more interested in complementing classifiers that can assist in improving the true positive rate without signicantly increasing the false positive rate. Therefore, we implement a genetic algorithm based framework to weight the contribution of each classifier by an appropriate fitness function, such that the classifiers that complement each other on the unbalanced dataset are preferred, resulting in significantly improved performances. The proposed framework can be built on top of any collection of classifiers with different fitness functions.
引用
收藏
页码:397 / +
页数:2
相关论文
共 50 条
  • [21] AdaBoosted Deep Ensembles: Getting Maximum Performance Out of Small Training Datasets
    Reza, Syed M. S.
    Butman, John A.
    Park, Deric M.
    Pham, Dzung L.
    Roy, Snehashis
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2020, 2020, 12436 : 572 - 582
  • [22] Improving the performance of predictive process modeling for large datasets
    Finley, Andrew O.
    Sang, Huiyan
    Banerjee, Sudipto
    Gelfand, Alan E.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (08) : 2873 - 2884
  • [23] Hybrid algorithm for classification of unbalanced datasets
    Han, Min
    Zhu, Xin-Rong
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2011, 28 (10): : 1485 - 1489
  • [24] Diversity techniques improve the performance of the best imbalance learning ensembles
    Diez-Pastor, Jose F.
    Rodriguez, Juan J.
    Garcia-Osorio, Cesar I.
    Kuncheva, Ludmila I.
    INFORMATION SCIENCES, 2015, 325 : 98 - 117
  • [25] Improving Map Reduce Performance by Exploiting Input Redundancy
    Kim, Shin-Gyu
    Han, Hyuck
    Jung, Hyungsoo
    Eom, Hyeonsang
    Yeom, Heon Y.
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (03) : 1137 - 1152
  • [26] Improving bagging performance through multi-algorithm ensembles
    Hsu, Kuo-Wei
    Srivastava, Jaideep
    FRONTIERS OF COMPUTER SCIENCE, 2012, 6 (05) : 498 - 512
  • [27] Exploiting Network Parallelism for Improving Data Transfer Performance
    Gunter, Dan
    Kettimuthu, Raj
    Kissel, Ezra
    Swany, Martin
    Yi, Jun
    Zurawski, Jason
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 1600 - 1606
  • [28] Improving the Performance of Recommender System by Exploiting the Categories of Products
    Sharma, Mohak
    Reddy, P. Krishna
    Kiran, R. Uday
    Ragunathan, T.
    DATABASES IN NETWORKED INFORMATION SYSTEMS, 2011, 7108 : 137 - +
  • [29] Improving BitTorrent Traffic Performance by Exploiting Geographic Locality
    Tian, Chen
    Liu, Xue
    Jiang, Hongbo
    Liu, Wenyu
    Wang, Yi
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [30] Improving bagging performance through multi-algorithm ensembles
    Kuo-Wei Hsu
    Jaideep Srivastava
    Frontiers of Computer Science, 2012, 6 : 498 - 512