Exploiting diversity in ensembles: Improving the performance on unbalanced datasets

Citations: 0
Authors
Chawla, Nitesh V. [1]
Sylvester, Jared [2]
Affiliations
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46556 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
DOI
Not available
Chinese Library Classification number
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Ensembles are often capable of greater predictive performance than any of their individual classifiers. Although an ensemble depends on its classifiers making different kinds of errors, the commonly used majority-voting scheme treats each classifier as though it contributed equally to the group's performance. This can be particularly limiting on unbalanced datasets, where one is more interested in complementary classifiers that can help improve the true positive rate without significantly increasing the false positive rate. Therefore, we implement a genetic-algorithm-based framework that weights the contribution of each classifier according to an appropriate fitness function, so that classifiers that complement each other on the unbalanced dataset are preferred, resulting in significantly improved performance. The proposed framework can be built on top of any collection of classifiers and used with different fitness functions.
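The idea of evolving per-classifier vote weights can be illustrated with a minimal sketch. This is not the authors' implementation: the F1 fitness, the 0.5 vote threshold, the elitist selection with uniform crossover and Gaussian mutation, the toy votes, and all names (`f1_score`, `weighted_vote`, `evolve_weights`) are assumptions made for illustration only.

```python
import random

def f1_score(y_true, y_pred):
    """F1 on the positive (minority) class; an assumed fitness function."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def weighted_vote(weights, votes, threshold=0.5):
    """votes[i][j] is classifier j's 0/1 prediction for example i."""
    total = sum(weights)
    if total == 0:
        return [0] * len(votes)
    return [
        1 if sum(w * v for w, v in zip(weights, row)) / total >= threshold else 0
        for row in votes
    ]

def evolve_weights(votes, y_true, fitness=f1_score, pop_size=30,
                   generations=40, mutation_rate=0.2, seed=0):
    """Toy genetic algorithm over weight vectors in [0, 1] (assumes pop_size >= 2)."""
    rng = random.Random(seed)
    n = len(votes[0])

    def score(w):
        return fitness(y_true, weighted_vote(w, votes))

    # Include the uniform weighting (plain majority vote) in the initial
    # population; with elitism the result is then never worse than it
    # on this data.
    population = [[1.0] * n]
    population += [[rng.random() for _ in range(n)] for _ in range(pop_size - 1)]

    for _ in range(generations):
        ranked = sorted(population, key=score, reverse=True)
        elite = ranked[:max(2, pop_size // 5)]  # survivors carried over unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            child = [
                min(1.0, max(0.0, g + rng.gauss(0, 0.1)))
                if rng.random() < mutation_rate else g
                for g in child
            ]
            children.append(child)
        population = elite + children
    return max(population, key=score)

# Hypothetical validation-set votes from three classifiers on unbalanced data.
y_true = [1, 1, 0, 0, 0, 0, 0, 0]
votes = [[1, 1, 0], [1, 0, 0], [0, 0, 1]] + [[0, 0, 0]] * 5

best = evolve_weights(votes, y_true)
majority_f1 = f1_score(y_true, weighted_vote([1.0, 1.0, 1.0], votes))
evolved_f1 = f1_score(y_true, weighted_vote(best, votes))
```

In this toy data, equal weighting misses the second positive example because only one classifier detects it. Swapping in a different `fitness` callable changes what the search rewards, which mirrors the abstract's remark that the framework can be used with different fitness functions. In practice the weights would be evolved on held-out data rather than on the data used to report performance.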
Pages: 397 / +
Page count: 2