ImbTreeEntropy: An R package for building entropy-based classification trees on imbalanced datasets

被引:2
|
作者
Gajowniczek, Krzysztof [1 ]
Zabkowski, Tomasz [1 ]
机构
[1] Warsaw Univ Life Sci SGGW, Inst Informat Technol, Dept Artificial Intelligence, PL-02776 Warsaw, Poland
关键词
Decision trees; Generalized entropy; Cost-sensitive learning; Imbalanced data;
D O I
10.1016/j.softx.2021.100841
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a novel R package, named ImbTreeEntropy, for building binary and multiclass decision trees using generalized entropy functions, such as Renyi, Tsallis, Sharma-Mittal, Sharma-Taneja and Kapur, to measure the impurity of a node. These are important extensions of the existing algorithms that usually employ Shannon entropy and the concept of information gain. Additionally, ImbTreeEntropy is able to handle imbalanced data, which is a challenging issue in many practical applications. The package supports cost-sensitive learning by defining a misclassification cost matrix and weighted sensitive learning. It accepts all types of attributes, including continuous, ordered and nominal attributes. The package and its code are made freely available. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] ImbTreeAUC: An R package for building classification trees using the area under the ROC curve (AUC) on imbalanced datasets
    Gajowniczek, Krzysztof
    Zabkowski, Tomasz
    [J]. SOFTWAREX, 2021, 15
  • [2] Entropy-based fuzzy support vector machine for imbalanced datasets
    Fan, Qi
    Wang, Zhe
    Li, Dongdong
    Gao, Daqi
    Zha, Hongyuan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 115 : 87 - 99
  • [3] ImbTreeEntropy and ImbTreeAUC: Novel R Packages for Decision Tree Learning on the Imbalanced Datasets
    Gajowniczek, Krzysztof
    Zabkowski, Tomasz
    [J]. ELECTRONICS, 2021, 10 (06) : 1 - 23
  • [4] Rank Entropy-Based Decision Trees for Monotonic Classification
    Hu, Qinghua
    Che, Xunjian
    Zhang, Lei
    Zhang, David
    Guo, Maozu
    Yu, Daren
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (11) : 2052 - 2064
  • [5] Entropy-based fuzzy support vector machine for imbalanced datasets (vol 115, pg 87, 2017)
    Rezvani, Salim
    Wang, Xizhao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 192
  • [6] EWGAN: Entropy-Based Wasserstein GAN for Imbalanced Learning
    Ren, Jinfu
    Liu, Yang
    Liu, Jiming
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10011 - 10012
  • [7] Entropy-Based Estimation in Classification Problems
    Yu. A. Dubnov
    [J]. Automation and Remote Control, 2019, 80 : 502 - 512
  • [8] Entropy-Based Iterative Face Classification
    Kyperountas, Marios
    Tefas, Anastasios
    Pitas, Ioannis
    [J]. BIOMETRICS AND ID MANAGEMENT, 2011, 6583 : 137 - 143
  • [9] Entropy-Based Estimation in Classification Problems
    Dubnov, Yu. A.
    [J]. AUTOMATION AND REMOTE CONTROL, 2019, 80 (03) : 502 - 512
  • [10] Contextual classification by entropy-based polygonization
    Hermes, L
    Buhmann, JM
    [J]. 2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 442 - 447