DECISION TREE WITH BETTER CLASS PROBABILITY ESTIMATION

被引:12
|
作者
Jiang, Liangxiao [1 ]
Li, Chaoqun [2 ]
Cai, Zhihua [3 ]
机构
[1] China Univ Geosci, Fac Comp Sci, Wuhan 430074, Hubei, Peoples R China
[2] China Univ Geosci, Fac Math, Wuhan 430074, Hubei, Peoples R China
[3] China Univ Geosci, Fac Comp Sci, Wuhan 430074, Hubei, Peoples R China
关键词
C4.4; locally weighted C4.4; class probability estimation; locally weighted learning; conditional log likelihood; AUC; classification; ROC CURVE; AREA;
D O I
10.1142/S0218001409007296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditionally, the performance of a classifier is measured by its classification accuracy or error rate. In fact, probability-based classifiers also produce the class probability estimation (the probability that a test instance belongs to the predicted class). This information is often ignored in classification, as long as the class with the highest class probability estimation is identical to the actual class. In many data mining applications, however, classification accuracy and error rate are not enough. For example, in direct marketing, we often need to deploy different promotion strategies to customers with different likelihood (class probability) of buying some products. Thus, accurate class probability estimations are often required to make optimal decisions. In this paper, we firstly review some state-of-the-art probability-based classifiers and empirically investigate their class probability estimation performance. From our experimental results, we can draw a conclusion: C4.4 is an attractive algorithm for class probability estimation. Then, we present a locally weighted version of C4.4 to scale up its class probability estimation performance by combining locally weighted learning with C4.4. We call our improved algorithm locally weighted C4.4, simply LWC4.4. We experimentally test LWC4.4 using the whole 36 UCI data sets selected by Weka. The experimental results show that LWC4.4 significantly outperforms C4.4 in terms of class probability estimation.
引用
收藏
页码:745 / 763
页数:19
相关论文
共 50 条
  • [1] Tree aggregation for random forest class probability estimation
    Sage, Andrew J.
    Genschel, Ulrike
    Nettleton, Dan
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (02) : 134 - 150
  • [2] Improved class probability estimates from decision tree models
    Margineantu, DD
    Dietterich, TG
    NONLINEAR ESTIMATION AND CLASSIFICATION, 2003, 171 : 173 - 188
  • [3] Improving Tree augmented Naive Bayes for class probability estimation
    Jiang, Liangxiao
    Cai, Zhihua
    Wang, Dianhong
    Zhang, Harry
    KNOWLEDGE-BASED SYSTEMS, 2012, 26 : 239 - 245
  • [4] An Efficient Probability Estimation Decision Tree Postprocessing Method for Mining Optimal Profitable Knowledge for Enterprises with Multi-Class Customers
    Muneiah, Janapati Naga
    Rao, Ch D. V. Subba
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAL JOURNAL OF ARTIFICIAL INTELLIGENCE, 2019, 22 (64): : 63 - 84
  • [5] A one-class classification decision tree based on kernel density estimation
    Itani, Sarah
    Lecron, Fabian
    Fortemps, Philippe
    APPLIED SOFT COMPUTING, 2020, 91
  • [6] Building a Better Decision Tree by Delaying the Split Decision
    Caudle, Kyle
    Pyeatt, Larry
    Morast, Anthony
    Karlsson, Christer
    Hoover, Randy C.
    PROCEEDINGS OF THE 2019 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTE AND DATA ANALYSIS (ICCDA 2019), 2019, : 78 - 83
  • [7] Subjective probability over a subjective decision tree
    Takeoka, Norio
    JOURNAL OF ECONOMIC THEORY, 2007, 136 (01) : 536 - 571
  • [8] Class probability estimation for medical studies
    Simon, Richard
    BIOMETRICAL JOURNAL, 2014, 56 (04) : 597 - 600
  • [9] An exact probability metric for decision tree splitting and stopping
    Martin, JK
    MACHINE LEARNING, 1997, 28 (2-3) : 257 - 291
  • [10] PROBABILITY FOR POSTOPERATIVE INFECTION UTILIZING A DECISION TREE IN THE SICU
    Ghabra, Hussam
    White, William
    Ngoc, Vu
    Jennings, Katherine
    Townsend, Michael
    Goldberg, Joshua
    Boysen, Philip
    Nossaman, Bobby
    CRITICAL CARE MEDICINE, 2016, 44 (12)