Test strategies for cost-sensitive decision trees

被引:87
|
作者
Ling, Charles X. [1 ]
Sheng, Victor S.
Yang, Qiang
机构
[1] Univ Western Ontario, Dept Comp Sci, London, ON N6A 5B7, Canada
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
induction; concept learning; mining methods and algorithms; classification;
D O I
10.1109/TKDE.2006.131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In medical diagnosis, doctors must often determine what medical tests ( e. g., X-ray and blood tests) should be ordered for a patient to minimize the total cost of medical tests and misdiagnosis. In this paper, we design cost-sensitive machine learning algorithms to model this learning and diagnosis process. Medical tests are like attributes in machine learning whose values may be obtained at a cost ( attribute cost), and misdiagnoses are like misclassifications which may also incur a cost ( misclassification cost). We first propose a lazy decision tree learning algorithm that minimizes the sum of attribute costs and misclassification costs. Then, we design several novel "test strategies" that can request to obtain values of unknown attributes at a cost ( similar to doctors' ordering of medical tests at a cost) in order to minimize the total cost for test examples ( new patients). These test strategies correspond to different situations in real-world diagnoses. We empirically evaluate these test strategies, and show that they are effective and outperform previous methods. Our results can be readily applied to real-world diagnosis tasks. A case study on heart disease is given throughout the paper.
引用
收藏
页码:1055 / 1067
页数:13
相关论文
共 50 条
  • [41] Cost-sensitive Decision Tree with Missing Values and Multiple Cost Scales
    Liu, Xingyi
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 294 - 297
  • [42] Time-constrained cost-sensitive decision tree induction
    Chen, Yen-Liang
    Wu, Chia-Chi
    Tang, Kwei
    INFORMATION SCIENCES, 2016, 354 : 140 - 152
  • [43] Cost-sensitive decision tree ensembles for effective imbalanced classification
    Krawczyk, Bartosz
    Wozniak, Michal
    Schaefer, Gerald
    APPLIED SOFT COMPUTING, 2014, 14 : 554 - 562
  • [44] An empirical comparison of cost-sensitive decision tree induction algorithms
    Lomax, Susan
    Vadera, Sunil
    EXPERT SYSTEMS, 2011, 28 (03) : 227 - 268
  • [45] Cost-Sensitive Boosting
    Masnadi-Shirazi, Hamed
    Vasconcelos, Nuno
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (02) : 294 - 309
  • [46] Cost-Sensitive Three-Way Decision: A Sequential Strategy
    Li, Huaxiong
    Zhou, Xianzhong
    Huang, Bing
    Liu, Dun
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY: 8TH INTERNATIONAL CONFERENCE, 2013, 8171 : 325 - 337
  • [47] Cost-Sensitive Multigranulation Approximation in Decision-Making Applications
    Yang, Jie
    Kuang, Juncheng
    Liu, Qun
    Liu, Yanmin
    ELECTRONICS, 2022, 11 (22)
  • [48] A Cost-sensitive Decision Tree under the Condition of Multiple Classes
    Feng, Shaorong
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 1212 - 1218
  • [49] Cost-Sensitive Learning
    Zhou, Zlii-Hua
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2011, 2011, 6820 : 17 - 18
  • [50] A new decision to take for cost-sensitive Naive Bayes classifiers
    Di Nunzio, Giorgio Maria
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (05) : 653 - 674