Cost-sensitive case-based reasoning using a genetic algorithm: Application to medical diagnosis

被引:28
|
作者
Park, Yoon-Joo [1 ]
Chun, Se-Hak [1 ]
Kim, Byung-Chun [2 ]
机构
[1] Seoul Natl Univ Sci & Technol, Dept Business Adm, Seoul 139743, South Korea
[2] KAIST Business Sch, Seoul 130722, South Korea
关键词
Cost-sensitive case-based reasoning; Misclassification cost; Genetic algorithm; Medical diagnosis; Heart disease; Diabetes; Hepatitis; Breast cancer; SYSTEM; MACHINE; CBR;
D O I
10.1016/j.artmed.2010.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: The paper studies the new learning technique called cost-sensitive case-based reasoning (CSCBR) incorporating unequal misclassification cost into CBR model. Conventional CBR is now considered as a suitable technique for diagnosis, prognosis and prescription in medicine. However it lacks the ability to reflect asymmetric misclassification and often assumes that the cost of a positive diagnosis (an illness) as a negative one (no illness) is the same with that of the opposite situation. Thus, the objective of this research is to overcome the limitation of conventional CBR and encourage applying CBR to many real world medical cases associated with costs of asymmetric misclassification errors. Methods: The main idea involves adjusting the optimal cut-off classification point for classifying the absence or presence of diseases and the cut-off distance point for selecting optimal neighbors within search spaces based on similarity distribution. These steps are dynamically adapted to new target cases using a genetic algorithm. We apply this proposed method to five real medical datasets and compare the results with two other cost-sensitive learning methods-C5.0 and CART. Results: Our finding shows that the total misclassification cost of CSCBR is lower than other cost-sensitive methods in many cases. Even though the genetic algorithm has limitations in terms of unstable results and over-fitting training data, CSCBR results with GA are better overall than those of other methods. Also the paired t-test results indicate that the total misclassification cost of CSCBR is significantly less than C5.0 and CART for several datasets. Conclusion: We have proposed a new CBR method called cost-sensitive case-based reasoning (CSCBR) that can incorporate unequal misclassification costs into CBR and optimize the number of neighbors dynamically using a genetic algorithm. It is meaningful not only for introducing the concept of cost-sensitive learning to CBR, but also for encouraging the use of CBR in the medical area. The result shows that the total misclassification costs of CSCBR do not increase in arithmetic progression as the cost of false absence increases arithmetically, thus it is cost-sensitive. We also show that total misclassification costs of CSCBR are the lowest among all methods in four datasets out of five and the result is statistically significant in many cases. The limitation of our proposed CSCBR is confined to classify binary cases for minimizing misclassification cost because our proposed CSCBR is originally designed to classify binary case. Our future work extends this method for multi-classification which can classify more than two groups. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:133 / 145
页数:13
相关论文
共 50 条
  • [1] Hybrid case-based reasoning system by cost-sensitive neural network for classification
    Biswas, Saroj Kr
    Chakraborty, Manomita
    Singh, Heisnam Rohen
    Devi, Debashree
    Purkayastha, Biswajit
    Das, Akhil Kr
    [J]. SOFT COMPUTING, 2017, 21 (24) : 7579 - 7596
  • [2] Hybrid case-based reasoning system by cost-sensitive neural network for classification
    Saroj Kr. Biswas
    Manomita Chakraborty
    Heisnam Rohen Singh
    Debashree Devi
    Biswajit Purkayastha
    Akhil Kr. Das
    [J]. Soft Computing, 2017, 21 : 7579 - 7596
  • [3] New knowledge extraction technique using probability for case-based reasoning: application to medical diagnosis
    Park, YJ
    Kim, BC
    Chun, SH
    [J]. EXPERT SYSTEMS, 2006, 23 (01) : 2 - 20
  • [4] Cost-sensitive classification based on Bregman divergences for medical diagnosis
    Santos-Rodriguez, Raul
    Garcia-Garcia, Dario
    Cid-Sueiro, Jesus
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2009, : 551 - 556
  • [5] Construction Cost Estimation Using a Case-Based Reasoning Hybrid Genetic Algorithm Based on Local Search Method
    Jung, Sangsun
    Pyeon, Jae-Ho
    Lee, Hyun-Soo
    Park, Moonseo
    Yoon, Inseok
    Rho, Juhee
    [J]. SUSTAINABILITY, 2020, 12 (19)
  • [6] Cost-sensitive fuzzy classification for medical diagnosis
    Schaefer, G.
    Nakashima, T.
    Yokota, Y.
    Ishibuchi, H.
    [J]. 2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2007, : 312 - +
  • [7] Case-Based Reasoning applied to medical diagnosis and treatment
    Blanco, Xiomara
    Rodríguez, Sara
    Corchado, Juan M.
    Zato, Carolina
    [J]. Advances in Intelligent Systems and Computing, 2013, 217 : 137 - 146
  • [8] A Case-Based Reasoning system for complex medical diagnosis
    Chattopadhyay, Subhagata
    Banerjee, Suvendu
    Rabhi, Fethi A.
    Acharya, U. Rajendra
    [J]. EXPERT SYSTEMS, 2013, 30 (01) : 12 - 20
  • [9] Cost-Sensitive Clustering for Uncertain Data Based on Genetic Algorithm
    Liu, C. Y.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 40 (10): : 161 - 169
  • [10] Conversational Case-Based Reasoning in Medical Classification and Diagnosis
    McSherry, David
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2009, 5651 : 116 - 125