Prediction of benign and malignant breast cancer using data mining techniques

Cited by: 94
Authors
Chaurasia, Vikas [1 ]
Pal, Saurabh [1 ]
Tiwari, B. B. [2 ]
Affiliations
[1] VBS Purvanchal Univ, Dept MCA, Jaunpur, UP, India
[2] VBS Purvanchal Univ, Fac Engn & Technol, Dept ECE, Jaunpur, India
Keywords
Breast cancer; data mining; Naive Bayes; RBF Network; J48;
DOI
10.1177/1748301818756225
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Breast cancer is the second most common cancer in women. Around 1.1 million cases were recorded in 2004. Observed rates of this cancer increase with industrialization and urbanization, and also with the availability of facilities for early detection. It remains much more common in high-income countries but is now increasing rapidly in middle- and low-income countries, including in Africa, much of Asia, and Latin America. Breast cancer is fatal in under half of all cases and is the leading cause of death from cancer in women, accounting for 16% of all cancer deaths worldwide. The objective of this research paper is to report on prediction models for breast cancer survivability, developed by taking advantage of available technological advancements. We used three popular data mining algorithms (Naive Bayes, RBF Network, J48) to develop the prediction models using a large dataset (683 breast cancer cases). We also used 10-fold cross-validation to obtain an unbiased estimate of the performance of the three prediction models for comparison purposes. The results (based on average accuracy on the breast cancer dataset) indicated that Naive Bayes is the best predictor, with 97.36% accuracy on the holdout sample (this prediction accuracy is better than any reported in the literature); RBF Network came second with 96.77% accuracy, and J48 came third with 93.41% accuracy.
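The evaluation procedure named in the abstract, 10-fold cross-validation of a Naive Bayes classifier, can be illustrated with a minimal sketch. This is not the authors' implementation: the Gaussian likelihood, the helper names (`make_toy_data`, `k_fold_indices`, `fit_gaussian_nb`, `predict`, `cross_val_accuracy`), and the synthetic two-class data are all assumptions made for illustration. The 683-case dataset and the RBF Network and J48 models are not reproduced here.

```python
import math
import random
from collections import defaultdict


def make_toy_data(n, seed=0):
    """Synthetic two-class data (an assumption, NOT the paper's dataset)."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2                      # alternate classes: balanced data
        center = 2.0 if label == 0 else 7.0
        X.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
        y.append(label)
    return X, y


def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and split the indices into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def fit_gaussian_nb(X, y):
    """Estimate a log prior plus per-feature mean and variance per class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        cols = list(zip(*rows))
        means = [sum(col) / n for col in cols]
        # A small constant keeps the variance strictly positive.
        variances = [sum((v - m) ** 2 for v in col) / n + 1e-9
                     for col, m in zip(cols, means)]
        model[label] = (math.log(n / len(X)), means, variances)
    return model


def predict(model, row):
    """Return the class with the highest log posterior for one row."""
    def log_posterior(label):
        log_prior, means, variances = model[label]
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * var) - (x - m) ** 2 / (2 * var)
            for x, m, var in zip(row, means, variances))
    return max(model, key=log_posterior)


def cross_val_accuracy(X, y, k=10, seed=0):
    """Mean accuracy over k folds, each held out once for testing."""
    scores = []
    for test_idx in k_fold_indices(len(X), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = fit_gaussian_nb([X[i] for i in train_idx],
                                [y[i] for i in train_idx])
        correct = sum(predict(model, X[i]) == y[i] for i in test_idx)
        scores.append(correct / len(test_idx))
    return sum(scores) / len(scores)


if __name__ == "__main__":
    X, y = make_toy_data(200, seed=1)
    print(f"mean 10-fold accuracy: {cross_val_accuracy(X, y):.4f}")
```

Each record is used for testing exactly once and for training in the other nine folds, so the averaged accuracy does not depend on a single train/test split. The value printed for the synthetic data has no bearing on the accuracies reported in the abstract.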
Pages: 119-126
Page count: 8
Related papers
50 items in total
  • [41] Breast cancer prediction and detection using data mining classification algorithms: A comparative study
    Kaya Keleş, Mümine
    [J]. Tehnicki Vjesnik, 2019, 26 (01) : 149 - 155
  • [42] The comparisons of prognostic indexes using data mining techniques and Cox regression analysis in the breast cancer data
    Ture, Mevlut
    Tokatli, Fusun
    Omurlu, Imran Kurt
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 8247 - 8254
  • [43] Chronic Heart Disease Prediction Using Data Mining Techniques
    Nalluri, Sravani
    Saraswathi, R. Vijaya
    Ramasubbareddy, Somula
    Govinda, K.
    Swetha, E.
    [J]. DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 903 - 912
  • [44] Prediction of Crop Production in India Using Data Mining Techniques
    Jambekar, Suvidha
    Nema, Shikha
    Saquib, Zia
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [45] Survey and prediction of the ionospheric scintillation using data mining techniques
    Rezende, L. F. C.
    de Paula, E. R.
    Stephany, S.
    Kantor, I. J.
    Muella, M. T. A. H.
    de Siqueira, P. M.
    Correa, K. S.
    [J]. SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2010, 8
  • [46] A Review on Consumer Behavior Prediction using Data Mining Techniques
    Kareena
    Kapoor, Nitika
    [J]. PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 1089 - 1093
  • [47] Dengue fever prediction modelling using data mining techniques
    Buathong, Wipawan
    Jarupunphol, Pita
    [J]. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2021, 25 (1-2) : 103 - 127
  • [48] Prediction of Traffic-Violation Using Data Mining Techniques
    Amiruzzaman, Md
    [J]. PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2018, VOL 1, 2019, 880 : 283 - 297
  • [49] Rainfall Prediction in Lahore City using Data Mining Techniques
    Aftab, Shabib
    Ahmad, Munir
    Hameed, Noureen
    Bashir, Muhammad Salman
    Ali, Iftikhar
    Nawaz, Zahid
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (04) : 254 - 260
  • [50] Suicide Prediction in Twitter Data using Mining Techniques: A Survey
    Kumar, E. Rajesh
    Rao, A. K. V. S. N. Rama
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2019), 2019, : 122 - 131