Prediction of benign and malignant breast cancer using data mining techniques

被引:94
|
作者
Chaurasia, Vikas [1 ]
Pal, Saurabh [1 ]
Tiwari, B. B. [2 ]
机构
[1] VBS Purvanchal Univ, Dept MCA, Jaunpur, UP, India
[2] VBS Purvanchal Univ, Fac Engn & Technol, Dept ECE, Jaunpur, India
关键词
Breast cancer; data mining; Naive Bayes; RBF Network; J48;
D O I
10.1177/1748301818756225
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Breast cancer is the second most leading cancer occurring in women compared to all other cancers. Around 1.1 million cases were recorded in 2004. Observed rates of this cancer increase with industrialization and urbanization and also with facilities for early detection. It remains much more common in high-income countries but is now increasing rapidly in middle- and low-income countries including within Africa, much of Asia, and Latin America. Breast cancer is fatal in under half of all cases and is the leading cause of death from cancer in women, accounting for 16% of all cancer deaths worldwide. The objective of this research paper is to present a report on breast cancer where we took advantage of those available technological advancements to develop prediction models for breast cancer survivability. We used three popular data mining algorithms (Naive Bayes, RBF Network, J48) to develop the prediction models using a large dataset (683 breast cancer cases). We also used 10-fold cross-validation methods to measure the unbiased estimate of the three prediction models for performance comparison purposes. The results (based on average accuracy Breast Cancer dataset) indicated that the Naive Bayes is the best predictor with 97.36% accuracy on the holdout sample (this prediction accuracy is better than any reported in the literature), RBF Network came out to be the second with 96.77% accuracy, J48 came out third with 93.41% accuracy.
引用
收藏
页码:119 / 126
页数:8
相关论文
共 50 条
  • [1] Prediction of Malignant and Benign Breast Cancer: A Data Mining Approach in Healthcare Applications
    Kumar, Vivek
    Mishra, Brojo Kishore
    Mazzara, Manuel
    Thanh, Dang N. H.
    Verma, Abhishek
    [J]. ADVANCES IN DATA SCIENCE AND MANAGEMENT, 2020, 37 : 435 - 442
  • [2] A Survey on Breast Cancer Prediction Using Data Mining Techniques
    Jacob, Dona Sara
    Viswan, Rakhi
    Manju, V.
    PadmaSuresh, L.
    Raj, Shine
    [J]. 2018 CONFERENCE ON EMERGING DEVICES AND SMART SYSTEMS (ICEDSS), 2018, : 256 - 258
  • [3] Breast Cancer Prediction Using Data Mining Classification Techniques
    Kazi, Abdul Karim
    Waseemullah
    Baig, Mirza Adnan
    Khan, Shahzaib
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (09): : 696 - 704
  • [4] A Review on Prediction Of Breast Cancer Using Various Data Mining Techniques
    Deepika, M.
    Gladence, L. Mary
    Keerthana, R. Madhu
    [J]. RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (01): : 808 - 814
  • [5] Intelligent Breast Cancer Prediction Model Using Data Mining Techniques
    Shen, Runjie
    Yang, Yuanyuan
    Shao, Fengfeng
    [J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 1, 2014, : 384 - 387
  • [6] Analysis and Prediction of Breast cancer and Diabetes disease datasets using Data mining classification Techniques
    Verma, Deepika
    Mishra, Nidhi
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT SUSTAINABLE SYSTEMS (ICISS 2017), 2017, : 533 - 538
  • [7] Analysis of breast cancer using data mining & statistical techniques
    Xiong, XC
    Kim, YO
    Baek, YC
    Rhee, DW
    Kim, SH
    [J]. Sixth International Conference on Software Engineerng, Artificial Intelligence, Networking and Parallel/Distributed Computing and First AICS International Workshop on Self-Assembling Wireless Networks, Proceedings, 2005, : 82 - 87
  • [8] Using Data Mining Techniques to Support Breast Cancer Diagnosis
    Diz, Joana
    Marreiros, Goreti
    Freitas, Alberto
    [J]. NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, PT 1, 2015, 353 : 689 - 700
  • [9] A Survey on Breast Cancer Analysis Using Data Mining Techniques
    Padmapriya, B.
    Velmurugan, T.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 1234 - 1237
  • [10] An Integrated Approach for Cancer Survival Prediction Using Data Mining Techniques
    Kaur, Ishleen
    Doja, M. N.
    Ahmad, Tanvir
    Ahmad, Musheer
    Hussain, Amir
    Nadeem, Ahmed
    Abd El-Latif, Ahmed A.
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021