Machine Learning Model for Breast Cancer Data Analysis Using Triplet Feature Selection Algorithm

被引:7
|
作者
Dhivya, P. [1 ]
Bazilabanu, A. [1 ]
Ponniah, Thirumalaikolundusubramanian [2 ]
机构
[1] Bannari Amman Inst Technol, Dept Comp Sci & Engn, Erode, India
[2] Trichy SRM Med Coll & Res Ctr, Dept Med, Trichy, India
关键词
Accuracy; benign; correlation; logistic regression; malignant; triplet feature selection; DIAGNOSIS;
D O I
10.1080/03772063.2021.1963861
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The machine learning techniques can be used for clinical investigations in breast cancer diagnosis. The researchers investigated various machine learning algorithms, such as Support Vector Machine, Naive Bayes, Logistic Regression (LR), Random Forest, Decision Tree and K Nearest Neighbor to diagnose the disease. Early detection of breast cancer cells from the features is essential. Feature selection is the process of reducing the input features to improve the performance of the model. This research aims to increase the accuracy, sensitivity, specificity and to reduce the False Positive Rate (FPR) and False Negative Rate (FNR) by feature selection. The proposed feature selection technique is comprised of two phases: feature grouping and feature selection. In the first phase, feature grouping uses the Pearson correlation techniques to identify the correlation among the features and group the features based on high-, medium- and low- level ranking. In the second phase, Triplet Feature Selection (TFS) method has been proposed to avoid collinearity among the features. In this, the features are selected based on the correlation differences in each subset when satisfying the race condition. Finally, select the features in the triplet group and apply LR classification technique to diagnose the disease. The proposed classifier achieved an accuracy (95.4%), FPR (1%), FNR (4%), sensitivity (97%) and specificity (96%) to detect the benign and malign ones. The effects of TFS feature selection with LR classifier were used and the performance of the proposed framework was compared with the existing feature selection methods and classifiers.
引用
收藏
页码:1789 / 1799
页数:11
相关论文
共 50 条
  • [1] Novel Feature Selection Using Machine Learning Algorithm for Breast Cancer Screening of Thermography Images
    Gupta, Kumod Kumar
    Pahadiya, Pallavi
    Saxena, Shivani
    Gupta, Meenakshi
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 131 (03) : 1929 - 1956
  • [2] Novel Feature Selection Using Machine Learning Algorithm for Breast Cancer Screening of Thermography Images
    Kumod Kumar Gupta
    Pallavi Ritu Vijay
    Shivani Pahadiya
    Meenakshi Saxena
    Wireless Personal Communications, 2023, 131 : 1929 - 1956
  • [3] A Comparative Study for Breast Cancer Prediction using Machine Learning and Feature Selection
    Dhanya, R.
    Paul, Irene Rose
    Akula, Sai Sindhu
    Sivakumar, Madhumathi
    Nair, Jyothisha J.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 1049 - 1055
  • [4] Feature selection and classification in breast cancer prediction using IoT and machine learning
    Gopal, V. Nanda
    Al-Turjman, Fadi
    Kumar, R.
    Anand, L.
    Rajesh, M.
    MEASUREMENT, 2021, 178
  • [5] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Eskandar Taghizadeh
    Sahel Heydarheydari
    Alihossein Saberi
    Shabnam JafarpoorNesheli
    Seyed Masoud Rezaeijo
    BMC Bioinformatics, 23
  • [6] Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods
    Taghizadeh, Eskandar
    Heydarheydari, Sahel
    Saberi, Alihossein
    JafarpoorNesheli, Shabnam
    Rezaeijo, Seyed Masoud
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [7] Feature Selection-based Machine Learning Comparative Analysis for Predicting Breast Cancer
    Rajpoot, Chour Singh
    Sharma, Gajanand
    Gupta, Praveen
    Dadheech, Pankaj
    Yahya, Umar
    Aneja, Nagender
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [8] A feature selection using improved dragonfly algorithm with support vector machine for breast cancer prediction
    Mary, S. Roselin
    Prasad, R. Murali
    Suguna, R.
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (05): : 2039 - 2049
  • [9] ALGORITHM SELECTION AND IMPORTANCE OF MACHINE LEARNING IN PREDICTION OF BREAST CANCER
    Babu, B. Sankara
    Bethu, Srikanth
    Rao, P. S. V. Srinivasa
    Sowmya, V
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (06): : 283 - 315
  • [10] Machine learning and feature selection for the analysis of Alzheimer Metabolomics Data
    Belacel, Nabil
    Cuperlovic-Culf, Miroslava
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 222 - 226