A New hybrid Feature selection-Classification model to Improve Cancer Sample Classification Accuracy in Microarray Gene Expression Data

被引:1
|
作者
Bandyopadhyay, Ritaban [1 ]
Sharma, Arijt Das [1 ]
Dasgupta, Bidya [1 ]
Ghosh, Ankita [1 ]
Das, Chandra [1 ]
Bose, Shilpi [1 ]
机构
[1] Netaji Subhash Engn Coll, Dept CSE, Kolkata, India
关键词
DNA Microarray Technology; Gene expression data; Feature selection; Mutual information; TLBO; Classification; Bagging; RHEUMATOID-ARTHRITIS; PREDICTION; PATTERNS;
D O I
10.1109/ICCECE51049.2023.10085390
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning techniques are one kind of techniques of Artificial Intelligence that enables systems to learn and improve from data without being explicitly programmed. Machine learning techniques are widely used in medical applications since it has the property to detect inherent patterns from large and complex datasets. Cancer classification based on bio molecular gene expression data is a very crucial topic for medical science as it helps to improve the diagnostic accuracy of cancer samples and is very useful in cancer sample detection and prognosis. But the traditional classifiers performance vitiates due to presence of high feature dimensionality and class imbalance problem present in microarray data. So, in this research work, a new computer aided diagnostic tool is being proposed for cancer sample classification based on bio molecular gene expression data. This tool called MI-TLBO-EB operates in two phases. The first phase selects the best features from the dataset using mutual information and teaching learning based optimization algorithm named MI-TLBO algorithm and the second phase classifies the cancer samples with the help of an extended version of bagging. The proposed model is advantageous in many ways. It helps to curb the curse of higher dimensionality and increases the classification accuracy via handling class imbalance problem with the help of bagging model. The model is applied on different high dimensional microarray gene expression datasets for cancer sample classification and from the experimental results, it has been found that the generalization performance/testing accuracy of the proposed hybrid model is significantly better compared to other well-known existing models.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Lokeswari Venkataramana
    Shomona Gracia Jacob
    Rajavel Ramadoss
    Dodda Saisuma
    Dommaraju Haritha
    Kunthipuram Manoja
    [J]. Genes & Genomics, 2019, 41 : 1301 - 1313
  • [2] Improving classification accuracy of cancer types using parallel hybrid feature selection on microarray gene expression data
    Venkataramana, Lokeswari
    Jacob, Shomona Gracia
    Ramadoss, Rajavel
    Saisuma, Dodda
    Haritha, Dommaraju
    Manoja, Kunthipuram
    [J]. GENES & GENOMICS, 2019, 41 (11) : 1301 - 1313
  • [3] A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification
    Almugren, Nada
    Alshamlan, Hala
    [J]. IEEE ACCESS, 2019, 7 : 78533 - 78548
  • [4] Feature Selection for Cancer Classification on Microarray Expression Data
    Hsu, Hui-Huang
    Lu, Ming-Da
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, PROCEEDINGS, 2008, : 153 - 158
  • [5] Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance
    Potharaju, Sai Prasad
    Sreedevi, M.
    [J]. CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2019, 7 (02): : 171 - 176
  • [6] Feature selection methods on gene expression microarray data for cancer classification: A systematic review
    Alhenawi, Esra'a
    Al-Sayyed, Rizik
    Hudaib, Amjad
    Mirjalili, Seyedali
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 140
  • [7] A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data
    Wang, Hong
    Jing, Xingjian
    Niu, Ben
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 126 : 8 - 19
  • [8] Hybrid Feature Selection Algorithm mRMR-ICA for Cancer Classification from Microarray Gene Expression Data
    Wang, Shuaiqun
    Kong, Wei
    Aorigele
    Deng, Jin
    Gao, Shangce
    Zeng, Weiming
    [J]. COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (06) : 420 - 430
  • [9] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [10] A hybrid feature selection algorithm for gene expression data classification
    Lu, Huijuan
    Chen, Junying
    Yan, Ke
    Jin, Qun
    Xue, Yu
    Gao, Zhigang
    [J]. NEUROCOMPUTING, 2017, 256 : 56 - 62