Novel machine learning approach for classification of high-dimensional microarray data

Authors
Rabia Aziz Musheer
C. K. Verma
Namita Srivastava
Affiliations
[1] VIT University Bhopal, Department of SASL (Mathematics)
[2] Maulana Azad National Institute of Technology, Department of Mathematics and Computer Application
Source
Soft Computing | 2019 / Vol. 23, No. 24
Keywords
Independent component analysis (ICA); Artificial bee colony (ABC); Naïve Bayes (NB); Cancer classification
Abstract
Independent component analysis (ICA) is a powerful concept for reducing the dimension of big data in many applications. It has been used for feature extraction from microarray gene expression data in numerous works. One of the merits of ICA is that the number of extracted features is always equal to the number of samples. When ICA is applied to microarray data, however, it faces the challenge of finding the best subset of genes (features) among the extracted features. To resolve this problem, we propose a new artificial bee colony (ABC)-based feature selection approach for microarray data. Our approach has two stages: an ICA-based extraction stage that reduces the size of the data, and an ABC-based wrapper stage that optimizes the reduced feature vectors. To validate the proposed approach, extensive experiments were conducted to compare the performance of ICA + ABC with results obtained from recently published and previously suggested gene selection methods for the Naïve Bayes (NB) classifier. To compare the proposed approach with the other algorithms, a statistical hypothesis test was employed on six benchmark microarray cancer classification datasets. The experimental results show that the proposed approach improves on all of the compared algorithms for the NB classifier at a certain level of significance.
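The second stage described in the abstract, a wrapper search over the ICA-extracted features, can be illustrated with a heavily simplified binary ABC loop. This is only a sketch under stated assumptions, not the authors' implementation: the function `abc_feature_search`, its parameters, and the neighbourhood move (a single bit flip) are hypothetical choices, and `toy_fitness` is a stand-in for what would, in the paper's setting, presumably be the cross-validated accuracy of an NB classifier on the selected components.

```python
import random


def abc_feature_search(n_features, fitness, n_sources=10, limit=20, n_iter=30, seed=0):
    """Simplified binary ABC search over feature masks (illustrative only).

    fitness(mask) must return a non-negative score; higher is better.
    """
    rng = random.Random(seed)

    def random_mask():
        mask = [rng.random() < 0.5 for _ in range(n_features)]
        if not any(mask):  # never evaluate an empty feature subset
            mask[rng.randrange(n_features)] = True
        return mask

    sources = [random_mask() for _ in range(n_sources)]  # candidate subsets
    scores = [fitness(m) for m in sources]
    trials = [0] * n_sources  # consecutive failed improvements per source
    best = {"mask": None, "score": -1.0}

    def remember(i):
        if scores[i] > best["score"]:
            best["mask"], best["score"] = sources[i][:], scores[i]

    def try_neighbour(i):
        # Assumed neighbourhood move: flip one randomly chosen feature bit.
        cand = sources[i][:]
        j = rng.randrange(n_features)
        cand[j] = not cand[j]
        if not any(cand):
            trials[i] += 1
            return
        score = fitness(cand)
        if score > scores[i]:  # greedy acceptance
            sources[i], scores[i], trials[i] = cand, score, 0
            remember(i)
        else:
            trials[i] += 1

    for i in range(n_sources):
        remember(i)

    for _ in range(n_iter):
        for i in range(n_sources):  # employed bees: one move per source
            try_neighbour(i)
        weights = [s + 1e-9 for s in scores]  # onlookers favour better sources
        for i in rng.choices(range(n_sources), weights=weights, k=n_sources):
            try_neighbour(i)
        for i in range(n_sources):  # scouts: abandon exhausted sources
            if trials[i] > limit:
                sources[i] = random_mask()
                scores[i] = fitness(sources[i])
                trials[i] = 0
                remember(i)
    return best["mask"], best["score"]


# Toy stand-in for a classifier-based fitness (hypothetical): rewards picking
# the "informative" components 0, 3 and 7 while penalising larger subsets.
INFORMATIVE = {0, 3, 7}


def toy_fitness(mask):
    selected = {i for i, keep in enumerate(mask) if keep}
    hits = len(selected & INFORMATIVE)
    return hits / (1.0 + 0.1 * len(selected))


best_mask, best_score = abc_feature_search(12, toy_fitness)
print([i for i, keep in enumerate(best_mask) if keep], round(best_score, 3))
```

In a fuller pipeline, the mask would index the columns of the ICA-transformed data and `fitness` would train and score the classifier on those columns only, which is what makes the search a wrapper method rather than a filter.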
Pages: 13409-13421
Number of pages: 12