Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines

被引:10
|
作者
Houssein, Essam H. [1 ]
Hassan, Hager N. [1 ]
Al-Sayed, Mustafa M. [1 ]
Nabil, Emad [2 ,3 ]
机构
[1] Minia Univ, Fac Comp & Informat, Al Minya, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
[3] Islamic Univ Madinah, Fac Comp Sci & Informat Syst, Madinah, Saudi Arabia
关键词
Microarray; Gene expression; Gene selection; Cancer classification; Feature selection; Manta Ray Foraging Optimization algorithm; Support vector machines; Minimum Redundancy Maximum Relevance; PARTICLE SWARM OPTIMIZATION; EFFICIENT FEATURE-SELECTION; FEATURE SUBSET-SELECTION; RANDOM SUBSPACE METHOD; HIGH-DIMENSIONAL DATA; MOLECULAR CLASSIFICATION; MUTUAL INFORMATION; SVM-RFE; ALGORITHM; TUMOR;
D O I
10.1007/s13369-021-06102-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In DNA microarray applications, many techniques are proposed for cancer classification in order to detect normal and cancerous humans or classify different types of cancers. Gene selection is usually required as a preliminary step for a cancer classification problem. This step aims to select the most informative genes among a great number of genes, which represent an important issue. Although many studies have been proposed to address this issue, they lack getting the most informative and fewest number of genes with the highest accuracy and little effort from the high dimensionality of microarray datasets. Manta ray foraging optimization(MRFO) algorithm is a new meta-heuristic algorithm that mimics the nature of manta ray fishes in food foraging. MRFO has achieved promising results in other fields, such as solar generating units. Due to the high accuracy results of the support vector machines (SVM), it is the most commonly used classification algorithm in cancer studies, especially with microarray data. For exploiting the pros of both algorithms (i.e., MRFO and SVM), in this paper, a hybrid algorithm is proposed to select the most predictive and informative genes for cancer classification. A binary microarray dataset, which includes colon and leukemia1, and a multi-class microarray dataset that includes SRBCT, lymphoma, and leukemia2, are used to evaluate the accuracy of the proposed technique. Like other optimization techniques, MRFO suffers from some problems related to the high dimensionality and complexity of the microarray data. For solving such problems as well as improving the performance, the minimum redundancy maximum relevance (mRMR) method is used as a preprocessing stage. The proposed technique has been evaluated compared to the most common cancer classification algorithms. The experimental results show that our proposed technique achieves the highest accuracy with the fewest number of informative genes and little effort.
引用
收藏
页码:2555 / 2572
页数:18
相关论文
共 50 条
  • [31] Extraction of the cancer information from microarray of gene expression using Support Vector Machines
    Wilinski, A
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS IV, 2006, 6159
  • [32] Feature selection using adaptive manta ray foraging optimization for brain tumor classification
    Neetha, K. S.
    Narayan, Dayanand Lal
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [33] Gene selection and classification using non-linear kernel support vector machines based on gene expression data
    Zhang Qizhong
    2007 IEEE/ICME INTERNATIONAL CONFERENCE ON COMPLEX MEDICAL ENGINEERING, VOLS 1-4, 2007, : 1606 - 1611
  • [34] Combining One-Class Support Vector Machines for Microarray Classification
    Krawczyk, Bartosz
    2013 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2013, : 83 - 89
  • [35] On the parameter optimization of Support Vector Machines for binary classification
    Gaspar, Paulo
    Carbonell, Jaime
    Luis Oliveira, Jose
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2012, 9 (03)
  • [36] Stochastic Optimization Algorithms for Support Vector Machines Classification
    Bartkute-Norkuniene, Vaida
    INFORMATICA, 2009, 20 (02) : 173 - 186
  • [37] Geographical Classification of Tannat Wines Based on Support Vector Machines and Feature Selection
    Costa, Nattane Luiza
    Garcia Llobodanin, Laura Andrea
    Castro, Inar Alves
    Barbosa, Rommel
    BEVERAGES, 2018, 4 (04):
  • [38] An optimized features selection approach based on Manta Ray Foraging Optimization (MRFO) method for parasite malaria classification
    Amin, Javeria
    Sharif, Muhammad
    Mallah, Ghulam Ali
    Fernandes, Steven L.
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [39] Efficient parameter selection for support vector machines in classification and regression via model-based global optimization
    Fröhlich, H
    Zell, A
    Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 1431 - 1436
  • [40] Brain Tumours Classification Using Support Vector Machines Based on Feature Selection by Binary Cat Swarm Optimization
    Hassan, Wid Ali
    Ali, Yossra Hussain
    Ibrahim, Nuha Jameel
    EMERGING TECHNOLOGY TRENDS IN INTERNET OF THINGS AND COMPUTING, TIOTC 2021, 2022, : 108 - 121