Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines

被引:10
|
作者
Houssein, Essam H. [1 ]
Hassan, Hager N. [1 ]
Al-Sayed, Mustafa M. [1 ]
Nabil, Emad [2 ,3 ]
机构
[1] Minia Univ, Fac Comp & Informat, Al Minya, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
[3] Islamic Univ Madinah, Fac Comp Sci & Informat Syst, Madinah, Saudi Arabia
关键词
Microarray; Gene expression; Gene selection; Cancer classification; Feature selection; Manta Ray Foraging Optimization algorithm; Support vector machines; Minimum Redundancy Maximum Relevance; PARTICLE SWARM OPTIMIZATION; EFFICIENT FEATURE-SELECTION; FEATURE SUBSET-SELECTION; RANDOM SUBSPACE METHOD; HIGH-DIMENSIONAL DATA; MOLECULAR CLASSIFICATION; MUTUAL INFORMATION; SVM-RFE; ALGORITHM; TUMOR;
D O I
10.1007/s13369-021-06102-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In DNA microarray applications, many techniques are proposed for cancer classification in order to detect normal and cancerous humans or classify different types of cancers. Gene selection is usually required as a preliminary step for a cancer classification problem. This step aims to select the most informative genes among a great number of genes, which represent an important issue. Although many studies have been proposed to address this issue, they lack getting the most informative and fewest number of genes with the highest accuracy and little effort from the high dimensionality of microarray datasets. Manta ray foraging optimization(MRFO) algorithm is a new meta-heuristic algorithm that mimics the nature of manta ray fishes in food foraging. MRFO has achieved promising results in other fields, such as solar generating units. Due to the high accuracy results of the support vector machines (SVM), it is the most commonly used classification algorithm in cancer studies, especially with microarray data. For exploiting the pros of both algorithms (i.e., MRFO and SVM), in this paper, a hybrid algorithm is proposed to select the most predictive and informative genes for cancer classification. A binary microarray dataset, which includes colon and leukemia1, and a multi-class microarray dataset that includes SRBCT, lymphoma, and leukemia2, are used to evaluate the accuracy of the proposed technique. Like other optimization techniques, MRFO suffers from some problems related to the high dimensionality and complexity of the microarray data. For solving such problems as well as improving the performance, the minimum redundancy maximum relevance (mRMR) method is used as a preprocessing stage. The proposed technique has been evaluated compared to the most common cancer classification algorithms. The experimental results show that our proposed technique achieves the highest accuracy with the fewest number of informative genes and little effort.
引用
收藏
页码:2555 / 2572
页数:18
相关论文
共 50 条
  • [21] RETRACTED: Manta Ray Foraging Optimization with Vector Quantization Based Microarray Image Compression Technique (Retracted Article)
    Alkhaldi, Nora A.
    Alsedais, Rawabi Abdulaziz Abdullah
    Halawani, Hanan T.
    Aboutaleb, Sayed M. Abdelkhalek
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [22] The application of ant colony optimization for gene selection in microarray-based cancer classification
    Chiang, Yu-Min
    Chiang, Huei-Min
    Lin, Shang-Yi
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 4001 - 4006
  • [23] Relevant vector machine based on gene pre-selection for cancer microarray expression classification
    Qiu Langbo
    Wang Zhengzhi
    Wang Guangyun
    ISTM/2007: 7TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-7, CONFERENCE PROCEEDINGS, 2007, : 2337 - 2340
  • [24] Comparing kernel functions of support vector machines for different gene selection methods in a microarray study
    Zararsiz, Gokmen
    Ozturk, Ahmet
    Elmali, Ferhan
    Kopru, Mehmet
    Cosgun, Erdal
    Karabulut, Erdem
    CURRENT OPINION IN BIOTECHNOLOGY, 2011, 22 : S59 - S59
  • [25] Grey wolf optimization based parameter selection for support vector machines
    Eswaramoorthy, Sathish
    Sivakumaran, N.
    Sekaran, Sankaranarayanan
    COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2016, 35 (05) : 1513 - 1523
  • [26] A Classification-Based Segmentation of cDNA Microarray Images using Support Vector Machines
    Giannakeas, Nikolaos
    Karvelis, Petros S.
    Fotiadis, Dimitrios I.
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 875 - +
  • [27] Two-stage gene selection for support vector machine classification of microarray data
    Xia, Xiao-Lei
    Li, Kang
    Irwin, George W.
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2009, 8 (02) : 164 - 171
  • [28] Unleashing the power of Manta Rays Foraging Optimizer: A novel approach for hyper-parameter optimization in skin cancer classification
    Adamu, Shamsuddeen
    Alhussian, Hitham
    Aziz, Norshakirah
    Abdulkadir, Said Jadid
    Alwadin, Ayed
    Abdullahi, Mujaheed
    Garba, Aliyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 99
  • [29] Feature Selection and Parameters Optimization of Support Vector Machines Based on Hybrid Glowworm Swarm Optimization for Classification of Diabetic Retinopathy
    Karthikeyan, R.
    Alli, P.
    JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (10)
  • [30] Feature Selection and Parameters Optimization of Support Vector Machines Based on Hybrid Glowworm Swarm Optimization for Classification of Diabetic Retinopathy
    R. Karthikeyan
    P. Alli
    Journal of Medical Systems, 2018, 42