Gene Selection in Multi-class Imbalanced Microarray Datasets Using Dynamic Length Particle Swarm Optimization

被引:3
|
作者
Priya, R. Devi [1 ]
Sivaraj, R. [2 ]
机构
[1] Kongu Engn Coll, Dept Informat Technol, Erode, Tamil Nadu, India
[2] Nandha Engn Coll, Dept Comp Sci & Engn, Erode, Tamil Nadu, India
关键词
Feature weighing; retained tomek link; dynamic PSO; apriori algorithm; microarray datasets; EXPRESSION DATA; CLASSIFICATION; ALGORITHM; PREDICTION; SYSTEM;
D O I
10.2174/1574893615999201002093834
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Microarray gene expression datasets usually contain a large number of genes that complicate further operations like classification, clustering and other kinds of analysis. During the classification process, the identification of salient genes is a brainstorming task and needs a careful selection. Methods: The classification of multi-class datasets is more critical when compared with binary classification. When there are multiple class labels, chances are more likely that the datasets are imbalanced. Large variations can be seen in the number of samples belonging to each class, and hence the classification process may go biased with incorrect samples chosen for training. There is no sufficient research work available to address all these three scenarios together in microarray datasets. Results and Discussion: The paper fills this gap with the following contributions: i) Selects salient genes for classification using multiSURF algorithm ii) Identifies right instances from imbalanced datasets using Retained Tomek Link algorithm and iii) Performs gene selection for multi-class classification using Dynamic Length Particle Swarm Optimization (DPSO). Conclusion: The proposed method is implemented on multi-class imbalanced microarray datasets, and the final classification performance is seen to be encouraging and better than other compared methods.
引用
收藏
页码:734 / 748
页数:15
相关论文
共 50 条
  • [1] Dynamic ensemble selection for multi-class imbalanced datasets
    Garcia, Salvador
    Zhang, Zhong-Liang
    Altalhi, Abdulrahman
    Alshomrani, Saleh
    Herrera, Francisco
    [J]. INFORMATION SCIENCES, 2018, 445 : 22 - 37
  • [2] Multi-Objective Particle Swarm Optimization Based Preprocessing of Multi-Class Extremely Imbalanced Datasets
    Priya, R. Devi
    Sivaraj, R.
    Abraham, Ajith
    Pravin, T.
    Sivasankar, P.
    Anitha, N.
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (05) : 735 - 755
  • [3] Multi-Class Image Annotation Approach using Particle Swarm Optimization
    Sami, Mohamed
    El-Bendary, Nashwa
    Hassanien, Aboul Ella
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 103 - 108
  • [4] Gene selection for multi-class prediction of microarray data
    Chen, DC
    Hua, D
    Reifman, J
    Cheng, XZ
    [J]. PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 492 - 495
  • [5] Cost-Sensitive Variable Selection for Multi-Class Imbalanced Datasets Using Bayesian Networks
    Ramos-Lopez, Dario
    Maldonado, Ana D.
    [J]. MATHEMATICS, 2021, 9 (02) : 1 - 15
  • [6] Investigations into Particle Swarm Optimization for Multi-class Shape Recognition
    No, Ee Lee
    Lim, Mei Kuan
    Maul, Tomas
    Lai, Weng Kin
    [J]. ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 599 - 606
  • [7] Diabetic retinopathy screening using deep learning for multi-class imbalanced datasets
    Saini, Manisha
    Susan, Seba
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 149
  • [8] Multi-class particle swarm model selection for automatic image annotation
    Jair Escalante, Hugo
    Montes, Manuel
    Enrique Sucar, L.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 11011 - 11021
  • [9] Gene Selection Using 1-Norm Regularization for Multi-Class Microarray Data
    Nan, Xiaofei
    Wang, Nan
    Gong, Ping
    Zhang, Chaoyang
    Chen, Yixin
    Wilkins, Dawn
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 520 - 524
  • [10] A gene selection algorithm for microarray cancer classification using an improved particle swarm optimization
    Nagra, Arfan Ali
    Khan, Ali Haider
    Abubakar, Muhammad
    Faheem, Muhammad
    Rasool, Adil
    Masood, Khalid
    Hussain, Muzammil
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):