A novel parallel feature rank aggregation algorithm for gene selection applied to microarray data classification

被引:0
|
作者
Longkumer, Imtisenla [1 ]
Mazumder, Dilwar Hussain [1 ]
机构
[1] Natl Inst Technol Nagaland, Dimapur 797103, Nagaland, India
关键词
Parallel rank aggregation; Gene selection; Feature ranking; Microarray cancer prediction; PREDICTION; TUMOR;
D O I
10.1016/j.compbiolchem.2024.108182
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microarray data often comprises numerous genes, yet not all genes are relevant for predicting cancer. Feature selection becomes a crucial step to reduce the high dimensionality in these kinds of data. While no single feature selection method consistently outperforms others across diverse domains, the combination of multiple feature selectors or rankers tends to produce more effective results compared to relying on a single ranker alone. However, this approach can be computationally expensive, particularly when handling a large quantity of features. Hence, this paper presents a parallel feature rank aggregation that utilizes borda count as the rank aggregator. The concept of vertically partitioning the data along feature space was adapted to ease the parallel execution of the aggregation task. Features were selected based on the final aggregated rank list, and their classification performances were evaluated. The model's execution time was also observed across multiple worker nodes of the cluster. The experiment was conducted on six benchmark microarray datasets. The results show the capability of the proposed distributed framework compared to the sequential version in all the cases. It also illustrated the improved accuracy performance of the proposed method and its ability to select a minimal number of genes.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] A novel feature selection method for microarray data classification based on hidden Markov model
    Momenzadeh, Mohammadreza
    Sehhati, Mohammadreza
    Rabbani, Hossein
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 95
  • [42] Improving feature subset selection using a genetic algorithm for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Zhang, Yanqing
    Bourgeois, Anu G.
    [J]. 2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 2514 - 2519
  • [43] Hybrid feature selection using micro genetic algorithm on microarray gene expression data
    Pragadeesh, C.
    Jeyaraj, Rohana
    Siranjeevi, K.
    Abishek, R.
    Jeyakumar, G.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (03) : 2241 - 2246
  • [44] Gene Microarray Cancer Classification using Correlation Based Feature Selection Algorithm and Rules Classifiers
    Al-Batah, Mohammad
    Zaqaibeh, Belal
    Alomari, Saleh Ali
    Alzboon, Mowafaq Salem
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2019, 15 (08) : 62 - 73
  • [45] A Novel Feature Selection Algorithm using Particle Swarm Optimization for Cancer Microarray Data
    Sahu, Barnali
    Mishra, Debahuti
    [J]. INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 27 - 31
  • [46] A Novel Kernel-based Gene Selection and Classification Scheme for Microarray Data
    Huang, Hsiao-Yun
    Chang, Hui-Yi
    Liu, Jeng-Fu
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1679 - 1683
  • [47] Gene selection for microarray data classification using a novel ant colony optimization
    Tabakhi, Sina
    Najafi, Ali
    Ranjbar, Reza
    Moradi, Parham
    [J]. NEUROCOMPUTING, 2015, 168 : 1024 - 1036
  • [48] A hybrid feature selection model based on improved squirrel search algorithm and rank aggregation using fuzzy techniques for biomedical data classification
    Nagarajan, Gayathri
    Babu, L. D. Dhinesh
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2021, 10 (01):
  • [49] A hybrid feature selection model based on improved squirrel search algorithm and rank aggregation using fuzzy techniques for biomedical data classification
    Gayathri Nagarajan
    L. D. Dhinesh Babu
    [J]. Network Modeling Analysis in Health Informatics and Bioinformatics, 2021, 10
  • [50] A Projected Feature Selection Algorithm for Data Classification
    Yin, Zhiwu
    Huang, Shangteng
    [J]. 2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 3665 - 3668