Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions

被引:12
|
作者
Kamal, Abu H. M. [1 ]
Zhu, Xingquan [1 ]
Narayanan, Ramaswamy [2 ]
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
[2] Florida Atlantic Univ, Dept Comp & Biochem, Boca Raton, FL 33431 USA
关键词
D O I
10.1109/IJCBS.2009.117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microarray expression data, which contain expression levels of a large number of simultaneously observed genes, have been used in many scientific research and clinical studies. Due to its high dimensionalities, selecting a small number of genes has shown to be beneficial for tasks such as building prediction models for molecular classification of cancers. Traditional gene selection methods, however, fail to take the sample distributions into consideration for gene selection. Due to the scarcity of the samples, in Biomedical research it is very common to have severely biased data distributions with one class of examples (e.g., diseased samples) significantly less than other classes (e.g., normal samples). Sample sets with biased distributions require special attention for identifying genes responsible for particular disease In this paper, we propose three filtering techniques, Higher Weight (HW), Differential Minority Repeat (DMR) and Balanced Minority Repeat (BMR), to identify genes relevant to fatal diseases for biased microarray expression data. Experimental comparisons with the traditional ReliefF method on five microarray datasets demonstrate the effectiveness of the proposed methods in selecting informative genes from microarray expression data with biased sample distributions.
引用
收藏
页码:3 / +
页数:2
相关论文
共 50 条
  • [31] Gene selection and gene identification in Microarray data analysis
    Chen, J. J.
    Zou, W.
    Chang, C-W
    Morris, S. M.
    [J]. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2008, 49 (07) : 558 - 558
  • [32] Variable selection and pattern recognition with gene expression data generated by the microarray technology
    Szabo, A
    Boucher, K
    Carroll, WL
    Klebanov, LB
    Tsodikov, AD
    Yakovlev, AY
    [J]. MATHEMATICAL BIOSCIENCES, 2002, 176 (01) : 71 - 98
  • [33] Feature selection methods in microarray gene expression data: a systematic mapping study
    Vahmiyan, Mahnaz
    Kheirabadi, Mohammadtaghi
    Akbari, Ebrahim
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (22): : 19675 - 19702
  • [34] A Top-r Feature Selection Algorithm for Microarray Gene Expression Data
    Sharma, Alok
    Imoto, Seiya
    Miyano, Satoru
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (03) : 754 - 764
  • [35] Feature Selection in Microarray Gene Expression Data Using Fisher Discriminant Ratio
    Sarbazi-Azad, Saeed
    Abadeh, Mohammad Saniee
    Abadi, Mehdi Irannejad Najaf
    [J]. 2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2018, : 225 - 230
  • [36] Combination of Feature Selection Methods for the Effective Classification of Microarray Gene Expression Data
    Sheela, T.
    Rangarajan, Lalitha
    [J]. RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 137 - 145
  • [37] A New hybrid Feature selection-Classification model to Improve Cancer Sample Classification Accuracy in Microarray Gene Expression Data
    Bandyopadhyay, Ritaban
    Sharma, Arijt Das
    Dasgupta, Bidya
    Ghosh, Ankita
    Das, Chandra
    Bose, Shilpi
    [J]. 2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE, 2023,
  • [38] Virtual gene: A gene selection algorithm for sample classification on microarray datasets
    Xu, X
    Zhang, AD
    [J]. COMPUTATIONAL SCIENCE - ICCS 2005, PT 2, 2005, 3515 : 1038 - 1045
  • [39] Hierarchical approach to the optimal gene selection for cancer recognition on the basis of microarray gene expression data
    Wilinski, Artur
    Osowski, Stanislaw
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2009, 85 (04): : 50 - 52
  • [40] An expert system to classify microarray gene expression data using gene selection by decision tree
    Horng, Jorng-Tzong
    Wu, Li-Cheng
    Liu, Baw-Juine
    Kuo, Jun-Li
    Kuo, Wen-Horng
    Zhang, Jin-Jian
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (05) : 9072 - 9081