Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions

被引:12
|
作者
Kamal, Abu H. M. [1 ]
Zhu, Xingquan [1 ]
Narayanan, Ramaswamy [2 ]
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
[2] Florida Atlantic Univ, Dept Comp & Biochem, Boca Raton, FL 33431 USA
关键词
D O I
10.1109/IJCBS.2009.117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microarray expression data, which contain expression levels of a large number of simultaneously observed genes, have been used in many scientific research and clinical studies. Due to its high dimensionalities, selecting a small number of genes has shown to be beneficial for tasks such as building prediction models for molecular classification of cancers. Traditional gene selection methods, however, fail to take the sample distributions into consideration for gene selection. Due to the scarcity of the samples, in Biomedical research it is very common to have severely biased data distributions with one class of examples (e.g., diseased samples) significantly less than other classes (e.g., normal samples). Sample sets with biased distributions require special attention for identifying genes responsible for particular disease In this paper, we propose three filtering techniques, Higher Weight (HW), Differential Minority Repeat (DMR) and Balanced Minority Repeat (BMR), to identify genes relevant to fatal diseases for biased microarray expression data. Experimental comparisons with the traditional ReliefF method on five microarray datasets demonstrate the effectiveness of the proposed methods in selecting informative genes from microarray expression data with biased sample distributions.
引用
收藏
页码:3 / +
页数:2
相关论文
共 50 条
  • [41] Hierarchical approach to the optimal gene selection for cancer recognition on the basis of microarray gene expression data
    Wilinski, Artur
    Osowski, Stanislaw
    [J]. PRZEGLAD ELEKTROTECHNICZNY, 2009, 85 (04): : 50 - 52
  • [42] Statistical Class Prediction Method for Efficient Microarray Gene Expression Data Sample Classification
    Sheela, T.
    Rangarajan, Lalitha
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 73 - 78
  • [43] Analysis of microarray gene expression data
    Pham, Tuan D.
    Wells, Christine
    Crane, Denis I.
    [J]. CURRENT BIOINFORMATICS, 2006, 1 (01) : 37 - 53
  • [44] Visualization of microarray gene expression data
    Prasad, Tangirala Venkateswara
    Ahson, Syed Ismail
    [J]. BIOINFORMATION, 2006, 1 (04) : 141 - 145
  • [45] Efficient gene selection for classification of microarray data
    Ho, SY
    Lee, CC
    Chen, HM
    Huang, HL
    [J]. 2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 1753 - 1760
  • [46] Analyzing microarray gene expression data
    Lewin, A
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2005, 168 : 876 - 877
  • [47] Microarray gene expression data analysis
    Vachtsevanos, G
    Ding, YH
    Fairley, JA
    Gardner, AB
    Simeonova, P
    [J]. 2004 2ND IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: MACRO TO NANO, VOLS 1 AND 2, 2004, : 105 - 108
  • [48] A stable gene selection in microarray data analysis
    Yang, Kun
    Cai, Zhipeng
    Li, Jianzhong
    Lin, Guohui
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [49] Filtering for Improved Gene Selection on Microarray Data
    Canul-Reich, Juana
    Hall, Lawrence O.
    Goldgof, Dmitry
    Eschrich, Steven A.
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010, : 3250 - 3257
  • [50] A stable gene selection in microarray data analysis
    Kun Yang
    Zhipeng Cai
    Jianzhong Li
    Guohui Lin
    [J]. BMC Bioinformatics, 7