Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions

被引:12
|
作者
Kamal, Abu H. M. [1 ]
Zhu, Xingquan [1 ]
Narayanan, Ramaswamy [2 ]
机构
[1] Florida Atlantic Univ, Dept Comp Sci & Engn, Boca Raton, FL 33431 USA
[2] Florida Atlantic Univ, Dept Comp & Biochem, Boca Raton, FL 33431 USA
关键词
D O I
10.1109/IJCBS.2009.117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microarray expression data, which contain expression levels of a large number of simultaneously observed genes, have been used in many scientific research and clinical studies. Due to its high dimensionalities, selecting a small number of genes has shown to be beneficial for tasks such as building prediction models for molecular classification of cancers. Traditional gene selection methods, however, fail to take the sample distributions into consideration for gene selection. Due to the scarcity of the samples, in Biomedical research it is very common to have severely biased data distributions with one class of examples (e.g., diseased samples) significantly less than other classes (e.g., normal samples). Sample sets with biased distributions require special attention for identifying genes responsible for particular disease In this paper, we propose three filtering techniques, Higher Weight (HW), Differential Minority Repeat (DMR) and Balanced Minority Repeat (BMR), to identify genes relevant to fatal diseases for biased microarray expression data. Experimental comparisons with the traditional ReliefF method on five microarray datasets demonstrate the effectiveness of the proposed methods in selecting informative genes from microarray expression data with biased sample distributions.
引用
收藏
页码:3 / +
页数:2
相关论文
共 50 条
  • [1] The Impact of Gene Selection on Imbalanced Microarray Expression Data
    Kamal, Abu H. M.
    Zhu, Xingquan
    Pandya, Abhijit S.
    Hsu, Sam
    Shoaib, Muhammad
    [J]. BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 259 - 269
  • [2] Embedded Gene Selection for Imbalanced Microarray Data Analysis
    Li, Guo-Zheng
    Meng, Hao-Hua
    Ni, Jun
    [J]. 2008 INTERNATIONAL MULTISYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS), 2008, : 17 - +
  • [3] Sample selection for microarray gene expression studies
    Repsilber, D
    Fink, L
    Jacobsen, M
    Bläsing, O
    Ziegler, A
    [J]. METHODS OF INFORMATION IN MEDICINE, 2005, 44 (03) : 461 - 467
  • [4] Prototyping: Sample Selection for Imbalanced Data
    Schwalb, Edward
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 221 - 227
  • [5] A genetic approach for gene selection on microarray expression data
    Kim, YH
    Lee, SY
    Moon, BR
    [J]. GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2004, PT 1, PROCEEDINGS, 2004, 3102 : 346 - 355
  • [6] Stable Gene Selection from Microarray Data via Sample Weighting
    Yu, Lei
    Han, Yue
    Berens, Michael E.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (01) : 262 - 272
  • [7] Exploiting the Accumulated Evidence for Gene Selection in Microarray Gene Expression Data
    Prat-Masramon, Gabriel
    Belanche-Munoz, Lluis A.
    [J]. ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 989 - +
  • [8] Unsupervised selection of informative genes in microarray gene expression data
    Liaghat, Samaneh
    Mansoori, Eghbal G.
    [J]. INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (04) : 351 - 367
  • [9] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    [J]. COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [10] Parsimonious Selection of Useful Genes in Microarray Gene Expression Data
    Gonzalez-Navarro, Felix F.
    Belanche-Munoz, Lluis A.
    [J]. SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 45 - 55