Data mining for gene expression profiles from DNA, microarray

被引:14
|
作者
Cho, SB [1 ]
Won, HH [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
biological data mining; feature selection; classification; gene expression profile; MLP; KNN; SVM; SASOM; ensemble classifier;
D O I
10.1142/S0218194003001469
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Microarray technology has supplied a large volume of data, which changes many problems in biology into the problems of computing. As a result techniques for extracting useful information from the data are developed. In particular, microarray technology has been applied to prediction and diagnosis of cancer, so that it expectedly helps us to exactly predict and diagnose cancer. To precisely classify cancer we have to select genes related to cancer because the genes extracted from microarray have many noises. In this paper, we attempt to explore seven feature selection methods and four classifiers and propose ensemble classifiers in three benchmark datasets to systematically evaluate the performances of the feature selection methods and machine learning classifiers. Three benchmark datasets axe leukemia cancer dataset, colon cancer dataset and lymphoma cancer data set. The methods to combine the classifiers are majority voting, weighted voting, and Bayesian approach to improve the performance of classification. Experimental results show that the ensemble with several basis classifiers produces the best recognition rate on the benchmark datasets.
引用
收藏
页码:593 / 608
页数:16
相关论文
共 50 条
  • [32] Mining microarray gene expression data with unsupervised possibilistic clustering and proximity graphs
    Romdhane, L. B.
    Shili, H.
    Ayeb, B.
    [J]. APPLIED INTELLIGENCE, 2010, 33 (02) : 220 - 231
  • [33] Mining microarray gene expression data with unsupervised possibilistic clustering and proximity graphs
    L. B. Romdhane
    H. Shili
    B. Ayeb
    [J]. Applied Intelligence, 2010, 33 : 220 - 231
  • [34] Oligonucleotide microarray data mining: search for age-dependent gene expression
    Kirschner, M
    Pujol, G
    Radu, A
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2002, 298 (05) : 772 - 778
  • [35] A public repository for DNA microarray-based gene expression data
    Alvis Brazma
    Alan Robinson
    Jaak Vilo
    [J]. Nature Genetics, 1999, 23 (Suppl 3) : 34 - 34
  • [36] Microarray data mining using gene ontology
    Li, Songhui
    Becich, Michael J.
    [J]. Studies in Health Technology and Informatics, 2004, 107 : 778 - 782
  • [37] Microarray data mining using gene ontology
    Li, SH
    Becich, MJ
    Gilbertson, J
    [J]. MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 778 - 782
  • [38] Mining microarray expression data by literature profiling
    Damien Chaussabel
    Alan Sher
    [J]. Genome Biology, 3 (10):
  • [39] Mining microarray expression data by literature profiling
    Chaussabel, Damien
    Sher, Alan
    [J]. GENOME BIOLOGY, 2002, 3 (10):
  • [40] Mining maximal local conserved gene clusters from microarray data
    Zhao, Yuhai
    Wang, Guoren
    Yin, Ying
    Xu, Guangyu
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 356 - 363