Data mining for gene expression profiles from DNA microarray

Cited by: 14
Authors
Cho, SB [1]
Won, HH [1]
Affiliations
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
Keywords
biological data mining; feature selection; classification; gene expression profile; MLP; KNN; SVM; SASOM; ensemble classifier
DOI
10.1142/S0218194003001469
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Microarray technology has supplied a large volume of data, turning many problems in biology into problems of computing. As a result, techniques for extracting useful information from these data have been developed. In particular, microarray technology has been applied to the prediction and diagnosis of cancer, where it is expected to help predict and diagnose cancer accurately. To classify cancer precisely, we have to select the genes related to cancer, because the gene expression data extracted from microarrays are noisy. In this paper, we explore seven feature selection methods and four classifiers, and propose ensemble classifiers, on three benchmark datasets to systematically evaluate the performance of the feature selection methods and machine learning classifiers. The three benchmark datasets are the leukemia cancer dataset, the colon cancer dataset, and the lymphoma cancer dataset. The classifiers are combined by majority voting, weighted voting, and a Bayesian approach to improve classification performance. Experimental results show that the ensemble of several base classifiers produces the best recognition rate on the benchmark datasets.
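Two of the combination rules named in the abstract, majority voting and weighted voting, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the class labels "tumor" and "normal", and the weights are hypothetical, and the abstract does not say how the paper derives its weights. The Bayesian combination is not covered here.

```python
from collections import Counter, defaultdict


def majority_vote(predictions):
    """Return the label chosen by the largest number of classifiers.

    Ties go to the tied label that appears first in `predictions`.
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def weighted_vote(predictions, weights):
    """Return the label whose supporting classifiers have the largest total weight."""
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per prediction")
    totals = defaultdict(float)
    for label, weight in zip(predictions, weights):
        totals[label] += weight
    return max(totals, key=totals.get)


# Hypothetical outputs of three base classifiers for one sample.
predictions = ["tumor", "normal", "tumor"]
# Hypothetical weights, e.g. each classifier's validation accuracy (an assumption).
weights = [0.2, 0.9, 0.3]

print(majority_vote(predictions))
print(weighted_vote(predictions, weights))
```

With these illustrative inputs the two rules disagree: two of the three classifiers vote "tumor", but the single "normal" vote carries more weight than the other two combined, which shows why the choice of combination rule matters.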
Pages: 593-608
Number of pages: 16