Integration of microarray data for a comparative study of classifiers and identification of marker genes

被引:0
|
作者
Berrar, D [1 ]
Sturgeon, B [1 ]
Bradbury, I [1 ]
Downes, CS [1 ]
Dubitzky, W [1 ]
机构
[1] Univ Ulster, Sch Biomed Sci, Coleraine BT52 1SA, Londonderry, North Ireland
关键词
microarray; lung cancer; survival analysis; machine learning;
D O I
10.1007/0-387-23077-7_12
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Novel diagnostic tools promise the development of patient- tailored cancer treatment. However, one major step towards individualized therapy is to use a combination of various data sources, e.g. transcriptomic, proteomic, and clinical data. We have integrated clinical data and lung cancer microarray data that were generated on two different oligonucleotide platforms. We were interested in the question whether the prediction of survival outcome benefits from the integration of clinical and transcriptomic data. In addition, we attempted to identify those genes whose expression profiles correlate with survival outcome. We applied five machine learning techniques to predict survival risk groups, and we compared the models with respect to their performance and general user acceptance. Based on quantitative and qualitative evaluation criteria, we chose decision trees as the most relevant technique for this type of analysis. Our in silico analysis corroborates the role of numerous marker genes already described in lung adenocarcinomas. In addition, our study reveals a set of highly interesting genes whose expression profiles correlate with genetic risk groups of unexpected survival outcomes.
引用
收藏
页码:147 / 162
页数:16
相关论文
共 50 条
  • [1] Identification of marker genes in diabetic wounds by DNA microarray study
    Ni, T.
    Wang, N.
    Mao, Z. G.
    Yao, M.
    [J]. GENETICS AND MOLECULAR RESEARCH, 2013, 12 (04): : 5348 - 5355
  • [2] Genes Selection Comparative Study in Microarray Data Analysis
    Kaissi, Ouafae
    Nimpaye, Eric
    Singh, Tiratha Raj
    Vannier, Brigitte
    Ibrahimi, Azeddine
    Ghacham, Abdellatif Amrani
    Moussa, Ahmed
    [J]. BIOINFORMATION, 2013, 9 (20) : 1019 - 1022
  • [3] Robust prostate cancer marker genes emerge from direct integration of inter-study microarray data
    Xu, L
    Tan, AC
    Naiman, DQ
    Geman, D
    Winslow, RL
    [J]. BIOINFORMATICS, 2005, 21 (20) : 3905 - 3911
  • [4] A Comparative Study and Analysis of Data Mining Classifiers for Microarray based Cancer Pattern Diagnostics
    Subasree, S.
    Gopalan, N. P.
    Sakthivel, N. K.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [5] On the Comparison of Classifiers for Microarray Data
    Hanczar, Blaise
    Dougherty, Edward R.
    [J]. CURRENT BIOINFORMATICS, 2010, 5 (01) : 29 - 39
  • [6] How to extract marker genes from microarray data sets
    Schachtner, R.
    Lutter, D.
    Theis, F. J.
    Lang, E. W.
    Schmitz, G.
    Tome, A. M.
    Vilda, P. Gomez
    [J]. 2007 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-16, 2007, : 4215 - 4218
  • [7] A comparative study of classifiers on a real data set
    Visa, S
    Ralescu, A
    [J]. FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 338 - 345
  • [8] Comparative microarray data analysis for the expression of genes in the pathway of glioma
    Katara, Pramod
    Sharma, Neeru
    Sharma, Sugandha
    Khatri, Indu
    Kaushik, Akansha
    Kaushal, Lalima
    Sharma, Vinay
    [J]. BIOINFORMATION, 2010, 5 (01) : 31 - 34
  • [9] Identification of marker genes for intestinal immunomodulating effect of a fructooligosaccharide by DNA microarray analysis
    Fukasawa, Tomoyuki
    Murashima, Koichiro
    Matsumoto, Ichiro
    Hosono, Akira
    Ohara, Hiroki
    Nojiri, Chuhei
    Koga, Jinnichiro
    Kubota, Hidetoshi
    Kanegae, Minoru
    Kaminogawa, Shuichi
    Abe, Keiko
    Kono, Toshiaki
    [J]. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2007, 55 (08) : 3174 - 3179
  • [10] Identification of significant periodic genes in microarray gene expression data
    Chen, J
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)