Pattern Analysis in DNA Microarray Data through PCA-Based Gene Selection

被引:0
|
作者
Ocampo, Ricardo [1 ]
de Luna, Marco A. [1 ]
Vega, Roberto [1 ]
Sanchez-Ante, Gildardo [1 ]
Falcon-Morales, Luis E. [1 ]
Sossa, Humberto [2 ]
机构
[1] Tecnol Monterrey, Campus Guadalajara,Av Gral Ramon Corona 2514, Zapopan 45201, Jal, Mexico
[2] Inst Politecn Nacl, CIC, Mexico City 07738, DF, Mexico
关键词
CANCER CLASSIFICATION; PREDICTION; MACHINE; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
DNA microarrays is a technology that can be used to diagnose cancer and other diseases. To automate the analysis of such data, pattern recognition and machine learning algorithms can be applied. However, the curse of dimensionality is unavoidable: very few samples to train, and many attributes in each sample. As the predictive accuracy of supervised classifiers decays with irrelevant and redundant features, the necessity of a dimensionality reduction process is essential. In this paper, we propose a new methodology that is based on the application of Principal Component Analysis and other statistical tools to gain insight in the identification of relevant genes. We run the approaches using two benchmark datasets: Leukemia and Lymphoma. The results show that it is possible to reduce considerably the number of genes while increasing the performance of well known classifiers.
引用
收藏
页码:532 / 539
页数:8
相关论文
共 50 条
  • [41] Variable selection and pattern recognition with gene expression data generated by the microarray technology
    Szabo, A
    Boucher, K
    Carroll, WL
    Klebanov, LB
    Tsodikov, AD
    Yakovlev, AY
    MATHEMATICAL BIOSCIENCES, 2002, 176 (01) : 71 - 98
  • [42] A PCA-based method for ancestral informative markers selection in structured populations
    Zhang Feng
    Zhang Lei
    Deng Hong-Wen
    SCIENCE IN CHINA SERIES C-LIFE SCIENCES, 2009, 52 (10): : 972 - 976
  • [43] A PCA-based method for ancestral informative markers selection in structured populations
    Feng Zhang
    Lei Zhang
    Hong-Wen Deng
    Science in China Series C: Life Sciences, 2009, 52 : 972 - 976
  • [44] Prioritization of watershed through morphometric parameters: a PCA-based approach
    Meshram S.G.
    Sharma S.K.
    Applied Water Science, 2017, 7 (3) : 1505 - 1519
  • [45] A PCA-Based Change Detection Framework for Multidimensional Data Streams
    Qahtan, Abdulhakim
    Alharbi, Basma
    Wang, Suojin
    Zhang, Xiangliang
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 935 - 944
  • [46] Neural networks for gene expression analysis and gene selection from DNA microarray
    Patra, JC
    Zhen, Q
    Ang, EL
    Das, A
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 509 - 514
  • [47] Theoretical analysis of illumination in PCA-based vision systems
    Zhao, L
    Yang, YH
    PATTERN RECOGNITION, 1999, 32 (04) : 547 - 564
  • [48] A PCA-based method for ancestral informative markers selection in structured populations
    ZHANG Feng1
    2 Institute of Bioscience and Biotechnology
    3 Departments of Orthopedic Surgery and Basic Medical Science
    Science in China(Series C:Life Sciences), 2009, (10) : 972 - 976
  • [49] A novel PCA-based Bayes classifier and face analysis
    Jin, Z
    Davoine, F
    Lou, Z
    Yang, JY
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832 : 144 - 150
  • [50] PCA-based analysis on factors of English translation ability
    Hong, Ting
    Computer Modelling and New Technologies, 2014, 18 (11): : 940 - 944