SPRING: A METHOD FOR IDENTIFYING DIFFERENTIALLY EXPRESSED GENES IN MICROARRAY DATA

Cited by: 1
Authors
Tian, Yuan [1 ,2 ]
Liu, Guixia [1 ,2 ]
Wu, Chunguo [1 ,2 ]
Rong, Guang [1 ,2 ]
Sun, An [1 ,2 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130023, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun 130023, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
microarray; self-organizing map; minimum spanning tree clustering; fuzzy clustering matrix; differentially expressed gene; ARTIFICIAL NEURAL-NETWORK; MEMBRANE-TRANSPORT; CANCER; DISEASE; IDENTIFICATION; BIOMARKERS; REPOSITORY; PROFILES; SAMPLES;
DOI
10.5504/BBEQ.2013.0083
Chinese Library Classification number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Analysis of 'omics' data is a central issue in systems biology. As one of the most widely used types of 'omics' data, gene expression profiles from microarray experiments are applied in many frontier studies. The first important step in analyzing microarray data is to identify the genes that are differentially expressed (DEGs) between two experimental conditions, and several DEG-identifying algorithms have been proposed for this purpose. However, both traditional algorithms, such as Fold-Change, T-test and Significance Analysis of Microarrays (SAM), and modern ones, such as Rank Product, Outlier Robust t-statistic and Outlier Sums, are statistics-based approaches that share the same core idea: DEGs are treated as differences between two series of numbers. We present a novel view based on the hypothesis that DEGs are differences between two input modes rather than between two numerical series, and we propose a novel non-statistical algorithm based on this idea, named Spring (SPG), which uses a Self-Organizing Map (SOM) neural network to detect the input modes of DEGs under two sets of conditions. First, the input matrix for the SOM is constructed by reconstructing the gene expression matrix, amplifying the differences of DEGs, and using pairs of units divided from the reconstructed gene expression matrix. Then, a strategy to improve accuracy and stability is proposed, based on the Mass Spring Model, Minimum Spanning Tree Clustering and a fuzzy clustering matrix. Compared with T-test and SAM, our algorithm identifies more DEGs with higher accuracy on both simulated and Homo sapiens datasets. Finally, we describe in detail how to transform SPG into a meta-analysis algorithm.
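For orientation, the statistics-based view that the abstract contrasts SPG against can be sketched in a few lines. This is not the SPG algorithm, whose details are not given in this record. It is only a minimal illustration of scoring each gene as a difference between two series of numbers, using a log2 fold change and Welch's t-statistic. The function names, the simulated data and all parameter values (`n_genes`, `n_deg`, `n_rep`, `shift`) are assumptions made for the example.

```python
import math
import random
import statistics


def log2_fold_change(cond_a, cond_b):
    """Difference of mean log2 expression (B minus A) for one gene."""
    return statistics.fmean(cond_b) - statistics.fmean(cond_a)


def welch_t(cond_a, cond_b):
    """Welch's t-statistic for one gene (unequal variances allowed)."""
    var_a = statistics.variance(cond_a)
    var_b = statistics.variance(cond_b)
    se = math.sqrt(var_a / len(cond_a) + var_b / len(cond_b))
    return (statistics.fmean(cond_b) - statistics.fmean(cond_a)) / se


def simulate(n_genes=200, n_deg=20, n_rep=6, shift=3.0, seed=0):
    """Hypothetical data: log2 intensities; the first n_deg genes are shifted in B."""
    rng = random.Random(seed)
    genes = []
    for g in range(n_genes):
        a = [rng.gauss(8.0, 0.5) for _ in range(n_rep)]
        mu_b = 8.0 + (shift if g < n_deg else 0.0)
        b = [rng.gauss(mu_b, 0.5) for _ in range(n_rep)]
        genes.append((a, b))
    return genes


def rank_genes(genes, score):
    """Return gene indices sorted by decreasing absolute score."""
    scores = [abs(score(a, b)) for a, b in genes]
    return sorted(range(len(genes)), key=lambda i: scores[i], reverse=True)


genes = simulate()
top_fc = rank_genes(genes, log2_fold_change)[:20]  # candidates by fold change
top_t = rank_genes(genes, welch_t)[:20]            # candidates by t-statistic
```

Both scores reduce each gene to a single number computed from its two groups of replicates, which is the "two series of numbers" framing that the abstract says SPG departs from.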
Pages: 4150 - 4156
Number of pages: 7
Related papers
50 in total
  • [1] Ranking analysis of microarray data: A powerful method for identifying differentially expressed genes
    Tan, Yuan-De
    Fornage, Myriam
    Fu, Yun-Xin
    [J]. GENOMICS, 2006, 88 (06) : 846 - 854
  • [2] Identifying Differentially Expressed Genes in Time Course Microarray Data
    Ping Ma
    Wenxuan Zhong
    Jun S. Liu
    [J]. Statistics in Biosciences, 2009, 1 (2) : 144 - 159
  • [3] Identifying Differentially Expressed Genes in Time Course Microarray Data
    Ma, Ping
    Zhong, Wenxuan
    Liu, Jun S.
    [J]. STATISTICS IN BIOSCIENCES, 2009, 1 (02) : 144 - 159
  • [4] Nonparametric methods for identifying differentially expressed genes in microarray data
    Troyanskaya, OG
    Garber, ME
    Brown, PO
    Botstein, D
    Altman, RB
    [J]. BIOINFORMATICS, 2002, 18 (11) : 1454 - 1461
  • [5] Identifying differentially expressed genes in cDNA microarray experiments
    Baggerly, KA
    Coombes, KR
    Hess, KR
    Stivers, DN
    Abruzzo, LV
    Zhang, W
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 639 - 659
  • [6] An Integrative Bioinformatics Analysis of Microarray Data for Identifying Differentially Expressed Genes in Preeclampsia
    Song, L. M.
    Long, M.
    Song, S. J.
    Wang, J. R.
    Zhao, G. W.
    Zhao, N.
    [J]. RUSSIAN JOURNAL OF GENETICS, 2022, 58 (07) : 866 - 875
  • [7] An Integrative Bioinformatics Analysis of Microarray Data for Identifying Differentially Expressed Genes in Preeclampsia
    L. M. Song
    M. Long
    S. J. Song
    J. R. Wang
    G. W. Zhao
    N. Zhao
    [J]. Russian Journal of Genetics, 2022, 58 : 866 - 875
  • [8] Fold-based meta-analysis: a method for identifying differentially expressed genes in microarray data
    Tian, Yuan
    Bai, Tian
    Liu, Guixia
    Li, Zhangxiu
    Wu, Jianan
    Zhou, Chunguang
    [J]. Journal of Information and Computational Science, 2013, 10 (11) : 3453 - 3462
  • [9] Mixture distribution approach for identifying differentially expressed genes in microarray data of Arabidopsis thaliana
    Anjum, Arfa
    Jaggi, Seema
    Varghese, Eldho
    Lall, Shwetank
    Rai, Anil
    Bhowmik, Arpan
    Mishra, Dwijesh Chandra
    Sarika
    [J]. INDIAN JOURNAL OF AGRICULTURAL SCIENCES, 2020, 90 (10) : 139 - 143
  • [10] Sample size for identifying differentially expressed genes in microarray experiments
    Wang, SJ
    Chen, JJ
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (04) : 714 - 726