Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data

被引:2
|
作者
Zhu, Dongxiao [1 ,2 ]
机构
[1] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
[2] Childrens Hosp, Res Inst Children, New Orleans, LA 70118 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
SINGULAR-VALUE DECOMPOSITION; EXPRESSION; SET; INFORMATION; PATTERNS; NETWORK;
D O I
10.1186/1471-2105-10-S1-S54
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly available signaling pathway prior knowledge provide new opportunity to solve this problem. Results: We propose an innovative semi-supervised gene clustering algorithm, where the original gene shaving algorithm was extended and generalized so that prior knowledge of signaling pathways can be incorporated. Different from other methods, our method identifies gene clusters showing concerted and modest expression variation as well as strong expression correlation. Using available pathway gene sets as prior knowledge, whether complete or incomplete, our algorithm is capable of forming tightly regulated gene clusters showing modest variation across samples. We demonstrate the advantages of our algorithm over the original gene shaving algorithm using two microarray data sets. The stability of the gene clusters was accessed using a jackknife approach. Conclusion: Our algorithm represents one of the first clustering algorithms that is particularly designed to identify signaling pathways of low and concordant gene expression variation. The discriminating power is achieved by manufacturing a principal component enriched by signaling pathways.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data
    Dongxiao Zhu
    BMC Bioinformatics, 10
  • [2] Simultaneous inference of biological networks of multiple species from genome-wide data and evolutionary information: a semi-supervised approach
    Kashima, Hisashi
    Yamanishi, Yoshihiro
    Kato, Tsuyoshi
    Sugiyama, Masashi
    Tsuda, Koji
    BIOINFORMATICS, 2009, 25 (22) : 2962 - 2968
  • [3] A Semi-Supervised Method for Predicting Cancer Survival Using Incomplete Clinical Data
    Hassanzadeh, Hamid Reza
    Phan, John H.
    Wang, May D.
    2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 210 - 213
  • [4] Predicting gene regulation by sigma factors in Bacillus subtilis from genome-wide data
    de Hoon, M. J. L.
    Makita, Y.
    Imoto, S.
    Kobayashi, K.
    Ogasawara, N.
    Nakai, K.
    Miyano, S.
    BIOINFORMATICS, 2004, 20 : 101 - 108
  • [5] A semi-supervised method for predicting transcription factor-gene interactions in Escherichia coli
    Ernst, Jason
    Beg, Qasim K.
    Kay, Krin A.
    Balázsi, Gábor
    Oltvai, Zoltán N.
    Bar-Joseph, Ziv
    PLoS Computational Biology, 2008, 4 (03)
  • [6] Genome-wide sequence-based prediction of peripheral proteins using a novel semi-supervised learning technique
    Nitin Bhardwaj
    Mark Gerstein
    Hui Lu
    BMC Bioinformatics, 11
  • [7] Genome-wide sequence-based prediction of peripheral proteins using a novel semi-supervised learning technique
    Bhardwaj, Nitin
    Gerstein, Mark
    Lu, Hui
    BMC BIOINFORMATICS, 2010, 11
  • [8] Semi-supervised methods to predict patient survival from gene expression data
    Bair, E
    Tibshirani, R
    PLOS BIOLOGY, 2004, 2 (04) : 511 - 522
  • [9] Semi-supervised Method for Gene Expression Data Classification with Gaussian Fields and Harmonic Functions
    Gong, Yun-Chao
    Chen, Chuan-Liang
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2217 - 2220
  • [10] Identification of New Biological Pathways Involved in Skin Aging From the Analysis of French Women Genome-Wide Data
    Rahmouni, Myriam
    Laville, Vincent
    Spadoni, Jean-Louis
    Jdid, Randa
    Eckhart, Leopold
    Gruber, Florian
    Labib, Taoufik
    Coulonges, Cedric
    Carpentier, Wassila
    Latreille, Julie
    Morizot, Frederique
    Tschachler, Erwin
    Ezzedine, Khaled
    Le Clerc, Sigrid
    Zagury, Jean-Francois
    FRONTIERS IN GENETICS, 2022, 13