Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data

被引:2
|
作者
Zhu, Dongxiao [1 ,2 ]
机构
[1] Univ New Orleans, Dept Comp Sci, New Orleans, LA 70148 USA
[2] Childrens Hosp, Res Inst Children, New Orleans, LA 70118 USA
来源
BMC BIOINFORMATICS | 2009年 / 10卷
关键词
SINGULAR-VALUE DECOMPOSITION; EXPRESSION; SET; INFORMATION; PATTERNS; NETWORK;
D O I
10.1186/1471-2105-10-S1-S54
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The gene shaving algorithm and many other clustering algorithms identify gene clusters showing high variation across samples. However, gene expression in many signaling pathways show only modest and concordant changes that fail to be identified by these methods. The increasingly available signaling pathway prior knowledge provide new opportunity to solve this problem. Results: We propose an innovative semi-supervised gene clustering algorithm, where the original gene shaving algorithm was extended and generalized so that prior knowledge of signaling pathways can be incorporated. Different from other methods, our method identifies gene clusters showing concerted and modest expression variation as well as strong expression correlation. Using available pathway gene sets as prior knowledge, whether complete or incomplete, our algorithm is capable of forming tightly regulated gene clusters showing modest variation across samples. We demonstrate the advantages of our algorithm over the original gene shaving algorithm using two microarray data sets. The stability of the gene clusters was accessed using a jackknife approach. Conclusion: Our algorithm represents one of the first clustering algorithms that is particularly designed to identify signaling pathways of low and concordant gene expression variation. The discriminating power is achieved by manufacturing a principal component enriched by signaling pathways.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Identifying genetic risk factors for idiopathic Parkinson's disease by combining genome-wide copy number variation data and published data from genome-wide sib-pair linkage studies
    Abou-Sleiman, P.
    Vilarino-Guell, C.
    Quinn, N. P.
    Bhatia, K.
    Lees, A. J.
    Martinez, M.
    Pankratz, N.
    Foroud, T.
    Sebat, J.
    Wood, N. W.
    MOVEMENT DISORDERS, 2006, 21 : S548 - S548
  • [42] Genome-Wide Expression and Physiological Profiling of Pearl Millet Genotype Reveal the Biological Pathways and Various Gene Clusters Underlying Salt Resistance
    Awan, Samrah Afzal
    Khan, Imran
    Tariq, Rezwan
    Rizwan, Muhammad
    Wang, Xiaoshan
    Zhang, Xinquan
    Huang, Linkai
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [43] Method Systematic analysis of genome-wide fitness data in yeast reveals novel gene function and drug action
    Hillenmeyer, Maureen E.
    Ericson, Elke
    Davis, Ronald W.
    Nislow, Corey
    Koller, Daphne
    Giaever, Guri
    GENOME BIOLOGY, 2010, 11 (03):
  • [44] Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data
    Qian, J
    Lin, J
    Luscombe, NM
    Yu, HY
    Gerstein, M
    BIOINFORMATICS, 2003, 19 (15) : 1917 - 1926
  • [45] Integration of gene ontology pathways with North American Rheumatoid Arthritis Consortium genome-wide association data via linear modeling
    Jérémie JP Lebrec
    Tom WJ Huizinga
    René EM Toes
    Jeanine J Houwing-Duistermaat
    Hans C van Houwelingen
    BMC Proceedings, 3 (Suppl 7)
  • [46] A streamlined method for analysing genome-wide DNA methylation patterns from low amounts of FFPE DNA
    Jackie L. Ludgate
    James Wright
    Peter A. Stockwell
    Ian M. Morison
    Michael R. Eccles
    Aniruddha Chatterjee
    BMC Medical Genomics, 10
  • [47] A streamlined method for analysing genome-wide DNA methylation patterns from low amounts of FFPE DNA
    Ludgate, Jackie L.
    Wright, James
    Stockwell, Peter A.
    Morison, Ian M.
    Eccles, Michael R.
    Chatterjee, Aniruddha
    BMC MEDICAL GENOMICS, 2017, 10
  • [48] MeDEStrand: an improved method to infer genome-wide absolute methylation levels from DNA enrichment data
    Xu, Jingting
    Liu, Shimeng
    Yin, Ping
    Bulun, Serdar
    Dai, Yang
    BMC BIOINFORMATICS, 2018, 19
  • [49] MeDEStrand: an improved method to infer genome-wide absolute methylation levels from DNA enrichment data
    Jingting Xu
    Shimeng Liu
    Ping Yin
    Serdar Bulun
    Yang Dai
    BMC Bioinformatics, 19
  • [50] Semi-supervised machine learning method for predicting homogeneous ancestry groups to assess Hardy-Weinberg equilibrium in diverse whole-genome sequencing studies
    Shyr, Derek
    Dey, Rounak
    Li, Xihao
    Zhou, Hufeng
    Boerwinkle, Eric
    Buyske, Steve
    Daly, Mark
    Gibbs, Richard A.
    Hall, Ira
    Matise, Tara
    Reeves, Catherine
    Stitziel, Nathan O.
    Zody, Michael
    Neale, Benjamin M.
    Lin, Xihong
    AMERICAN JOURNAL OF HUMAN GENETICS, 2024, 111 (10)