Multiple-platform data integration method with application to combined analysis of microarray and proteomic data

被引:10
|
作者
Wu, Shicheng [1 ]
Xu, Yawen [1 ]
Feng, Zeny [2 ]
Yang, Xiaojian [2 ]
Wang, Xiaogang [1 ]
Gao, Xin [1 ]
机构
[1] York Univ, Dept Math & Stat, Toronto, ON M3J 1P3, Canada
[2] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
GENE-EXPRESSION; METAANALYSIS; FRAMEWORK;
D O I
10.1186/1471-2105-13-320
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: It is desirable in genomic studies to select biomarkers that differentiate between normal and diseased populations based on related data sets from different platforms, including microarray expression and proteomic data. Most recently developed integration methods focus on correlation analyses between gene and protein expression profiles. The correlation methods select biomarkers with concordant behavior across two platforms but do not directly select differentially expressed biomarkers. Other integration methods have been proposed to combine statistical evidence in terms of ranks and p-values, but they do not account for the dependency relationships among the data across platforms. Results: In this paper, we propose an integration method to perform hypothesis testing and biomarkers selection based on multi-platform data sets observed from normal and diseased populations. The types of test statistics can vary across the platforms and their marginal distributions can be different. The observed test statistics are aggregated across different data platforms in a weighted scheme, where the weights take into account different variabilities possessed by test statistics. The overall decision is based on the empirical distribution of the aggregated statistic obtained through random permutations. Conclusion: In both simulation studies and real biological data analyses, our proposed method of multi-platform integration has better control over false discovery rates and higher positive selection rates than the uncombined method. The proposed method is also shown to be more powerful than rank aggregation method.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] A new clustering method for microarray data analysis
    Zhang, LX
    Zhu, S
    CSB2002: IEEE COMPUTER SOCIETY BIOINFORMATICS CONFERENCE, 2002, : 268 - 275
  • [32] MiMiR – an integrated platform for microarray data sharing, mining and analysis
    Chris Tomlinson
    Manjula Thimma
    Stelios Alexandrakis
    Tito Castillo
    Jayne L Dennis
    Anthony Brooks
    Thomas Bradley
    Carly Turnbull
    Ekaterini Blaveri
    Geraint Barton
    Norie Chiba
    Klio Maratou
    Pat Soutter
    Tim Aitman
    Laurence Game
    BMC Bioinformatics, 9
  • [33] EMMA:: a platform for consistent storage and efficient analysis of microarray data
    Dondrup, M
    Goesmann, A
    Bartels, D
    Kalinowski, J
    Krause, L
    Linke, B
    Rupp, O
    Sczyrba, A
    Pühler, A
    Meyer, F
    JOURNAL OF BIOTECHNOLOGY, 2003, 106 (2-3) : 135 - 146
  • [34] ANDROMEDA: A MATLAB automated cDNA Microarray data analysis platform
    Chatziioannou, Aristotelis
    Moulos, Panagiotis
    ARTIFICIAL INTELLIGENCE AND INNOVATIONS 2007: FROM THEORY TO APPLICATIONS, 2007, : 127 - +
  • [35] MiMiR - an integrated platform for microarray data sharing, mining and analysis
    Tomlinson, Chris
    Thimma, Manjula
    Alexandrakis, Stelios
    Castillo, Tito
    Dennis, Jayne L.
    Brooks, Anthony
    Bradley, Thomas
    Turnbull, Carly
    Blaveri, Ekaterini
    Barton, Geraint
    Chiba, Norie
    Maratou, Klio
    Soutter, Pat
    Aitman, Tim
    Game, Laurence
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [36] A scalable method for integration and functional analysis of multiple microarray datasets
    Huttenhower, Curtis
    Hibbs, Matt
    Myers, Chad
    Troyanskaya, Olga G.
    BIOINFORMATICS, 2006, 22 (23) : 2890 - 2897
  • [37] BioLadder: A bioinformatic platform primarily focused on proteomic data analysis
    Zhang, Yupeng
    Yang, Chunyuan
    Wang, Jinhao
    Wang, Lixin
    Zhao, Yan
    Sun, Longqing
    Sun, Wei
    Zhu, Yunping
    Li, Jingli
    Wu, Songfeng
    IMETA, 2024, 3 (04):
  • [38] Combined Method for Integration of Heterogeneous Ontology Models for Big Data Processing and Analysis
    Kureychik, Viktor
    Semenova, Alexandra
    ARTIFICIAL INTELLIGENCE TRENDS IN INTELLIGENT SYSTEMS, CSOC2017, VOL 1, 2017, 573 : 302 - 311
  • [39] A multivariate analysis approach to the integration of proteomic and gene expression data
    Fagan, Ailis
    Culhane, Aedin C.
    Higgins, Desmond G.
    PROTEOMICS, 2007, 7 (13) : 2162 - 2171
  • [40] Functional annotation and network reconstruction through cross-platform integration of microarray data
    Zhou, XHJ
    Kao, MCJ
    Huang, HY
    Wong, A
    Nunez-Iglesias, J
    Primig, M
    Aparicio, OM
    Finch, CE
    Morgan, TE
    Wong, WH
    NATURE BIOTECHNOLOGY, 2005, 23 (02) : 238 - 243