Multiple-platform data integration method with application to combined analysis of microarray and proteomic data

被引:10
|
作者
Wu, Shicheng [1 ]
Xu, Yawen [1 ]
Feng, Zeny [2 ]
Yang, Xiaojian [2 ]
Wang, Xiaogang [1 ]
Gao, Xin [1 ]
机构
[1] York Univ, Dept Math & Stat, Toronto, ON M3J 1P3, Canada
[2] Univ Guelph, Dept Math & Stat, Guelph, ON N1G 2W1, Canada
来源
BMC BIOINFORMATICS | 2012年 / 13卷
关键词
GENE-EXPRESSION; METAANALYSIS; FRAMEWORK;
D O I
10.1186/1471-2105-13-320
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: It is desirable in genomic studies to select biomarkers that differentiate between normal and diseased populations based on related data sets from different platforms, including microarray expression and proteomic data. Most recently developed integration methods focus on correlation analyses between gene and protein expression profiles. The correlation methods select biomarkers with concordant behavior across two platforms but do not directly select differentially expressed biomarkers. Other integration methods have been proposed to combine statistical evidence in terms of ranks and p-values, but they do not account for the dependency relationships among the data across platforms. Results: In this paper, we propose an integration method to perform hypothesis testing and biomarkers selection based on multi-platform data sets observed from normal and diseased populations. The types of test statistics can vary across the platforms and their marginal distributions can be different. The observed test statistics are aggregated across different data platforms in a weighted scheme, where the weights take into account different variabilities possessed by test statistics. The overall decision is based on the empirical distribution of the aggregated statistic obtained through random permutations. Conclusion: In both simulation studies and real biological data analyses, our proposed method of multi-platform integration has better control over false discovery rates and higher positive selection rates than the uncombined method. The proposed method is also shown to be more powerful than rank aggregation method.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Multiple testing in the survival analysis of microarray data
    Correa, JA
    Dudoit, S
    Goldstein, DR
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2002, 10 : 298 - 298
  • [22] WebArrayDB: cross-platform microarray data analysis and public data repository
    Xia, Xiao-Qin
    McClelland, Michael
    Porwollik, Steffen
    Song, Wenzhi
    Cong, Xianling
    Wang, Yipeng
    BIOINFORMATICS, 2009, 25 (18) : 2425 - 2429
  • [23] A software pipeline for multiple microarray data analysis
    Agapito, Giuseppe
    Cannataro, Mario
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1941 - 1944
  • [24] Application of independent component analysis to microarray data
    Suri, RE
    INTERNATIONAL CONFERENCE ON INTEGRATION OF KNOWLEDGE INTENSIVE MULTI-AGENT SYSTEMS: KIMAS'03: MODELING, EXPLORATION, AND ENGINEERING, 2003, : 375 - 378
  • [25] Optimizing proteomic data access and analysis in the cloud: Leveraging Terra's integration with the Proteomic Data Commons
    LaPlante, Emily
    Huo, Bingxing
    Mani, D. R.
    Thangudu, Ratna R.
    CANCER RESEARCH, 2024, 84 (06)
  • [26] Analysis and summarization of correlations in data cubes and its application in microarray data analysis
    Chen, Chien-Yu
    Hwang, Shien-Ching
    Oyang, Yen-Jen
    INTELLIGENT DATA ANALYSIS, 2005, 9 (01) : 43 - 57
  • [27] Cross-platform microarray data integration using the Normalised Linear Transform
    Xiong, Huilin
    Zhang, Ya
    Chen, Xue-Wen
    Yu, Jiangsheng
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (02) : 142 - 157
  • [28] Combined gene selection methods for microarray data analysis
    Hu, Hong
    Li, Jiuyong
    Wang, Hua
    Daggard, Grant
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2006, 4251 : 976 - 983
  • [29] WGCNA Application to Proteomic and Metabolomic Data Analysis
    Pei, G.
    Chen, L.
    Zhang, W.
    PROTEOMICS IN BIOLOGY, PT A, 2017, 585 : 135 - 158
  • [30] A Systematic, Data-driven Approach to the Combined Analysis of Microarray and QTL Data
    Rennie, C.
    Hulme, H.
    Fisher, P.
    Hall, L.
    Agaba, M.
    Noyes, H. A.
    Kemp, S. J.
    Brass, A.
    ANIMAL GENOMICS FOR ANIMAL HEALTH, 2008, 132 : 293 - +