An Improved Data Integration Methodology for System Biology

Cited by: 0
Authors
Zhou, Xiaodong [1 ,2 ]
George, E. Olusegun [3 ]
Affiliations
[1] Univ Tennessee, Hlth Sci Ctr, Dept Anat & Neurobiol, Knoxville, TN 37996 USA
[2] Univ Memphis, Dept Comp Sci, Memphis, TN 38152 USA
[3] Univ Memphis, Dept Math Sci, Memphis, TN 38152 USA
Keywords
Optimal Weight; Pool P-value; Gene Expression; PROSTATE-CANCER; GENE-EXPRESSION;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Pooling P-values from independent experiments has been shown to improve the power of statistical tests. Instead of assigning equal weight to each dataset, Hwang et al. proposed a data integration methodology for systems biology, named Pointillist, that pools data using weighted P-values so as to maximize the number of significant genes discovered. Pointillist uses a simulated null distribution of the weighted combination statistics. We have found several fatal statistical errors in Pointillist and provide corrections for them. In addition, Pointillist is intrinsically computationally inefficient, requiring substantial and sometimes prohibitive computing time to converge at low significance levels. We propose a new approach for the optimal combination of P-values that uses the approximate theoretical distributions of the Fisher, logit, and Z omnibus combination statistics to estimate the P-value of the weighted pooled statistic. Our computationally efficient approach guarantees convergence at any significance level and produces accurate pooled P-values.
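As a rough illustration of what pooling weighted P-values against a theoretical null distribution can look like, the sketch below implements a generic weighted Z (Stouffer-type) combination. It is not the authors' implementation and does not reproduce their weight optimization: the function name, the one-sided convention, and the example weights are assumptions made here for illustration only.

```python
from math import sqrt
from statistics import NormalDist


def weighted_z_pooled_pvalue(pvalues, weights):
    """Pool independent one-sided P-values with a weighted Z statistic.

    Illustrative sketch only (hypothetical helper, not from the paper).
    Each P-value must lie strictly between 0 and 1.
    """
    if not pvalues or len(pvalues) != len(weights):
        raise ValueError("pvalues and weights must be non-empty and equal in length")

    std_normal = NormalDist()  # standard normal, mean 0 and sd 1

    # Map each P-value to a z-score; small P-values give large positive z.
    z_scores = [std_normal.inv_cdf(1.0 - p) for p in pvalues]

    # For independent datasets under the null, the weighted sum divided by
    # sqrt(sum of squared weights) is again standard normal, so the pooled
    # P-value comes from the theoretical distribution, with no simulation.
    z_pooled = sum(w * z for w, z in zip(weights, z_scores)) / sqrt(
        sum(w * w for w in weights)
    )
    return 1.0 - std_normal.cdf(z_pooled)


if __name__ == "__main__":
    # Hypothetical P-values for one gene in three datasets, with made-up weights.
    print(weighted_z_pooled_pvalue([0.04, 0.20, 0.01], [1.0, 0.5, 2.0]))
```

Because the null distribution is available in closed form, the cost of evaluating a pooled P-value does not depend on how small the significance level is, which is the kind of efficiency the abstract contrasts with a simulated null distribution.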
Pages: 235-240
Page count: 6
Related papers
50 in total
  • [1] A data integration methodology for systems biology: Experimental verification
    Hwang, D
    Smith, JJ
    Leslie, DM
    Weston, AD
    Rust, AG
    Ramsey, S
    Atauri, PD
    Siegel, AF
    Bolouri, H
    Aitchison, JD
    Hood, L
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (48) : 17302 - 17307
  • [2] Data integration and visualization system for enabling conceptual biology
    Gopalacharyulu, PV
    Lindfors, E
    Bounsaythip, C
    Kivioja, T
    Yetukuri, L
    Hollmén, J
    Oresic, M
    [J]. BIOINFORMATICS, 2005, 21 : I177 - I185
  • [3] Modeling a Community as a System of Systems: A Methodology For Data Integration
    Khayal, Inas
    [J]. 2018 13TH ANNUAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE), 2018, : 387 - 391
  • [4] A New Methodology for System Integration
    Gutierrez-Alcaraz, J. Marcelo
    de Haan, Sjoerd
    Ferreira, J. A.
    [J]. 2009 IEEE 6TH INTERNATIONAL POWER ELECTRONICS AND MOTION CONTROL CONFERENCE, VOLS 1-4, 2009, : 966 - 970
  • [5] Data integration in genomics and systems biology
    Serra, Angela
    Fratello, Michele
    Greco, Dario
    Tagliaferri, Roberto
    [J]. 2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 1272 - 1279
  • [6] DATA INTEGRATION METHODOLOGY FOR AN OFFICE ENVIRONMENT
    MARINOS, L
    PAPAZOGLOU, MP
    CHRISTODOULAKIS, D
    [J]. COMPUTING SYSTEMS, 1991, 6 (03): : 143 - 151
  • [7] Methodology of integration of a clinical data warehouse with a clinical information system: the HEGP case
    Zapletal, Eric
    Rodon, Nicolas
    Grabar, Natalia
    Degoulet, Patrice
    [J]. MEDINFO 2010, PTS I AND II, 2010, 160 : 193 - 197
  • [8] Systems biology data analysis methodology in pharmacogenomics
    Rodin, Andrei S.
    Gogoshin, Grigoriy
    Boerwinkle, Eric
    [J]. PHARMACOGENOMICS, 2011, 12 (09) : 1349 - 1360
  • [9] Data integration and analysis for medical systems biology
    van Beek, JHGM
    [J]. COMPARATIVE AND FUNCTIONAL GENOMICS, 2004, 5 (02): : 201 - 204
  • [10] SVS: Data and knowledge integration in computational biology
    Zycinski, Grzegorz
    Barla, Annalisa
    Verri, Alessandro
    [J]. 2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 6474 - 6478