Benchmarking Sepsis Gene Expression Diagnostics Using Public Data

被引:67
|
作者
Sweeney, Timothy E. [1 ,2 ]
Khatri, Purvesh [1 ,2 ]
机构
[1] Stanford Univ, Sch Med, Stanford Inst Immun Transplantat & Infect, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Med, Sch Med, Div Biomed Informat Res, Stanford, CA 94305 USA
关键词
diagnosis; gene expression; microarray; sepsis; PEDIATRIC SEPTIC SHOCK; COMMUNITY-ACQUIRED PNEUMONIA; COMPREHENSIVE VALIDATION; MOLECULAR BIOMARKER; BACTERIAL-INFECTION; BLOOD TRANSCRIPTOME; PANDEMIC INFLUENZA; FAIM3PLAC8; RATIO; SIGNATURE; TIME;
D O I
10.1097/CCM.0000000000002021
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
Objective: In response to a need for better sepsis diagnostics, several new gene expression classifiers have been recently published, including the 11-gene "Sepsis MetaScore," the "FAIM3-to-PLAC8" ratio, and the Septicyte Lab. We performed a systematic search for publicly available gene expression data in sepsis and tested each gene expression classifier in all included datasets. We also created a public repository of sepsis gene expression data to encourage their future reuse. Data Sources: We searched National Institutes of Health Gene Expression Omnibus and EBI ArrayExpress for human gene expression microarray datasets. We also included the Glue Grant trauma gene expression cohorts. Study Selection: We selected clinical, time-matched, whole blood studies of sepsis and acute infections as compared to healthy and/or noninfectious inflammation patients. We identified 39 datasets composed of 3,241 samples from 2,604 patients. Data Extraction: All data were renormalized from raw data, when available, using consistent methods. Data Synthesis: Mean validation areas under the receiver operating characteristic curve for discriminating septic patients from patients with noninfectious inflammation for the Sepsis MetaScore, the FAIM3-to-PLAC8 ratio, and the Septicyte Lab were 0.82 (range, 0.73-0.89), 0.78 (range, 0.49-0.96), and 0.73 (range, 0.44-0.90), respectively. Paired-sample t tests of validation datasets showed no significant differences in area under the receiver operating characteristic curves. Mean validation area under the receiver operating characteristic curves for discriminating infected patients from healthy controls for the Sepsis MetaScore, FAIM3-to-PLAC8 ratio, and Septicyte Lab were 0.97 (range, 0.85-1.0), 0.94 (range, 0.65-1.0), and 0.71 (range, 0.24-1.0), respectively. There were few significant differences in any diagnostics due to pathogen type. Conclusions: The three diagnostics do not show significant differences in overall ability to distinguish noninfectious systemic inflammatory response syndrome from sepsis, though the performance in some datasets was low (area under the receiver operating characteristic curve, < 0.7) for the FAIM3-to-PLAC8 ratio and Septicyte Lab. The Septicyte Lab also demonstrated significantly worse performance in discriminating infections as compared to healthy controls. Overall, public gene expression data are a useful tool for benchmarking gene expression diagnostics.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Novel diagnostics for sepsis: A decade of promise for gene expression profiling
    Cobb, J. Perren
    Hayden, Douglas L.
    Schoenfeld, David A.
    [J]. CRITICAL CARE MEDICINE, 2011, 39 (11) : 2579 - 2581
  • [2] Statistical benchmarking and class discovery in gene expression data
    Amir Ben-Dor
    Nir Friedman
    Zohar Yakhini
    [J]. Nature Genetics, 2001, 27 (Suppl 4) : 96 - 96
  • [3] Benchmarking in endoscopy -: Data transparency for the public?
    Rosien, U.
    Roesch, T.
    [J]. ZEITSCHRIFT FUR GASTROENTEROLOGIE, 2007, 45 (12): : 1227 - 1227
  • [4] Expression profiling:: Toward an application in sepsis diagnostics
    Prucha, M
    Ruryk, A
    Boriss, H
    Möller, E
    Zazula, R
    Herold, I
    Claus, RA
    Reinhart, KA
    Deigner, P
    Russwurm, S
    [J]. SHOCK, 2004, 22 (01): : 29 - 33
  • [5] Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study
    Seifert, Stephan
    Gundlach, Sven
    Junge, Olaf
    Szymczak, Silke
    [J]. BIOINFORMATICS, 2020, 36 (15) : 4301 - 4308
  • [6] Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data
    Sirota, Marina
    Dudley, Joel T.
    Kim, Jeewon
    Chiang, Annie P.
    Morgan, Alex A.
    Sweet-Cordero, Alejandro
    Sage, Julien
    Butte, Atul J.
    [J]. SCIENCE TRANSLATIONAL MEDICINE, 2011, 3 (96)
  • [7] Broad-range 16S rRNA gene PCR using SepsiTest™ in conjunction with valid clinical data and sepsis biomarkers improve sepsis diagnostics
    Skvarc, M.
    Stublar, D.
    Rogina, P.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL MICROBIOLOGY, 2012, 302 : 6 - 6
  • [8] ArrayExpress: a public database of gene expression data at EBI
    Rocca-Serra, P
    Brazma, A
    Parkinson, H
    Sarkans, U
    Shojatalab, M
    Contrino, S
    Vilo, J
    Abeygunawardena, N
    Mukherjee, G
    Holloway, E
    Kapushesky, M
    Kemmeren, P
    Lara, GG
    Oezcimen, A
    Sansone, SA
    [J]. COMPTES RENDUS BIOLOGIES, 2003, 326 (10-11) : 1075 - 1078
  • [9] Comparison of machine-learning methodologies for accurate diagnosis of sepsis using microarray gene expression data
    Schaack, Dominik
    Weigand, Markus A.
    Uhle, Florian
    [J]. PLOS ONE, 2021, 16 (05):
  • [10] Benchmarking big data architectures for social networks data processing using public cloud platforms
    Persico, Valerio
    Pescape, Antonio
    Picariello, Antonio
    Sperli, Giancarlo
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 89 : 98 - 109