A permutation-based non-parametric analysis of CRISPR screen data

被引:21
|
作者
Jia, Gaoxiang [1 ,2 ]
Wang, Xinlei [1 ]
Xiao, Guanghua [2 ,3 ,4 ]
机构
[1] Southern Methodist Univ, Dept Stat Sci, Dallas, TX 75205 USA
[2] Univ Texas Southwestern Med Ctr Dallas, Dept Clin Sci, Quantitat Biomed Res Ctr, Dallas, TX 75390 USA
[3] Univ Texas Southwestern Med Ctr Dallas, Dept Bioinformat, Dallas, TX 75390 USA
[4] Univ Texas Southwestern Med Ctr Dallas, Simmons Comprehens Canc Ctr, Dallas, TX 75390 USA
来源
BMC GENOMICS | 2017年 / 18卷
基金
美国国家卫生研究院;
关键词
Functional genomics; False discovery rate; RNA interference; Negative selection; Next generation sequencing; Positive selection; IDENTIFYING DIFFERENTIAL EXPRESSION; FALSE DISCOVERY RATE; ESSENTIAL GENES; OFF-TARGET; IDENTIFICATION; METAANALYSIS; SELECTION; TOOL;
D O I
10.1186/s12864-017-3938-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Clustered regularly-interspaced short palindromic repeats (CRISPR) screens are usually implemented in cultured cells to identify genes with critical functions. Although several methods have been developed or adapted to analyze CRISPR screening data, no single specific algorithm has gained popularity. Thus, rigorous procedures are needed to overcome the shortcomings of existing algorithms. Methods: We developed a Permutation-Based Non-Parametric Analysis (PBNPA) algorithm, which computes p-values at the gene level by permuting sgRNA labels, and thus it avoids restrictive distributional assumptions. Although PBNPA is designed to analyze CRISPR data, it can also be applied to analyze genetic screens implemented with siRNAs or shRNAs and drug screens. Results: We compared the performance of PBNPA with competing methods on simulated data as well as on real data. PBNPA outperformed recent methods designed for CRISPR screen analysis, as well as methods used for analyzing other functional genomics screens, in terms of Receiver Operating Characteristics (ROC) curves and False Discovery Rate (FDR) control for simulated data under various settings. Remarkably, the PBNPA algorithm showed better consistency and FDR control on published real data as well. Conclusions: PBNPA yields more consistent and reliable results than its competitors, especially when the data quality is low. R package of PBNPA is available at: https://cran.r-project.org/web/packages/PBNPA/.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A permutation-based non-parametric analysis of CRISPR screen data
    Gaoxiang Jia
    Xinlei Wang
    Guanghua Xiao
    [J]. BMC Genomics, 18
  • [2] Rank-based permutation approaches for non-parametric factorial designs
    Umlauft, Maria
    Konietschke, Frank
    Pauly, Markus
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2017, 70 (03): : 368 - 390
  • [3] A non-parametric independence test using permutation entropy
    Matilla-Garcia, Mariano
    Marin, Manuel Ruiz
    [J]. JOURNAL OF ECONOMETRICS, 2008, 144 (01) : 139 - 155
  • [4] Non-parametric combination and related permutation tests for neuroimaging
    Winkler, Anderson M.
    Webster, Matthew A.
    Brooks, Jonathan C.
    Tracey, Irene
    Smith, Stephen M.
    Nichols, Thomas E.
    [J]. HUMAN BRAIN MAPPING, 2016, 37 (04) : 1486 - 1511
  • [5] Parametric and non-parametric statistical analysis of DT-MRI data
    Pajevic, S
    Basser, PJ
    [J]. JOURNAL OF MAGNETIC RESONANCE, 2003, 161 (01) : 1 - 14
  • [6] Runtime Analysis for Permutation-based Evolutionary Algorithms
    Benjamin Doerr
    Yassine Ghannane
    Marouane Ibn Brahim
    [J]. Algorithmica, 2024, 86 : 90 - 129
  • [7] Non-parametric Nevada Format Field Data Analysis
    Zhang, Jiliang
    [J]. 2018 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS), 2018,
  • [8] Design and analysis of permutation-based pyramid broadcasting
    Aggarwal, CC
    Wolf, JL
    Yu, PS
    [J]. MULTIMEDIA SYSTEMS, 1999, 7 (06) : 439 - 448
  • [9] Non-parametric Bayesian analysis of clustered survival data
    Lee, Jaeyong
    [J]. STATISTICS, 2008, 42 (06) : 515 - 526
  • [10] Sensitivity analysis of non-parametric tests for censored data
    Robin, S
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1996, 25 (08) : 1693 - 1715