HYPOTHESIS TESTING IN HIGH-DIMENSIONAL INSTRUMENTAL VARIABLES REGRESSION WITH AN APPLICATION TO GENOMICS DATA

被引:0
|
作者
Lu, Jiarui [1 ]
Li, Hongzhe [1 ]
机构
[1] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
关键词
Key words and phrases; Debiased estimation; FDR control; genetical genomics; inverse regression; multiple testing; GENE-EXPRESSION; CONFIDENCE-INTERVALS; LINEAR-MODELS; ASSOCIATIONS; ENDOGENEITY; TRAITS; GWAS;
D O I
10.5705/ss.202019.0408
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gene expression and phenotype association can be affected by potential unmeasured confounders from multiple sources, leading to biased estimates of the associations. Because genetic variants largely explain gene expression variations, they can be used as instrumental variables (IVs) when studying the association between gene expressions and phenotypes in a high-dimensional IV regression framework. Because the dimensions of both genetic variants and gene expressions are often larger than the sample size, statistical inferences (e.g., hypothesis testing) for such high-dimensional IV models are not trivial, and have not been investigated in the literature. The problem is made more challenging because the IVs (e.g., genetic variants) have to be selected from a large set of genetic variants. This study considers the problem of hypothesis testing for sparse IV regression models, and presents methods for testing a single regression coefficient and for multiple testing of multiple coefficients, where the test statistic for each single coefficient is constructed based on an inverse regression. A multiple testing procedure is developed for selecting variables, and is shown to control the false discovery rate. Simulations are conducted to evaluate the performance of our proposed methods. Lastly, we apply the proposed methods by analyzing a yeast data set in order to identify genes that are associated with growth in the presence of hydrogen peroxide.
引用
收藏
页码:613 / 633
页数:21
相关论文
共 50 条
  • [41] Inference in High-Dimensional Multivariate Response Regression with Hidden Variables
    Bing, Xin
    Cheng, Wei
    Feng, Huijie
    Ning, Yang
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023,
  • [42] Reprint: Hypothesis testing on high dimensional quantile regression
    Chen, Zhao
    Cheng, Vivian Xinyi
    Liu, Xu
    [J]. JOURNAL OF ECONOMETRICS, 2024, 239 (02)
  • [43] HYPOTHESIS TESTING IN HIGH-DIMENSIONAL LINEAR REGRESSION: A NORMAL-REFERENCE SCALE-INVARIANT TEST
    Zhu, Tianming
    Zhang, Liang
    Zhang, Jin-Ting
    [J]. STATISTICA SINICA, 2022, 32 : 1857 - 1879
  • [44] Global hypothesis testing for high-dimensional repeated measures outcomes
    Chi, Yueh-Yun
    Gribbin, Matthew
    Lamers, Yvonne
    Gregory, Jesse F., III
    Muller, Keith E.
    [J]. STATISTICS IN MEDICINE, 2012, 31 (08) : 724 - 742
  • [45] Power computation for hypothesis testing with high-dimensional covariance matrices
    Lin, Ruitao
    Liu, Zhongying
    Zheng, Shurong
    Yin, Guosheng
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 104 : 10 - 23
  • [46] HYPOTHESIS TESTING ON LINEAR STRUCTURES OF HIGH-DIMENSIONAL COVARIANCE MATRIX
    Zheng, Shurong
    Chen, Zhao
    Cui, Hengjian
    Li, Runze
    [J]. ANNALS OF STATISTICS, 2019, 47 (06): : 3300 - 3334
  • [47] High-dimensional general linear hypothesis testing under heteroscedasticity
    Zhou, Bu
    Guo, Jia
    Zhang, Jin-Ting
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2017, 188 : 36 - 54
  • [48] Linear Hypothesis Testing in Dense High-Dimensional Linear Models
    Zhu, Yinchu
    Bradic, Jelena
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (524) : 1583 - 1600
  • [49] High-dimensional data analysis: Selection of variables, data compression and graphics - Application to gene expression
    Laeuter, Juergen
    Horn, Friedernann
    Rosolowski, Maciej
    Glimm, Ekkehard
    [J]. BIOMETRICAL JOURNAL, 2009, 51 (02) : 235 - 251
  • [50] Dummy endogenous treatment effect estimation using high-dimensional instrumental variables
    Zhong, Wei
    Zhou, Wei
    Fan, Qingliang
    Gao, Yang
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2022, 50 (03): : 795 - 819