This article proposes an innovative method for constructing confidence intervals and assessing p-values in statistical inference for high-dimensional linear models. The proposed method has successfully broken the high-dimensional inference problem into a series of low-dimensional inference problems: For each regression coefficient beta(i), the confidence interval and p-value are computed by regressing on a subset of variables selected according to the conditional independence relations between the corresponding variable X-i and other variables. Since the subset of variables forms a Markov neighborhood of X-i in the Markov network formed by all the variables X-1, X-2, ... , X-p, the proposed method is coined as Markov neighborhood regression (MNR). The proposed method is tested on high-dimensional linear, logistic, and Cox regression. The numerical results indicate that the proposed method significantly outperforms the existing ones. Based on the MNR, a method of learning causal structures for high-dimensional linear models is proposed and applied to identification of drug sensitive genes and cancer driver genes. The idea of using conditional independence relations for dimension reduction is general and potentially can be extended to other high-dimensional or big data problems as well. Supplementary materials for this article are available online.
机构:
Fed Univ Rio Janeiro Rio De Janeiro, Inst Math, Rio De Janeiro, RJ, BrazilFed Univ Rio Janeiro Rio De Janeiro, Inst Math, Rio De Janeiro, RJ, Brazil
Ost, Guilherme
Takahashi, Daniel Y.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Fed Rio Grande do Norte, Brain Inst, Natal, RN, BrazilFed Univ Rio Janeiro Rio De Janeiro, Inst Math, Rio De Janeiro, RJ, Brazil
机构:
Department of Statistics, Harvard University, Cambridge,MA,02138, United StatesDepartment of Statistics, Columbia University, New York,NY,10027, United States
机构:
Department of Statistics, Rutgers University, Piscataway,NJ, United StatesDepartment of Statistics, Rutgers University, Piscataway,NJ, United States
Guo, Zijian
Rakshit, Prabrisha
论文数: 0引用数: 0
h-index: 0
机构:
Department of Statistics, Rutgers University, Piscataway,NJ, United StatesDepartment of Statistics, Rutgers University, Piscataway,NJ, United States
Rakshit, Prabrisha
Herman, Daniel S.
论文数: 0引用数: 0
h-index: 0
机构:
Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia,PA, United StatesDepartment of Statistics, Rutgers University, Piscataway,NJ, United States
Herman, Daniel S.
Chen, Jinbo
论文数: 0引用数: 0
h-index: 0
机构:
Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia,PA, United StatesDepartment of Statistics, Rutgers University, Piscataway,NJ, United States