Testing covariates in high dimension linear regression with latent factors

被引:5
|
作者
Lan, Wei [1 ,2 ]
Ding, Yue [3 ]
Fang, Zheng [4 ]
Fang, Kuangnan [5 ,6 ]
机构
[1] Southwestern Univ Finance & Econ, Sch Stat, Chengdu, Peoples R China
[2] Southwestern Univ Finance & Econ, Ctr Stat Res, Chengdu, Peoples R China
[3] Southwest Jiaotong Univ, Sch Econ & Management, Chengdu, Peoples R China
[4] Sichuan Univ, Sch Business, Chengdu 610064, Peoples R China
[5] Xiamen Univ, Sch Econ, Xiamen, Peoples R China
[6] Xiamen Univ, MOE Key Lab Econometr, Xiamen, Peoples R China
基金
中国国家自然科学基金;
关键词
Approximate factor model; Global significance testing; High dimension regression; Individual effect testing; MODELS;
D O I
10.1016/j.jmva.2015.10.013
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose here both F-test and z-test (or t-test) for testing global significance and individual effect of each single predictor respectively in high dimension regression model when the explanatory variables follow a latent factor structure (Wang, 2012). Under the null hypothesis, together with fairly mild conditions on the explanatory variables and latent factors, we show that the proposed F-test and t-test are asymptotically distributed as weighted chi-square and standard normal distribution respectively. That leads to quite different test statistics and inference procedures, as compared with that of Zhong and Chen (2011) when the explanatory variables are weakly dependent. Moreover, based on the p-value of each predictor, the method of Storey et al. (2004) can be used to implement the multiple testing procedure, and we can achieve consistent model selection as long as we can select the threshold value appropriately. All the results are further supported by extensive Monte Carlo simulation studies. The practical utility of the two proposed tests are illustrated via a real data example for index funds tracking in China stock market. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:25 / 37
页数:13
相关论文
共 50 条
  • [1] Regression with Variable Dimension Covariates
    Mueller, Peter
    Quintana, Fernando Andres
    Page, Garritt L.
    SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2024, 86 (SUPPL 1): : 185 - 198
  • [2] Testing covariates in high-dimensional regression
    Lan, Wei
    Wang, Hansheng
    Tsai, Chih-Ling
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2014, 66 (02) : 279 - 301
  • [3] Testing covariates in high-dimensional regression
    Wei Lan
    Hansheng Wang
    Chih-Ling Tsai
    Annals of the Institute of Statistical Mathematics, 2014, 66 : 279 - 301
  • [4] LINEAR REGRESSION IN HIGH DIMENSION AND/OR FOR CORRELATED INPUTS
    Jacques, J.
    Fraix-Burnet, D.
    STATISTICS FOR ASTROPHYSICS: METHODS AND APPLICATIONS OF THE REGRESSION, 2015, 66 : 149 - +
  • [5] Variable screening in multivariate linear regression with high-dimensional covariates
    Bizuayehu, Shiferaw B.
    Li, Lu
    Xu, Jin
    STATISTICAL THEORY AND RELATED FIELDS, 2022, 6 (03) : 241 - 253
  • [6] Testing a sub-hypothesis in linear regression models with long memory covariates and errors
    Hira L. Koul
    Donatas Surgailis
    Applications of Mathematics, 2008, 53 : 235 - 248
  • [7] Testing a sub-hypothesis in linear regression models with long memory covariates and errors
    Koul, Hira L.
    Surgailis, Donatas
    APPLICATIONS OF MATHEMATICS, 2008, 53 (03) : 235 - 248
  • [8] Linear regression models with incomplete categorical covariates
    Toutenburg, H
    Nittner, T
    COMPUTATIONAL STATISTICS, 2002, 17 (02) : 215 - 232
  • [9] Multiple linear regression with compositional response and covariates
    Chen, Jiajia
    Zhang, Xiaoqin
    Li, Shengjia
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (12) : 2270 - 2285
  • [10] Linear Regression Models with Incomplete Categorical Covariates
    Helge Toutenburg
    Thomas Nittner
    Computational Statistics, 2002, 17 : 215 - 232