Correcting the bias in least squares regression with volume-rescaled sampling

被引:0
|
作者
Derezinski, Michal [1 ]
Warmuth, Manfred K. [2 ]
Hsu, Daniel [3 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA
[3] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
来源
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Consider linear regression where the examples are generated by an unknown distribution on R-d x R. Without any assumptions on the noise, the linear least squares solution for any i.i.d. sample will typically be biased w.r.t. the least squares optimum over the entire distribution. However, we show that if an i.i.d. sample of any size k is augmented by a certain small additional sample, then the solution of the combined sample becomes unbiased. We show this when the additional sample consists of d points drawn jointly according to the input distribution that is rescaled by the squared volume spanned by the points. Furthermore, we propose algorithms to sample from this volume-rescaled distribution when the data distribution is only known through an i.i.d sample.
引用
收藏
页码:944 / 953
页数:10
相关论文
共 50 条
  • [1] Uncertain Box-Cox Regression Analysis With Rescaled Least Squares Estimation
    Liu, Shiqin
    Fang, Liang
    Zhou, Zaiying
    Hong, Yiping
    IEEE ACCESS, 2020, 8 : 84769 - 84776
  • [2] BIAS CORRECTION ON CENSORED LEAST SQUARES REGRESSION MODELS
    Orbe, Jesus
    Nunez-Anton, Vicente
    KYBERNETIKA, 2012, 48 (05) : 1045 - 1063
  • [3] A BIAS BOUND FOR LEAST-SQUARES LINEAR-REGRESSION
    DUAN, N
    LI, KC
    STATISTICA SINICA, 1991, 1 (01) : 127 - 136
  • [4] On a near optimal sampling strategy for least squares polynomial regression
    Shin, Yeonjong
    Xiu, Dongbin
    JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 326 : 931 - 946
  • [5] Discriminative Semi-Supervised Feature Selection via Rescaled Least Squares Regression-Supplement
    Yuan, Guowen
    Chen, Xiaojun
    Wang, Chen
    Nie, Feiping
    Jing, Liping
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8177 - 8178
  • [6] APPROXIMATING LEAST-SQUARES BIAS IN MULTIPLE REGRESSION WITH ERRORS IN VARIABLES
    BLOMQVIST, AG
    REVIEW OF ECONOMICS AND STATISTICS, 1972, 54 (02) : 202 - 204
  • [7] Least trimmed squares regression, least median squares regression, and mathematical programming
    Giloni, A
    Padberg, M
    MATHEMATICAL AND COMPUTER MODELLING, 2002, 35 (9-10) : 1043 - 1060
  • [8] The relative efficiency of ranked set sampling in ordinary least squares regression
    Murff, EJT
    Sager, TW
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2006, 13 (01) : 41 - 51
  • [9] Optimal learning rates for least squares regularized regression with unbounded sampling
    Wang, Cheng
    Zhou, Ding-Xuan
    JOURNAL OF COMPLEXITY, 2011, 27 (01) : 55 - 67
  • [10] The Relative Efficiency of Ranked Set Sampling in Ordinary Least Squares Regression
    Elizabeth J. Tipton Murff
    Thomas W. Sager
    Environmental and Ecological Statistics, 2006, 13 : 41 - 51