Model selection for Gaussian regression with random design

被引:14
|
作者
Birgé, L [1 ]
机构
[1] Univ Paris 06, CNRS, Lab Probabil & Modeles Aleatoires, UMR 7599, F-75252 Paris 05, France
关键词
Besov spaces; Hellinger distance; minimax risk; model selection; random design regression;
D O I
10.3150/bj/1106314849
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper is concerned with Gaussian regression with random design, where the observation are independent and indentically distributed. It is known from work by Le Cam that the rate of convergence of optimal estimators is closely connected to the metric structure of the parameter space with respect to the Hellinger distance. In particular, this metric structure essentially determines the risk when the loss function is a power of the Hellinger distance. For random design regression. one typically uses as loss function the squared L-2-distance between the estimator and the parameter. If the parameter space is bounded with respect to the L-infinity-norm, both distances are equivalent. Without this assumption, it may happen that there is a large distortion between the two distances, resulting in some unusual rates of convergence for the squared L-2-risk, as noticed by Baraud. We explain this phenomenon and then show that the use of the Hellinger distance instead of the L-2-distance allows us to recover the usual rates and to carry out model selection in great generality. An extension to the L-2-risk is given under a boundedness assumption similar to that Liven by Wegkamp and by Yang.
引用
收藏
页码:1039 / 1051
页数:13
相关论文
共 50 条
  • [31] Gaussian functional regression for output prediction: Model assimilation and experimental design
    Nguyen, N. C.
    Peraire, J.
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 52 - 68
  • [32] Gaussian Model Selection
    Massart, Pascal
    [J]. CONCENTRATION INEQUALITIES AND MODEL SELECTION: ECOLE D'ETE DE PROBABILITES DE SAINT-FLOUR XXXIII - 2003, 2007, 1896 : 83 - 146
  • [33] Graphical Model Selection for Gaussian Conditional Random Fields in the Presence of Latent Variables
    Frot, Benjamin
    Jostins, Luke
    McVean, Gilean
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) : 723 - 734
  • [34] Model Selection in Local Approximation Gaussian Processes: A Markov Random Fields Approach
    Jalali, Hamed
    Pawelczyk, Martin
    Kasneci, Gjergji
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 768 - 778
  • [35] Gaussian Data-Aided Sensing With Multichannel Random Access and Model Selection
    Choi, Jinho
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (03): : 2412 - 2420
  • [36] Information-theoretic bounds on model selection for Gaussian Markov random fields
    Wang, Wei
    Wainwright, Martin J.
    Ramchandran, Kannan
    [J]. 2010 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, 2010, : 1373 - 1377
  • [37] VARIABLE SELECTION CONSISTENCY OF GAUSSIAN PROCESS REGRESSION
    Jiang, Sheng
    Tokdar, Surya T.
    [J]. ANNALS OF STATISTICS, 2021, 49 (05): : 2491 - 2505
  • [38] GEOMETRY-ADAPTED GAUSSIAN RANDOM FIELD REGRESSION
    Zhang, Zhen
    Wang, Mianzhi
    Xiang, Yijian
    Nehorai, Arye
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6528 - 6532
  • [39] Random Regression Model for Genetic Evaluation and Early Selection in the Iranian Holstein Population
    Salimiyekta, Yasamin
    Vaez-Torshizi, Rasoul
    Abbasi, Mokhtar Ali
    Emmamjome-Kashan, Nasser
    Amin-Afshar, Mehdi
    Guo, Xiangyu
    Jensen, Just
    [J]. ANIMALS, 2021, 11 (12):
  • [40] On Kernel Design for Online Model Selection by Gaussian Multikernel Adaptive Filtering
    Toda, Osamu
    Yukawa, Masahiro
    [J]. 2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,