Polya tree-based nearest neighborhood regression

被引:0
|
作者
Haoxin Zhuang
Liqun Diao
Grace Yi
机构
[1] University of Waterloo,Department of Statistics and Actuarial Science
[2] University of Western Ontario,Department of Statistical and Actuarial Sciences, Department of Computer Science
来源
Statistics and Computing | 2022年 / 32卷
关键词
Polya tree; Nearest neighborhood method; Regression; Nonparametric Bayesian method;
D O I
暂无
中图分类号
学科分类号
摘要
Parametric regression, such as linear regression, plays an important role in statistics. The use of parametric regression models typically involves the specification of a regression function of the covariates, the distribution of response and the link between the response and covariates, which are commonly at the risk of misspecification. In this paper, we introduce a fully nonparametric regression model, a Polya tree (PT)-based nearest neighborhood regression. To approximate the true conditional probability measure of the response given the covariate value, we construct a PT-distributed probability measure of the response in the nearest neighborhood of the covariate value of interest. Our proposed method gives consistent and robust estimators, and has a faster convergence rate than the kernel density estimation. We conduct extensive simulation studies and analyze a Combined Cycle Power Plant dataset to compare the performance of our method relative to kernel density estimation, PT density estimation, and linear dependent tail-free process (LDTFP). The studies suggest that the proposed method exhibits the superiority to the kernel and PT density estimation methods in terms of the estimation accuracy and convergence rate and to LDTFP in terms of robustness.
引用
收藏
相关论文
共 50 条
  • [1] Polya tree-based nearest neighborhood regression
    Zhuang, Haoxin
    Diao, Liqun
    Yi, Grace
    STATISTICS AND COMPUTING, 2022, 32 (04)
  • [2] Tree-based classification and regression Part 3: Tree-based procedures
    Gunter, B
    QUALITY PROGRESS, 1998, 31 (02) : 121 - 123
  • [3] A regression model based on the nearest centroid neighborhood
    V. García
    J. S. Sánchez
    A. I. Marqués
    R. Martínez-Peláez
    Pattern Analysis and Applications, 2018, 21 : 941 - 951
  • [4] Regression tree-based active learning
    Ashna Jose
    João Paulo Almeida de Mendonça
    Emilie Devijver
    Noël Jakse
    Valérie Monbet
    Roberta Poloni
    Data Mining and Knowledge Discovery, 2024, 38 : 420 - 460
  • [5] A regression model based on the nearest centroid neighborhood
    Garcia, V
    Sanchez, J. S.
    Marques, A., I
    Martinez-Pelaez, R.
    PATTERN ANALYSIS AND APPLICATIONS, 2018, 21 (04) : 941 - 951
  • [6] Regression tree-based active learning
    Jose, Ashna
    de Mendonca, Joao Paulo Almeida
    Devijver, Emilie
    Jakse, Noel
    Monbet, Valerie
    Poloni, Roberta
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (02) : 420 - 460
  • [7] Tree-based regression for a circular response
    Lund, UJ
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2002, 31 (09) : 1549 - 1560
  • [8] Tree-based model checking for logistic regression
    Su, Xiaogang
    STATISTICS IN MEDICINE, 2007, 26 (10) : 2154 - 2169
  • [9] A Tree-Based Solution to Nonlinear Regression Problem
    Demir, Oguzhan
    Mohaghegh, Mohammadreza N.
    Delibalta, Ibrahim
    Kozat, Suleyman S.
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1233 - 1236
  • [10] Comparison of tree-based ensemble models for regression
    Park, Sangho
    Kim, Chanmin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2022, 29 (05) : 561 - 590