Clustered partial linear regression

被引:6
|
作者
Torgo, L
Da Costa, JP
机构
[1] Univ Porto, LIACC, FEP, P-4150 Oporto, Portugal
[2] Univ Porto, LIACC, DMA, P-4150 Oporto, Portugal
关键词
multistrategy learning; regression; clustering; multiple models;
D O I
10.1023/A:1021770020534
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new method that deals with a supervised learning task usually known as multiple regression. The main distinguishing feature of our technique is the use of a multistrategy approach to this learning task. We use a clustering method to form sub-sets of the training data before the actual regression modeling takes place. This pre-clustering stage creates several training sub-samples containing cases that are "nearby" to each other from the perspective of the multidimensional input space. Supervised learning within each of these sub-samples is easier and more accurate as our experiments show. We call the resulting method clustered partial linear regression. Predictions using these models are preceded by a cluster membership query for each test case. The cluster membership probability of a test case is used as a weight in an averaging process that calculates the final prediction. This averaging process involves the predictions of the regression models associated to the clusters for which the test case may belong. We have tested this general multistrategy approach using several regression techniques and we have observed significant accuracy gains in several data sets. We have also compared our method to bagging that also uses an averaging process to obtain predictions. This experiment showed that the two methods are significantly different. Finally, we present a comparison of our method with several state-of-the-art regression methods showing its competitiveness.
引用
收藏
页码:303 / 319
页数:17
相关论文
共 50 条
  • [41] Gaussian functional regression for linear partial differential equations
    Nguyen, N. C.
    Peraire, J.
    [J]. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2015, 287 : 69 - 89
  • [42] Semi-functional partial linear quantile regression
    Ding, Hui
    Lu, Zhiping
    Zhang, Jian
    Zhang, Riquan
    [J]. STATISTICS & PROBABILITY LETTERS, 2018, 142 : 92 - 101
  • [43] Online Regression with Partial Information: Generalization and Linear Projection
    Ito, Shinji
    Hatano, Daisuke
    Sumita, Hanna
    Yabe, Akihiro
    Fukunaga, Takuro
    Kakimura, Naonori
    Kawarabayashi, Ken-ichi
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [44] Variable selection in partial linear regression with functional covariate
    Aneiros, G.
    Ferraty, F.
    Vieu, P.
    [J]. STATISTICS, 2015, 49 (06) : 1322 - 1347
  • [45] Bootstrap confidence bands and partial linear quantile regression
    Song, Song
    Ritov, Ya'acov
    Haerdle, Wolfgang K.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 107 : 244 - 262
  • [46] Partial envelopes for efficient estimation in multivariate linear regression
    Su, Zhihua
    Cook, R. Dennis
    [J]. BIOMETRIKA, 2011, 98 (01) : 133 - 146
  • [47] Subgroup analysis for functional partial linear regression model
    Ma, Haiqiang
    Liu, Chao
    Xu, Sheng
    Yang, Jin
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2023, 51 (02): : 559 - 579
  • [48] Rank method for partial functional linear regression models
    Cao, Ruiyuan
    Xie, Tianfa
    Yu, Ping
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2021, 50 (02) : 354 - 379
  • [49] Robust Estimation for Partial Functional Linear Regression Model Based on Modal Regression
    Ping Yu
    Zhongyi Zhu
    Jianhong Shi
    Xikai Ai
    [J]. Journal of Systems Science and Complexity, 2020, 33 : 527 - 544
  • [50] Robust Estimation for Partial Functional Linear Regression Model Based on Modal Regression
    YU Ping
    ZHU Zhongyi
    SHI Jianhong
    AI Xikai
    [J]. Journal of Systems Science & Complexity, 2020, 33 (02) : 527 - 544