Prediction with a flexible finite mixture-of-regressions

被引:4
|
作者
Ahonena, Ilmari [1 ,2 ]
Nevalainen, Jaakko [1 ,3 ]
Larocque, Denis [4 ]
机构
[1] Univ Turku, Dept Math & Stat, Turku, Finland
[2] Univ Turku, Inst Biomed, Turku, Finland
[3] Univ Tampere, Hlth Sci, Fac Social Sci, Tampere, Finland
[4] HEC Montreal, Dept Decis Sci, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Finite mixture regression; Random forest; Prediction intervals; Bootstrap; Penalization; VARIABLE SELECTION; MODEL SELECTION;
D O I
10.1016/j.csda.2018.01.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite mixture regression (FMR) is widely used for modeling data that originate from heterogeneous populations. In these settings, FMR can offer increased predictive power compared to more traditional one-class models. However, existing FMR methods rely heavily on mixtures of linear models, where the linear predictor must be given as an input. A flexible FMR model is presented using a combination of the random forest learner and a penalized linear FMR. The performance of the new method is assessed by predictive log-likelihood in extensive simulation studies. The method is shown to achieve equal performance with the existing FMR methods when the true regression functions are in fact linear and superior performance in cases where at least one of the regression functions is nonlinear. The method can handle a large number of covariates, and its predictive ability is not greatly affected by surplus variables. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:212 / 224
页数:13
相关论文
共 50 条