Adaptive distributed support vector regression of massive data

Cited by: 2
Authors
Liang, Shu-na [1]
Sun, Fei [1]
Zhang, Qi [1]
Affiliation
[1] Qingdao Univ, Sch Math & Stat, Qingdao, Peoples R China
Keywords
Massive datasets; smoothing; support vector regression; distributed; QUANTILE REGRESSION; INFERENCE; ALGORITHM; MACHINES;
DOI
10.1080/03610926.2022.2153604
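Chinese Library Classification (CLC) number
O21 [Probability and Mathematical Statistics]; C8 [Statistics];
Discipline classification code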
020208 ; 070103 ; 0714 ;
Abstract
Massive datasets bring new challenges to traditional statistical inference, particularly in terms of memory restrictions and computation time. Support vector regression is a robust and efficient estimation method. We first adopt smoothing techniques to develop a smoothed support vector regression (S-SVR) estimation method. We then propose a distributed S-SVR (DS-SVR) algorithm for massive datasets. The proposed method addresses the problems of memory restriction and computation time, and the resulting estimate can achieve the same efficiency as the estimator computed on all the data. We also establish the asymptotic normality of the resulting estimate. In addition, we propose an adaptive learning process for the parameters that combines grid search with k-fold cross-validation, in which the optimal parameters (lambda, epsilon) are selected automatically for each dataset. Finally, the performance of the proposed method is illustrated by simulation studies.
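The parameter-selection step described in the abstract (grid search combined with k-fold cross-validation over lambda and epsilon) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' S-SVR or DS-SVR algorithm: the softplus smoothing of the epsilon-insensitive loss, the bandwidth `h`, the plain gradient-descent solver, the mean-absolute-error score, the synthetic data, and the function names `fit_smoothed_svr` and `select_by_cv` are all assumptions made for the example. The distributed part of the method is not covered.

```python
import numpy as np

def _sigmoid(u):
    # Numerically stable logistic function via tanh.
    return 0.5 * (1.0 + np.tanh(0.5 * u))

def fit_smoothed_svr(X, y, lam, eps, h=0.25, lr=0.2, iters=300):
    """Linear fit minimizing a softplus-smoothed epsilon-insensitive loss
    plus (lam / 2) * ||w||^2, by plain gradient descent (assumed solver)."""
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(iters):
        r = y - X @ w - b
        # Derivative w.r.t. r of h*softplus((r-eps)/h) + h*softplus((-r-eps)/h).
        g = _sigmoid((r - eps) / h) - _sigmoid((-r - eps) / h)
        w -= lr * (-(X.T @ g) / n + lam * w)
        b -= lr * (-g.mean())
    return w, b

def select_by_cv(X, y, lams, epss, k=5, seed=0):
    """Grid search over (lam, eps), scored by k-fold mean absolute error."""
    idx = np.random.default_rng(seed).permutation(len(y))
    folds = np.array_split(idx, k)
    best = (np.inf, None, None)
    for lam in lams:
        for eps in epss:
            errs = []
            for j in range(k):
                val = folds[j]
                trn = np.concatenate([folds[i] for i in range(k) if i != j])
                w, b = fit_smoothed_svr(X[trn], y[trn], lam, eps)
                errs.append(np.mean(np.abs(y[val] - X[val] @ w - b)))
            score = float(np.mean(errs))
            if score < best[0]:
                best = (score, lam, eps)
    return best  # (cv_score, lam, eps)

# Synthetic data (assumed for illustration): linear signal plus small noise.
rng = np.random.default_rng(1)
X = rng.normal(size=(400, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.3 + 0.1 * rng.normal(size=400)

lam_grid = [1e-3, 1e-2, 1e-1, 1.0]
eps_grid = [0.05, 0.1, 0.5]
cv_score, best_lam, best_eps = select_by_cv(X, y, lam_grid, eps_grid)
w_hat, b_hat = fit_smoothed_svr(X, y, best_lam, best_eps)
```

Each (lambda, epsilon) pair on the grid is refit k times on the training folds and scored on the held-out fold; the pair with the lowest average validation error is then used to refit on the full data.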
Pages: 3365 - 3382
Number of pages: 18
Related papers
50 in total
  • [21] Data Selection Using Support Vector Regression
    Michael B. RICHMAN
    Lance M. LESLIE
    Theodore B. TRAFALIS
    Hicham MANSOURI
    Advances in Atmospheric Sciences, 2015, 32 (03) : 277 - 286
  • [22] Support vector regression methods for functional data
    Hernandez, Noslen
    Biscay, Rolando J.
    Talavera, Isneri
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 564 - +
  • [23] Data selection using support vector regression
    Michael B. Richman
    Lance M. Leslie
    Theodore B. Trafalis
    Hicham Mansouri
    Advances in Atmospheric Sciences, 2015, 32 : 277 - 286
  • [24] Support vector regression for polyhedral and missing data
    Gianluca Gazzola
    Myong K. Jeong
    Annals of Operations Research, 2021, 303 : 483 - 506
  • [25] Support vector regression for right censored data
    Goldberg, Yair
    Kosorok, Michael R.
    ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (01) : 532 - 569
  • [26] Support vector regression with input data uncertainty
    Zhong, Ping
    Wang, Laisheng
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (09) : 2325 - 2332
  • [27] Pairing support vector algorithm for data regression
    Hao, Pei-Yi
    NEUROCOMPUTING, 2017, 225 : 174 - 187
  • [28] Data selection using support vector regression
    Richman, Michael B.
    Leslie, Lance M.
    Trafalis, Theodore B.
    Mansouri, Hicham
    ADVANCES IN ATMOSPHERIC SCIENCES, 2015, 32 (03) : 277 - 286
  • [29] River stage prediction based on a distributed support vector regression
    Wu, C. L.
    Chau, K. W.
    Li, Y. S.
    JOURNAL OF HYDROLOGY, 2008, 358 (1-2) : 96 - 111
  • [30] Support vector regression for rate prediction in distributed video coding
    Nickaein, Isaac
    Rahmati, Mohammad
    Hamzei, Nazanin
    INTELLIGENT DATA ANALYSIS, 2014, 18 (03) : 465 - 477