Adaptive distributed support vector regression of massive data

Cited by: 2
Authors
Liang, Shu-na [1]
Sun, Fei [1]
Zhang, Qi [1]
Affiliations
[1] Qingdao Univ, Sch Math & Stat, Qingdao, Peoples R China
Keywords
Massive datasets; smoothing; support vector regression; distributed; quantile regression; inference; algorithm; machines
DOI
10.1080/03610926.2022.2153604
Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
Massive datasets pose new challenges for traditional statistical inference, particularly in terms of memory constraints and computation time. Support vector regression is a robust and efficient estimation method. We first adopt smoothing techniques to develop a smoothed support vector regression (S-SVR) estimation method, and then propose a distributed S-SVR (DS-SVR) algorithm for massive datasets. The proposed method addresses the memory and computation-time constraints, and the resulting estimator achieves the same efficiency as the estimator computed on the full data. We also establish the asymptotic normality of the resulting estimator. In addition, we propose an adaptive parameter-learning procedure that combines grid search with k-fold cross-validation, so that the optimal parameters (lambda, epsilon) are selected automatically for each dataset. Finally, simulation studies illustrate the performance of the proposed method.
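The pipeline described in the abstract can be pictured with a minimal sketch. It is not the authors' DS-SVR algorithm: the record gives neither the smoothing function nor the aggregation rule, so the softplus smoothing of the epsilon-insensitive loss, the bandwidth `h`, the plain gradient-descent solver, the simple averaging of local estimates, and the mean-absolute-error validation criterion are all assumptions. The function names `fit_ssvr`, `cv_select`, and `distributed_ssvr` are hypothetical.

```python
import numpy as np

def sigmoid(z):
    # Numerically stable logistic function.
    return 0.5 * (1.0 + np.tanh(0.5 * z))

def fit_ssvr(X, y, lam, eps, h=0.1, n_iter=300):
    """Gradient descent on a smoothed eps-insensitive loss plus a ridge penalty.

    Assumed loss: h*softplus((r-eps)/h) + h*softplus((-r-eps)/h) with r = y - X @ beta,
    averaged over the sample, plus (lam/2)*||beta||^2.
    """
    n, p = X.shape
    # Upper bound on the curvature of the objective; 1/L is a safe step size.
    L = np.linalg.norm(X, 2) ** 2 / (2.0 * h * n) + lam
    beta = np.zeros(p)
    for _ in range(n_iter):
        r = y - X @ beta
        g = sigmoid((r - eps) / h) - sigmoid((-r - eps) / h)  # d(loss)/dr
        beta -= (-(X.T @ g) / n + lam * beta) / L
    return beta

def cv_select(X, y, lams, epss, k=3, seed=0):
    """Grid search over (lam, eps) scored by k-fold validation MAE (assumed criterion)."""
    idx = np.random.default_rng(seed).permutation(len(y))
    folds = np.array_split(idx, k)
    best, best_err = None, np.inf
    for lam in lams:
        for eps in epss:
            err = 0.0
            for fold in folds:
                train = np.setdiff1d(idx, fold)
                beta = fit_ssvr(X[train], y[train], lam, eps)
                err += np.mean(np.abs(y[fold] - X[fold] @ beta))
            if err < best_err:
                best, best_err = (lam, eps), err
    return best

def distributed_ssvr(X, y, n_machines, lams, epss):
    """Fit each data block separately, then average the local estimates (assumed rule)."""
    betas = []
    for Xk, yk in zip(np.array_split(X, n_machines), np.array_split(y, n_machines)):
        lam, eps = cv_select(Xk, yk, lams, epss)  # parameters chosen per block
        betas.append(fit_ssvr(Xk, yk, lam, eps))
    return np.mean(betas, axis=0)

# Synthetic linear model with an intercept column (illustrative data only).
rng = np.random.default_rng(1)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
beta_true = np.array([1.0, 2.0, -1.0])
y = X @ beta_true + 0.5 * rng.normal(size=n)

beta_hat = distributed_ssvr(X, y, n_machines=4, lams=[1e-4, 1e-2], epss=[0.1, 0.5])
print(beta_hat)
```

Each block is processed on its own, so no single step needs the full dataset in memory. Only the low-dimensional local estimates are combined at the end.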
Pages: 3365-3382
Number of pages: 18