Adaptive distributed support vector regression of massive data

被引:2
|
作者
Liang, Shu-na [1 ]
Sun, Fei [1 ]
Zhang, Qi [1 ]
机构
[1] Qingdao Univ, Sch Math & Stat, Qingdao, Peoples R China
关键词
Massive datasets; smoothing; support vector regression; distributed; QUANTILE REGRESSION; INFERENCE; ALGORITHM; MACHINES;
D O I
10.1080/03610926.2022.2153604
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Massive datasets bring new challenges to traditional statistical inference, particularly in terms of memory restriction and computation time. Support vector regression is a robust and efficient estimation method. We first adopt smoothing techniques to develop smoothed support vector regression (S-SVR) estimation method. Then we propose distributed S-SVR (DS-SVR) algorithm for massive datasets. The proposed method solves the problems of memory restriction and computation time, and the resulting estimate can achieve the same efficiency as the estimator computed on all data. We also establish the asymptotic normality of the resulting estimate. In addition, we propose an adaptive learning process of parameters by using a combination of grid search and k- fold cross-validation, in which the optimal parameters (lambda,epsilon) are automatically selected by each data. Finally, the performance of the proposed method is illustrated well by simulation studies.
引用
收藏
页码:3365 / 3382
页数:18
相关论文
共 50 条
  • [1] Parameter adaptive support vector regression for big data
    Cao W.
    Ni J.
    Jiang B.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (02): : 511 - 521
  • [2] The Support Vector Regression with Adaptive Norms
    Zhang, Chunhua
    Li, Dewei
    Tan, Junyan
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1730 - 1736
  • [3] Adaptive support vector machines for regression
    Palaniswami, M
    Shilton, A
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002, : 1043 - 1049
  • [4] Distributed quantile regression for massive heterogeneous data
    Hu, Aijun
    Jiao, Yuling
    Liu, Yanyan
    Shi, Yueyong
    Wu, Yuanshan
    NEUROCOMPUTING, 2021, 448 : 249 - 262
  • [5] Distributed Penalized Modal Regression for Massive Data
    Jin Jun
    Liu Shuangzhe
    Ma Tiefeng
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2023, 36 (02) : 798 - 821
  • [6] Distributed Penalized Modal Regression for Massive Data
    Jun Jin
    Shuangzhe Liu
    Tiefeng Ma
    Journal of Systems Science and Complexity, 2023, 36 : 798 - 821
  • [7] Robust distributed modal regression for massive data
    Wang, Kangning
    Li, Shaomin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2021, 160
  • [8] Distributed Penalized Modal Regression for Massive Data
    JIN Jun
    LIU Shuangzhe
    MA Tiefeng
    Journal of Systems Science & Complexity, 2023, 36 (02) : 798 - 821
  • [9] Projection support vector regression algorithms for data regression
    Peng, Xinjun
    Xu, Dong
    KNOWLEDGE-BASED SYSTEMS, 2016, 112 : 54 - 66
  • [10] Support vector machine in big data: smoothing strategy and adaptive distributed inference
    Wang, Kangning
    Liu, Jin
    Sun, Xiaofei
    STATISTICS AND COMPUTING, 2024, 34 (06)