Online Updating Algorithms of Statistical Methods for Big Data

被引:0
|
作者
Li, Yihao [1 ]
Wang, Jin [2 ,3 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Math, Wuhan, Hubei, Peoples R China
[2] Valdosta State Univ, Dept Math, Valdosta, GA USA
[3] Valdosta State Univ, Data Sci Lab, Valdosta, GA USA
关键词
Online Algorithm; Big Data; Linear Regression; Skewness; Kurtosis; Sample Moment;
D O I
10.1145/3366650.3366667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we discuss online updating algorithms for Big Data. One of the main challenges of Big Data is the limitation of data storage. In the Big Data stream environment, online computation sometimes requires fast updates without the use of historical data. The focus of this research is on efficient online update algorithms for basic statistical computations, including mean, variance, covariance, skewness, kurtosis, confidence interval, test statistic, and linear regression. We demonstrate the implementation of R Language through a linear regress example.
引用
收藏
页码:81 / 85
页数:5
相关论文
共 50 条
  • [1] Online Updating of Statistical Inference in the Big Data Setting
    Schifano, Elizabeth D.
    Wu, Jing
    Wang, Chun
    Yan, Jun
    Chen, Ming-Hui
    TECHNOMETRICS, 2016, 58 (03) : 393 - 403
  • [2] Statistical methods and computing for big data
    Wang, Chun
    Chen, Ming-Hui
    Schifano, Elizabeth
    Wu, Jing
    Yan, Jun
    STATISTICS AND ITS INTERFACE, 2016, 9 (04) : 399 - 414
  • [3] Online updating method with new variables for big data streams
    Wang, Chun
    Chen, Ming-Hui
    Wu, Jing
    Yan, Jun
    Zhang, Yuping
    Schifano, Elizabeth
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2018, 46 (01): : 123 - 146
  • [4] Online updating Huber robust regression for big data streams
    Tao, Chunbai
    Wang, Shanshan
    STATISTICS, 2024, 58 (05) : 1197 - 1223
  • [5] Online updating method to correct for measurement error in big data streams
    Lee, JooChul
    Wang, HaiYing
    Schifano, Elizabeth D.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 149
  • [6] Online Algorithms for Uploading Deferrable Big Data to The Cloud
    Zhang, Linquan
    Li, Zongpeng
    Wu, Chuan
    Chen, Minghua
    2014 PROCEEDINGS IEEE INFOCOM, 2014, : 2022 - 2030
  • [7] Online updating of information based model selection in the big data setting
    Xue, Yishu
    Hu, Guanyu
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (11) : 3516 - 3529
  • [8] Sampling-based Collection and Updating of Online Big Graph Data
    Yin Z.-D.
    Yue K.
    Zhang B.-B.
    Li J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3540 - 3558
  • [9] A Review of Anonymization Algorithms and Methods in Big Data
    Shamsinejad E.
    Banirostam T.
    Pedram M.M.
    Rahmani A.M.
    Annals of Data Science, 2025, 12 (1) : 253 - 279
  • [10] Online learning algorithms for big data analytics: A survey
    Li, Zhijie
    Li, Yuanxiang
    Wang, Feng
    He, Guoliang
    Kuang, Li
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (08): : 1707 - 1721