Spatial Rank-Based Augmentation for Nonparametric Online Monitoring and Adaptive Sampling of Big Data Streams

被引:6
|
作者
Zan, Xin [1 ]
Wang, Di [2 ]
Xian, Xiaochen [1 ]
机构
[1] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
[2] Shanghai Jiao Tong Univ, Sch Mech Engn, Dept Ind Engn & Management, Shanghai, Peoples R China
基金
美国国家科学基金会; 上海市自然科学基金; 中国国家自然科学基金;
关键词
Data augmentation; Distribution-free; Internet of Things (IoT); Partial observations; Statistical process control (SPC); CONTROL CHARTS; MEAN VECTOR; THINGS IOT; INTERNET;
D O I
10.1080/00401706.2022.2143903
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The age of Internet of Things (IoT) has witnessed the rapid development of modern data acquisition devices and communicating-actuating networks, which enables the generation of big data streams shared across platforms for remote and efficient decision making of many critical systems. The monitoring of big data streams remains a challenging task in various practical applications mainly due to their complexity in interrelationships, large volume, and high velocity, which places prohibitive demands on monitoring methodologies and resources. To tackle the challenges of monitoring unexchangeable and correlated big data streams with only partial observations available under resource constraints, we propose a method by incorporating spatial rank-based statistics with effective data augmentation techniques for the online unobservable data streams that can analytically inform the monitoring and sampling decisions based only on partially observed data streams. By exploiting historical data, the proposed method preserves strong descriptive power of general big data streams under partial observations and can explicitly use the correlation among data streams, and thus allows effective monitoring and equitable sampling over general heterogeneous and correlated big data streams, which is free of simplified assumptions (e.g., exchangeability) compared to existing methods. Theoretical investigations are carried out to evaluate the effectiveness of the augmentation statistics as well as the sampling strategy, which guarantee the superiority of the sampling performance over existing methods. Simulations under various scenarios and two real case studies are also conducted to evaluate and validate the performance of the proposed method.
引用
收藏
页码:243 / 256
页数:14
相关论文
共 50 条
  • [21] Adaptive network diagram constructions for representing big data event streams on monitoring dashboards
    Mantzaris, Alexander V.
    Walker, Thomas G.
    Taylor, Cameron E.
    Ehling, Dustin
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [22] Adaptive network diagram constructions for representing big data event streams on monitoring dashboards
    Alexander V. Mantzaris
    Thomas G. Walker
    Cameron E. Taylor
    Dustin Ehling
    Journal of Big Data, 6
  • [23] Sampling-based Collection and Updating of Online Big Graph Data
    Yin Z.-D.
    Yue K.
    Zhang B.-B.
    Li J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3540 - 3558
  • [24] Kernel-based Low-rank Feature Extraction on a Budget for Big Data Streams
    Sheikholeslami, Fatemeh
    Berberidis, Dimitris
    Giannakis, Georgios B.
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 928 - 932
  • [25] An Adaptive Sampling Strategy for Online Monitoring and Diagnosis of High-Dimensional Streaming Data
    Gomez, Ana Maria Estrada
    Li, Dan
    Paynabar, Kamran
    TECHNOMETRICS, 2022, 64 (02) : 253 - 269
  • [26] Current Transformer Condition Online Monitoring Platform Based on Big Data
    Wang, Dan
    Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [27] Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
    Yan, Yan
    Zhang, Lianxiu
    Sheng, Quan Z.
    Wang, Bingqian
    Gao, Xin
    Cong, Yiming
    IEEE ACCESS, 2019, 7 : 164962 - 164974
  • [28] Big Data Transfer Optimization Based on Offline Knowledge Discovery and Adaptive Sampling
    Nine, Md S. Q. Zulkar
    Guner, Kemal
    Huang, Ziyun
    Wang, Xiangyu
    Xu, Jinhui
    Kosar, Tevfik
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 465 - 472
  • [29] Novel Online Censoring Based Learning Algorithm For Complex-Valued Big Data Streams
    Guvenc, Buket Colak
    Eren, Yusuf
    Menguc, Engin Cemal
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [30] Adaptive Ensembles for Evolving Data Streams - Combining Block-Based and Online Solutions
    Stefanowski, Jerzy
    NEW FRONTIERS IN MINING COMPLEX PATTERNS, 2016, 9607 : 3 - 16