A spatial-adaptive sampling procedure for online monitoring of big data streams

被引:34
|
作者
Wang, Andi [1 ]
Xian, Xiaochen [2 ]
Tsung, Fugee [3 ]
Liu, Kaibo [2 ]
机构
[1] Georgia Inst Technol, Dept Ind & Syst Engn, Atlanta, GA 30332 USA
[2] Univ Wisconsin, Dept Ind & Syst Engn, Madison, WI 53706 USA
[3] Hong Kong Univ Sci & Technol, Dept Ind Engn & Decis Anal, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
big data streams; cumulative-sum statistics; high-dimensional and high-frequency data; partial information; scalable monitoring schemes; statistical process control; STRATEGY;
D O I
10.1080/00224065.2018.1507560
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
With the improvement of data-acquisition technology, big data streams that involve continuous observations with high dimensionality and large volume frequently appear in modern applications, which poses significant challenges for statistical process control. In this article we consider the problem of online monitoring a class of big data streams where each data stream is associated with a spatial location. Our goal is to quickly detect shifts occurring in such big data streams when only partial information can be observed at each time and the out-of-control variables are clustered in a small and unknown region. To achieve this goal, we propose a novel spatial-adaptive sampling and monitoring (SASAM) procedure that aims to leverage the spatial information of the data streams for quick change detection. Specifically, the proposed sampling strategy will adaptively and intelligently integrate two seemingly contradictory ideas: (1) random sampling that quickly searches for possible out-of-control variables; and (2) directional sampling that focuses on highly suspicious out-of-control variables that may duster in a small region. Simulation and real case studies show that the proposed method significantly outperforms the existing sampling strategy without taking the spatial information of the data streams into consideration.
引用
收藏
页码:329 / 343
页数:15
相关论文
共 50 条
  • [1] A Nonparametric Adaptive Sampling Strategy for Online Monitoring of Big Data Streams
    Xian, Xiaochen
    Wang, Andi
    Liu, Kaibo
    [J]. 2017 13TH IEEE CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2017, : 844 - 846
  • [2] A Nonparametric Adaptive Sampling Strategy for Online Monitoring of Big Data Streams
    Xian, Xiaochen
    Wang, Andi
    Liu, Kaibo
    [J]. TECHNOMETRICS, 2018, 60 (01) : 14 - 25
  • [3] Spatial Rank-Based Augmentation for Nonparametric Online Monitoring and Adaptive Sampling of Big Data Streams
    Zan, Xin
    Wang, Di
    Xian, Xiaochen
    [J]. TECHNOMETRICS, 2023, 65 (02) : 243 - 256
  • [4] Online monitoring of big data streams: A rank-based sampling algorithm by data augmentation
    Xian, Xiaochen
    Zhang, Chen
    Bonk, Scott
    Liu, Kaibo
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 2021, 53 (02) : 135 - 153
  • [5] An adaptive approach for online monitoring of large-scale data streams
    Cao, Shuchen
    Zhang, Ruizhi
    [J]. IISE TRANSACTIONS, 2023,
  • [6] An adaptive approach for online monitoring of large-scale data streams
    Cao, Shuchen
    Zhang, Ruizhi
    [J]. IISE Transactions, 2023,
  • [7] H-tree: Hierarchy index for online monitoring of big data streams
    Research Center of Information Security, Institute of Computing Technology, Chinese Academy of Sciences, Beijing
    100190, China
    不详
    100093, China
    不详
    100876, China
    [J]. Jisuanji Xuebao, 1 (35-44):
  • [8] Online nonparametric monitoring of heterogeneous data streams with partial observations based on Thompson sampling
    Ye, Honghan
    Xian, Xiaochen
    Cheng, Jing-Ru C.
    Hable, Brock
    Shannon, Robert W.
    Elyaderani, Mojtaba Kadkhodaie
    Liu, Kaibo
    [J]. IISE TRANSACTIONS, 2023, 55 (04) : 392 - 404
  • [9] A two-stage online monitoring procedure for high-dimensional data streams
    Li, Jun
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 2019, 51 (04) : 392 - 406
  • [10] Adaptive network diagram constructions for representing big data event streams on monitoring dashboards
    Mantzaris, Alexander V.
    Walker, Thomas G.
    Taylor, Cameron E.
    Ehling, Dustin
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)