Spatial Rank-Based Augmentation for Nonparametric Online Monitoring and Adaptive Sampling of Big Data Streams

Cited by: 6
Authors
Zan, Xin [1]
Wang, Di [2]
Xian, Xiaochen [1]
Affiliations
[1] Univ Florida, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
[2] Shanghai Jiao Tong Univ, Sch Mech Engn, Dept Ind Engn & Management, Shanghai, Peoples R China
Funding
U.S. National Science Foundation; Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
Data augmentation; Distribution-free; Internet of Things (IoT); Partial observations; Statistical process control (SPC); CONTROL CHARTS; MEAN VECTOR; THINGS IOT; INTERNET
DOI
10.1080/00401706.2022.2143903
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
The age of Internet of Things (IoT) has witnessed the rapid development of modern data acquisition devices and communicating-actuating networks, which enables the generation of big data streams shared across platforms for remote and efficient decision making of many critical systems. The monitoring of big data streams remains a challenging task in various practical applications mainly due to their complexity in interrelationships, large volume, and high velocity, which places prohibitive demands on monitoring methodologies and resources. To tackle the challenges of monitoring unexchangeable and correlated big data streams with only partial observations available under resource constraints, we propose a method by incorporating spatial rank-based statistics with effective data augmentation techniques for the online unobservable data streams that can analytically inform the monitoring and sampling decisions based only on partially observed data streams. By exploiting historical data, the proposed method preserves strong descriptive power of general big data streams under partial observations and can explicitly use the correlation among data streams, and thus allows effective monitoring and equitable sampling over general heterogeneous and correlated big data streams, which is free of simplified assumptions (e.g., exchangeability) compared to existing methods. Theoretical investigations are carried out to evaluate the effectiveness of the augmentation statistics as well as the sampling strategy, which guarantee the superiority of the sampling performance over existing methods. Simulations under various scenarios and two real case studies are also conducted to evaluate and validate the performance of the proposed method.
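The abstract refers to spatial rank-based statistics without defining them. As a rough illustration only, the sketch below computes the standard empirical spatial rank of a multivariate observation relative to a historical sample: the average of the unit vectors pointing from each historical point to the observation. The function name `spatial_rank` and the toy data are assumptions made for this example; this is not the paper's augmentation or sampling procedure, which additionally handles partially observed streams.

```python
import numpy as np


def spatial_rank(x, reference):
    """Empirical spatial rank of x relative to the rows of `reference`.

    Hypothetical helper for illustration: averages the unit vectors
    (x - X_i) / ||x - X_i|| over all n reference points X_i.
    """
    x = np.asarray(x, dtype=float)
    reference = np.asarray(reference, dtype=float)
    diffs = x - reference                    # shape (n, p)
    norms = np.linalg.norm(diffs, axis=1)    # Euclidean distance to each point
    nonzero = norms > 0                      # exact ties have no direction; they contribute zero
    units = diffs[nonzero] / norms[nonzero, None]
    return units.sum(axis=0) / len(reference)


# Toy historical (in-control) sample, symmetric about the origin.
historical = np.array([[-1.0, 0.0], [1.0, 0.0], [0.0, -1.0], [0.0, 1.0]])

central = spatial_rank([0.0, 0.0], historical)    # observation at the center
outlying = spatial_rank([10.0, 0.0], historical)  # observation far from the sample
```

For an observation at the center of the historical sample the rank vector is zero, while for an observation far outside it the rank has length close to (but never above) 1 and points in the direction of the shift. Because only directions are used, not magnitudes, statistics built on spatial ranks do not require a parametric distributional assumption.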
Pages: 243-256
Page count: 14