Efficient Transmission and Reconstruction of Dependent Data Streams via Edge Sampling

被引:2
|
作者
Wolfrath, Joel [1 ]
Chandra, Abhishek [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
关键词
Stream processing; edge computing; big data; approximate computing;
D O I
10.1109/IC2E55432.2022.00013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream processing is an increasingly important topic due to the prevalence of smart devices and the demand for real-time analytics. Geo-distributed streaming systems, where cloud-based queries utilize data streams from multiple distributed devices, face challenges since wide-area network (WAN) bandwidth is often scarce or expensive. Edge computing allows us to address these bandwidth costs by utilizing resources close to the devices, e.g. to perform sampling over the incoming data streams, which trades downstream query accuracy to reduce the overall transmission cost. In this paper, we leverage the fact that correlations between data streams may exist across devices located in the same geographical region. Using this insight, we develop a hybrid edge-cloud system which systematically trades off between sampling at the edge and estimation of missing values in the cloud to reduce traffic over the WAN. We present an optimization framework which computes sample sizes at the edge and systematically bounds the number of samples we can estimate in the cloud given the strength of the correlation between streams. Our evaluation with three real-world datasets shows that compared to existing sampling techniques, our system could provide comparable error rates over multiple aggregate queries while reducing WAN traffic by 27-42%.
引用
收藏
页码:47 / 57
页数:11
相关论文
共 50 条
  • [1] Efficient reservoir sampling for transactional data streams
    Dash, Manoranjan
    Ng, Willie
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 662 - +
  • [2] Efficient sampling of non-strict turnstile data streams
    Barkay, Neta
    Porat, Ely
    Shalem, Bar
    THEORETICAL COMPUTER SCIENCE, 2015, 590 : 106 - 117
  • [3] Secure Transmission of Compressed Sampling Data Using Edge Clouds
    Zhang, Yushu
    Wang, Ping
    Fang, Liming
    He, Xing
    Han, Hao
    Chen, Bing
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (10) : 6641 - 6651
  • [4] SAMPLING OF RANDOM DATA STREAMS
    Cepciansky, Gustav
    Schwartz, Ladislav
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2011, 9 (01) : 1 - 6
  • [5] MR DATA ACQUISITION AND RECONSTRUCTION USING EFFICIENT SAMPLING SCHEMES
    EHRHARDT, JC
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 1990, 9 (03) : 305 - 309
  • [6] Artifacts and sampling requirement in transmission CT reconstruction with truncated projection data
    Gregoriou, GK
    Tsui, BMW
    Frey, EC
    Lalush, DS
    1995 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE RECORD, VOLS 1-3, 1996, : 1336 - 1340
  • [7] Reservoir Pattern Sampling in Data Streams
    Giacometti, Arnaud
    Soulet, Arnaud
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 337 - 352
  • [8] Sampling in dynamic data streams and applications
    Frahling, Gereon
    Indyk, Piotr
    Sohler, Christian
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2008, 18 (1-2) : 3 - 28
  • [9] Experiential sampling on multiple data streams
    Kankanhalli, Mohan S.
    Wang, Jun
    Jain, Ramesh
    IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (05) : 947 - 955
  • [10] Efficient solution of boundary-value problems for image reconstruction via sampling
    Fox, C
    Nicholls, G
    Palm, M
    JOURNAL OF ELECTRONIC IMAGING, 2000, 9 (03) : 251 - 259