Adaptive Partitioning and Order-Preserved Merging of Data Streams

被引:0
|
作者
Pohl, Constantin [1 ]
Sattler, Kai-Uwe [1 ]
机构
[1] TU Ilmenau, Databases & Informat Syst Grp, Ilmenau, Germany
关键词
Adaptive partitioning; Order preservation; Stream processing; Parallelism; Many-core; Xeon Phi;
D O I
10.1007/978-3-030-28730-6_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partitioning is a key concept for utilizing modern hardware, especially to exploit parallelism opportunities from many-core CPUs. In data streaming scenarios where parameters like tuple arrival rates can vary, adaptive strategies for partitioning solve the problem of overestimating or underestimating query workloads. While there are many possibilities to partition the data flow, threads running partitions independently from each other lead to unordered output inevitably. This is a considerable difficulty for applications where tuple order matters, like in stream reasoning or complex event processing scenarios. In this paper, we address this problem by combining an adaptive partitioning approach with an order-preserving merge algorithm. Since reordering output tuples can only worsen latency, we mainly focus on the throughput of queries while keeping the delay on individual tuples minimal. We run micro-benchmarks as well as the Linear Road benchmark, demonstrating correctness and effectiveness of our approach while scaling out on a single Xeon Phi many-core CPU up to 256 partitions.
引用
收藏
页码:267 / 282
页数:16
相关论文
共 50 条
  • [1] Adaptive spatial partitioning for multidimensional data streams
    Hershberger, J
    Shrivastava, N
    Suri, S
    Tóth, CD
    [J]. ALGORITHMS AND COMPUTATION, 2004, 3341 : 522 - 533
  • [2] Adaptive spatial partitioning for multidimensional data streams
    Hershberger, John
    Shrivastava, Nisheeth
    Suri, Subhash
    Toth, Csaba D.
    [J]. ALGORITHMICA, 2006, 46 (01) : 97 - 117
  • [3] Adaptive Spatial Partitioning for Multidimensional Data Streams
    John Hershberger
    Nisheeth Shrivastava
    Subhash Suri
    Csaba D. Toth
    [J]. Algorithmica, 2006, 46 : 97 - 117
  • [4] Order-preserved Tensor Completion For Accurate Network-wide Monitoring
    Li, Xiaocan
    Xie, Kun
    Wang, Xin
    Xie, Gaogang
    Li, Kenli
    Zhang, Dafang
    Wen, Jigang
    [J]. 2022 IEEE/ACM 30TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2022,
  • [5] Order-Preserved Preset-Time Cooperative Control:A Monotone System-Based Approach
    Boda Ning
    Qing-Long Han
    [J]. IEEE/CAA Journal of Automatica Sinica, 2022, 9 (09) : 1603 - 1611
  • [6] Order-Preserved Preset-Time Cooperative Control: A Monotone System-Based Approach
    Ning, Boda
    Han, Qing-Long
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (09) : 1603 - 1611
  • [7] A statistical μ-partitioning method for clustering data streams
    Park, NH
    Lee, WS
    [J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, 2003, 2869 : 292 - 299
  • [8] Dalton: Learned Partitioning for Distributed Data Streams
    Zapridou, Eleni
    Mytilinis, Ioannis
    Ailamaki, Anastasia
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 16 (03): : 491 - 504
  • [9] Interestingness Measure on Privacy Preserved Data with Horizontal Partitioning
    KumaraSwamy, S.
    Manjula, S. H.
    Venugopal, K. R.
    Patnaik, L. M.
    [J]. 2014 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2014,
  • [10] Partitioning for Scalable Complex Event Processing on Data Streams
    Saleh, Omran
    Betz, Heiko
    Sattler, Kai-Uwe
    [J]. NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS II, 2015, 312 : 185 - 197