A memory-optimal many-to-many semi-stream join

被引:6
|
作者
Naeem, M. Asif [1 ]
Weber, Gerald [2 ]
Lutteroth, Christof [3 ]
机构
[1] Auckland Univ Technol, Sch Engn Comp & Math Sci, Private Bag 92006, Auckland, New Zealand
[2] Univ Auckland, Dept Comp Sci, Private Bag 92019, Auckland, New Zealand
[3] Univ Bath, Dept Comp Sci, Bath BA2 7AY, Avon, England
基金
英国工程与自然科学研究理事会;
关键词
Many-to-many semi-stream join; Cache optimization; Performance evaluation; ARCHITECTURE;
D O I
10.1007/s10619-018-7247-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-stream join algorithms join a fast stream input with a disk-based master data relation. A common class of these algorithms is derived from hash joins: they use the stream as build input for a main hash table, and also include a cache for frequent master data. The composition of the cache is very important for performance; however, the decision of which master data to cache has so far been solely based on heuristics. We present the first formal criterion, a cache inequality that leads to a provably optimal composition of the cache in a semi-stream many-to-many equijoin algorithm. We propose a novel algorithm, Semi-Stream Balanced Join (SSBJ), which exploits this cache inequality to achieve a given service rate with a provably minimal amount of memory for all stream distributions. We present a cost model for SSBJ and compare its service rate empirically and analytically with other related approaches.
引用
收藏
页码:623 / 649
页数:27
相关论文
共 33 条
  • [1] A memory-optimal many-to-many semi-stream join
    M. Asif Naeem
    Gerald Weber
    Christof Lutteroth
    [J]. Distributed and Parallel Databases, 2019, 37 : 623 - 649
  • [2] SEARCH MEMORY FOR MANY-TO-MANY COMPARISONS
    DIGBY, DW
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1973, C-22 (08) : 768 - 772
  • [3] Multimodule associative memory for many-to-many associations
    Hattori, M
    Hagiwara, M
    [J]. NEUROCOMPUTING, 1998, 19 (1-3) : 99 - 119
  • [4] Semi-Stream Similarity Join Processing in a Distributed Environment
    Kim, Hong-Ji
    Lee, Ki-Hoon
    [J]. IEEE ACCESS, 2020, 8 : 130194 - 130204
  • [5] Optimal partition of QoS requirements for many-to-many connections
    Lorenz, DH
    Orda, A
    Raz, D
    [J]. IEEE INFOCOM 2003: THE CONFERENCE ON COMPUTER COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2003, : 1670 - 1679
  • [6] Pareto Optimal Matchings in Many-to-Many Markets with Ties
    Cechlarova, Katarina
    Eirinakis, Pavlos
    Fleiner, Tamas
    Magos, Dimitrios
    Manlove, David
    Mourtos, Ioannis
    Ocelakova, Eva
    Rastegari, Baharak
    [J]. THEORY OF COMPUTING SYSTEMS, 2016, 59 (04) : 700 - 721
  • [7] Pareto Optimal Matchings in Many-to-Many Markets with Ties
    Cechlarova, Katarina
    Eirinakis, Pavlos
    Fleiner, Tamas
    Magos, Dimitrios
    Manlove, David F.
    Mourtos, Ioannis
    Ocelakova, Eva
    Rastegari, Baharak
    [J]. ALGORITHMIC GAME THEORY, SAGT 2015, 2015, 9347 : 27 - 39
  • [8] Pareto Optimal Matchings in Many-to-Many Markets with Ties
    Katarína Cechlárová
    Pavlos Eirinakis
    Tamás Fleiner
    Dimitrios Magos
    David Manlove
    Ioannis Mourtos
    Eva Ocel̆áková
    Baharak Rastegari
    [J]. Theory of Computing Systems, 2016, 59 : 700 - 721
  • [9] Semi-supervised Many-to-many Music Timbre Transfer
    Chang, Yu-Chen
    Chen, Wen-Cheng
    Hu, Min-Chun
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 442 - 446
  • [10] A Cache-Based Semi-Stream Join to deal with Unmatched Stream Data
    Naeem, M. Asif
    Bajwa, Imran Sarwar
    Jamil, Noreen
    [J]. DATABASES THEORY AND APPLICATIONS, 2015, 9093 : 54 - 65