A distributed B+Tree indexing method for processing range queries over streaming data

Cited by: 0
Authors
Shahab Safaee
Meghdad Mirabi
Amir Masoud Rahmani
Ali Asghar Safaei
Affiliations
[1] Islamic Azad University, Department of Computer Engineering, Faculty of Engineering, South Tehran Branch
[2] National Yunlin University of Science and Technology, Future Technology Research Center
[3] Tarbiat Modares University, Department of Medical Informatics, Faculty of Medical Sciences
Source
Cluster Computing | 2024 / Vol. 27
Keywords
B+Tree index; Distributed query processing; Map-Reduce model; Range query; Streaming data;
DOI
Not available
Abstract
A data stream is a massive, unbounded sequence of data elements generated continuously at a high rate. Stream databases raise new challenges for query processing, both because streaming data changes constantly over time and because users submit a wider range of queries than in traditional databases. In this paper, we propose a system architecture with components for distributed indexing of streaming data and for distributed processing of range queries over that data. Instead of building one large, centralized B+Tree index, we build a set of small B+Tree indexes, one for each partition of the streaming data. We also design a distributed range search algorithm that each machine in a Spark cluster can run independently to process range queries on each partition of the streaming data. With this architecture, both indexing and querying of streaming data can be performed in a distributed and parallel manner. Through several experiments, we demonstrate that the proposed indexing method is scalable and efficient for processing range queries on streaming data compared to existing centralized B+Tree indexing methods. It can therefore be used for applications involving data streams with a large volume of data elements and a large number of range queries.
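The per-partition idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: a sorted key list searched with `bisect` stands in for each small B+Tree, a thread pool stands in for the Spark cluster, and round-robin partitioning is an assumption. The names `PartitionIndex`, `build_indexes`, and `distributed_range_query` are hypothetical.

```python
from bisect import bisect_left, bisect_right
from concurrent.futures import ThreadPoolExecutor


class PartitionIndex:
    """Sorted-key index for one partition (stand-in for a small B+Tree)."""

    def __init__(self, records):
        # records: iterable of (key, value) pairs; sort once by key
        pairs = sorted(records, key=lambda kv: kv[0])
        self.keys = [k for k, _ in pairs]
        self.values = [v for _, v in pairs]

    def range_search(self, low, high):
        """Return (key, value) pairs with low <= key <= high."""
        start = bisect_left(self.keys, low)
        end = bisect_right(self.keys, high)
        return list(zip(self.keys[start:end], self.values[start:end]))


def build_indexes(stream, num_partitions):
    """Assign records round-robin to partitions (an assumption) and index each."""
    partitions = [[] for _ in range(num_partitions)]
    for i, record in enumerate(stream):
        partitions[i % num_partitions].append(record)
    return [PartitionIndex(p) for p in partitions]


def distributed_range_query(indexes, low, high):
    """Query every partition index independently, then merge results by key."""
    with ThreadPoolExecutor() as pool:
        partial = list(pool.map(lambda idx: idx.range_search(low, high), indexes))
    merged = [pair for part in partial for pair in part]
    return sorted(merged, key=lambda kv: kv[0])


stream = [(k, f"v{k}") for k in [5, 1, 9, 3, 7, 2, 8]]
indexes = build_indexes(stream, num_partitions=3)
result = distributed_range_query(indexes, 3, 8)
```

Each partition answers the query using only its own index, so no partition needs a global view of the data; the final merge is the only step that combines partial results.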
Pages: 1251-1274
Page count: 23