A distributed B+Tree indexing method for processing range queries over streaming data

Authors
Shahab Safaee
Meghdad Mirabi
Amir Masoud Rahmani
Ali Asghar Safaei
Affiliations
[1] Islamic Azad University, Department of Computer Engineering, Faculty of Engineering, South Tehran Branch
[2] National Yunlin University of Science and Technology, Future Technology Research Center
[3] Tarbiat Modares University, Department of Medical Informatics, Faculty of Medical Sciences
Source
Cluster Computing | 2024 / Vol. 27
Keywords
B+Tree index; Distributed query processing; Map-Reduce model; Range query; Streaming data
Abstract
A data stream is a massive, unbounded sequence of data elements generated continuously at a high rate. Stream databases raise new challenges for query processing, both because streaming data changes constantly over time and because users submit a wider range of queries than in traditional databases. In this paper, we propose a system architecture that includes components for distributed indexing of streaming data and for distributed processing of range queries over streaming data. Instead of creating one large, centralized B+Tree index structure, we create a set of small B+Tree indexes, one for every partition of the streaming data. We also design a distributed range search algorithm that each machine in a Spark cluster can use to process range queries independently on each partition of the streaming data. With the proposed architecture, both the indexing of streaming data and the querying over it can be performed in a distributed and parallel manner. Through several experiments, we demonstrate that the proposed indexing method is scalable and efficient for processing range queries on streaming data compared with existing centralized B+Tree indexing methods; it can therefore be used in applications involving data streams with a large volume of data elements and a large number of range queries.
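The idea in the abstract (one small index per partition, with each partition answering a range query on its own before the partial results are combined) can be illustrated with a minimal Python sketch. This is not the authors' implementation: a sorted key list searched with `bisect` stands in for each small B+Tree, a thread pool stands in for the machines of a Spark cluster, and the names `PartitionIndex`, `partition_stream`, and `distributed_range_query`, along with the round-robin partitioning, are hypothetical choices made only for this example.

```python
from bisect import bisect_left, bisect_right
from concurrent.futures import ThreadPoolExecutor


class PartitionIndex:
    """Sorted-key index over one partition (a simple stand-in for a small B+Tree)."""

    def __init__(self, records):
        # records: iterable of (key, value) pairs belonging to one partition
        self._items = sorted(records, key=lambda kv: kv[0])
        self._keys = [key for key, _ in self._items]

    def range_search(self, low, high):
        """Return all (key, value) pairs with low <= key <= high, in key order."""
        start = bisect_left(self._keys, low)
        end = bisect_right(self._keys, high)
        return self._items[start:end]


def partition_stream(records, num_partitions):
    """Assign records to partitions round-robin (an assumed partitioning scheme)."""
    partitions = [[] for _ in range(num_partitions)]
    for i, record in enumerate(records):
        partitions[i % num_partitions].append(record)
    return partitions


def distributed_range_query(indexes, low, high):
    """Query every partition index independently, then merge the partial results."""
    with ThreadPoolExecutor(max_workers=max(1, len(indexes))) as pool:
        partials = list(pool.map(lambda idx: idx.range_search(low, high), indexes))
    merged = [record for partial in partials for record in partial]
    return sorted(merged, key=lambda kv: kv[0])


# Hypothetical batch of streamed (key, value) elements.
stream = [(k, f"event-{k}") for k in (42, 7, 19, 3, 88, 61, 25, 14)]
indexes = [PartitionIndex(p) for p in partition_stream(stream, 3)]
result = distributed_range_query(indexes, 10, 50)
print([key for key, _ in result])
```

Because each `PartitionIndex` is built and searched without reference to the others, both steps can run in parallel across partitions, which is the property the abstract attributes to the proposed architecture.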
Pages: 1251-1274
Number of pages: 23