PA-LBF: Prefix-Based and Adaptive Learned Bloom Filter for Spatial Data

被引:1
|
作者
Zeng, Meng [1 ,2 ]
Zou, Beiji [1 ,2 ]
Kui, Xiaoyan [1 ]
Zhu, Chengzhang [2 ,3 ]
Xiao, Ling [1 ,2 ]
Chen, Zhi [1 ,2 ]
Du, Jingyu [4 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[2] Mobile Hlth Minist Educ China Mobile Joint Lab, Changsha 410083, Peoples R China
[3] Cent South Univ, Coll Literature & Journalism, Changsha 410083, Peoples R China
[4] Shenzhen Polytech, Sch Artificial Intelligence, Shenzhen 518055, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Adaptive filters - Data structures - Learning systems - Query processing;
D O I
10.1155/2023/4970776
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recently proposed learned bloom filter (LBF) opens a new perspective on how to reconstruct bloom filters with machine learning. However, the LBF has a massive time cost and does not apply to multidimensional spatial data. In this paper, we propose a prefix-based and adaptive learned bloom filter (PA-LBF) for spatial data, which efficiently supports the insertion and deletion. The proposed PA-LBF is divided into three parts: (1) the prefix-based classification. The Z-order space-filling curve is used to extract data, prefix it, and classify it. (2) The adaptive learning process. The multiple independent adaptive sub-LBFs are designed to train the suffixes of data, combined with part 1, to reduce the false positive rate (FPR), query, and learning process time consumption. (3) The backup filter uses CBF. Two kinds of backup CBF are constructed to meet the situation of different insertion and deletion frequencies. Experimental results prove the validity of the theory and show that the PA-LBF reduces the FPR by 84.87%, 79.53%, and 43.01% with the same memory usage compared with the LBF on three real-world spatial datasets. Moreover, the time consumption of PA-LBF can be reduced to 5x and 2.05x that of the LBF on the query and learning process, respectively.
引用
下载
收藏
页数:19
相关论文
共 50 条
  • [31] Bloom-filter-based request node collaboration caching for named data networking
    Rui Hou
    Lang Zhang
    Tingting Wu
    Tengyue Mao
    Jiangtao Luo
    Cluster Computing, 2019, 22 : 6681 - 6692
  • [32] Bloom Filter Approach for Autonomous Data Acquisition in the Edge-Based MCS Scenario
    Antonic, Martina
    Antonic, Aleksandar
    Zarko, Ivana Podnar
    SENSORS, 2022, 22 (03)
  • [33] Bloom-filter-based request node collaboration caching for named data networking
    Hou, Rui
    Zhang, Lang
    Wu, Tingting
    Mao, Tengyue
    Luo, Jiangtao
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S6681 - S6692
  • [35] An Adaptive Spatial Target Tracking Method Based on Unscented Kalman Filter
    Rong, Dandi
    Wang, Yi
    Sensors, 2024, 24 (18)
  • [36] An Adaptive Segmentation Algorithm for Spatial Data based on CNN
    Wei H.
    Du Y.
    Zhang J.
    Sun L.
    Journal of Geo-Information Science, 2022, 24 (06) : 1099 - 1106
  • [38] A speckle filter based on spatial texture analysis of SAR data
    D'Hondt, O
    Ferro-Famil, L
    Pottier, E
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 4283 - 4286
  • [39] Learned Image Compression with Adaptive Channel and Window-Based Spatial Entropy Models
    Wang, Jian
    Ling, Qiang
    IEEE Transactions on Consumer Electronics, 2024, 70 (04): : 6430 - 6441
  • [40] A Bloom Filter Based Scalable Data Integrity Check Tool for Large-scale Dataset
    Xiong, Sisi
    Wang, Feiyi
    Cao, Qing
    PROCEEDINGS OF PDSW-DISCS 2016 - 1ST JOINT INTERNATIONAL WORKSHOP ON PARALLEL DATA STORAGE AND DATA INTENSIVE SCALABLE COMPUTING SYSTEMS, 2016, : 55 - 60