Learned index for non-key queries

Authors
Zhu, Rui [1]
Wang, Hongzhi [1]
Xia, Sheng [1]
Zheng, Bo [2]
Affiliations
[1] Harbin Inst Technol, Comp Sci & Technol, 92 Xidazhi St, Harbin 150000, Heilongjiang, Peoples R China
[2] CnosDB, Beijing 100000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Bloom filter; Learned index; Non-key query; Index;
DOI
10.1007/s10115-024-02233-0
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Learned indexes have attracted considerable interest recently due to their superior performance over conventional indexes. Under heavy data traffic, a learned index also mitigates the large memory usage of a standard index. In this paper, we concentrate on a well-known learned index, the recursive model index (RMI). Because the machine learning model does not distinguish keys from non-keys, it computes a position for every queried non-key as if it were an existing key, so a workload with many non-key queries wastes considerable time on unnecessary computation. To address this, we propose HBFdex, a hierarchical learned index structure based on Bloom filters (BFs). HBFdex prunes non-keys effectively: most non-key queries return at a BF layer before they reach the machine learning model. By reducing both the number of layers a non-key traverses and the time spent searching for it within the error bound provided by the machine learning model, HBFdex decreases the average query time of the learned index. We compare HBFdex with B-Tree and RMI, and the results show that the new structure improves the performance of RMI on non-key queries.
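The pruning idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' HBFdex: it assumes a single Bloom filter placed in front of a single linear model (rather than a hierarchy over an RMI), and the names `BloomFilter`, `FilteredLinearIndex`, `bits_per_key`, and `num_hashes` are hypothetical, chosen only for illustration.

```python
import bisect
import hashlib


class BloomFilter:
    """Plain Bloom filter: may report false positives, never false negatives."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by salting one hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(f"{seed}:{key}".encode(), digest_size=8).digest()
            yield int.from_bytes(digest, "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key))


class FilteredLinearIndex:
    """Illustrative only: one Bloom filter in front of one linear model."""

    def __init__(self, keys, bits_per_key=10, num_hashes=7):
        self.keys = sorted(set(keys))
        if not self.keys:
            raise ValueError("keys must be non-empty")
        n = len(self.keys)
        self.filter = BloomFilter(max(8, bits_per_key * n), num_hashes)
        for k in self.keys:
            self.filter.add(k)
        # Assumed model: a straight line through the first and last key.
        first, last = self.keys[0], self.keys[-1]
        self.slope = (n - 1) / (last - first) if last != first else 0.0
        self.intercept = -self.slope * first
        # Largest prediction error over the stored keys bounds the search window.
        self.max_error = max(abs(self._predict(k) - i) for i, k in enumerate(self.keys))

    def _predict(self, key):
        pos = int(self.slope * key + self.intercept)
        return min(max(pos, 0), len(self.keys) - 1)

    def lookup(self, key):
        # Most non-keys return here, before any model evaluation.
        if not self.filter.might_contain(key):
            return None
        guess = self._predict(key)
        lo = max(0, guess - self.max_error)
        hi = min(len(self.keys), guess + self.max_error + 1)
        # Binary search restricted to the model's error window.
        i = bisect.bisect_left(self.keys, key, lo, hi)
        if i < hi and self.keys[i] == key:
            return i
        return None  # Bloom filter false positive
```

With `idx = FilteredLinearIndex([3, 8, 15, 16, 23, 42, 99])`, `idx.lookup(42)` returns the position of 42 in the sorted key list, while a non-key such as `idx.lookup(4)` returns `None`, usually at the filter and otherwise after the bounded search.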
Pages: 497-519
Page count: 23