Enabling high-dimensional range queries using kNN indexing techniques: approaches and empirical results

被引:0
|
作者
Tim Wylie
Michael A. Schuh
Rafal A. Angryk
机构
[1] University of Texas - Rio Grande Valley,
[2] Montana State University,undefined
[3] Georgia State University,undefined
来源
关键词
Indexing; Nearest neighbor; NN; Range queries; High-dimensional data; iDistance; Wildcard search; Sphere cover;
D O I
暂无
中图分类号
学科分类号
摘要
Many modern search applications are high-dimensional and depend on efficient orthogonal range queries. These applications span web-based and scientific needs as well as uses for data mining. Although k-nearest neighbor queries are becoming increasingly common due to mobile and geospatial applications, orthogonal range queries in high-dimensional data remain extremely important and relevant. For efficient querying, data is typically stored in an index optimized for either kNN or range queries. This can be problematic when data is optimized for kNN retrieval and a user needs a range query or vice versa. Here, we address the issue of using a kNN-based index for range queries, as well as outline the general computational geometry problem of adapting these systems to range queries. We refer to these methods as space-based decompositions and provide a straightforward heuristic for this problem. Using iDistance as our applied kNN indexing technique, we also develop an optimal (data-based) algorithm designed specifically for its indexing scheme. We compare this method to the suggested naïve approach using real world datasets. The data-based algorithm consistently performs better.
引用
收藏
页码:1107 / 1132
页数:25
相关论文
共 48 条
  • [1] Enabling high-dimensional range queries using kNN indexing techniques: approaches and empirical results
    Wylie, Tim
    Schuh, Michael A.
    Angryk, Rafal A.
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2016, 32 (04) : 1107 - 1132
  • [2] Approximating High-Dimensional Range Queries with kNN Indexing Techniques
    Schuh, Michael A.
    Wylie, Tim
    Liu, Chang
    Angryk, Rafal A.
    COMPUTING AND COMBINATORICS, COCOON 2014, 2014, 8591 : 369 - 380
  • [3] Efficient parallel processing of high-dimensional spatial kNN queries
    Jiang, Tao
    Zhang, Bin
    Lin, Dan
    Gao, Yunjun
    Li, Qing
    SOFT COMPUTING, 2022, 26 (22) : 12291 - 12316
  • [4] FPGA Acceleration of Approximate KNN Indexing on High-Dimensional Vectors
    Danopoulos, Dimitrios
    Kachris, Christoforos
    Soudris, Dimitrios
    2019 14TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC 2019), 2019, : 59 - 65
  • [5] Efficient parallel processing of high-dimensional spatial kNN queries
    Tao Jiang
    Bin Zhang
    Dan Lin
    Yunjun Gao
    Qing Li
    Soft Computing, 2022, 26 : 12291 - 12316
  • [6] A learned index for approximate kNN queries in high-dimensional spaces
    Lingli Li
    Jingwen Cai
    Jie Xu
    Knowledge and Information Systems, 2022, 64 : 3325 - 3342
  • [7] A learned index for approximate kNN queries in high-dimensional spaces
    Li, Lingli
    Cai, Jingwen
    Xu, Jie
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (12) : 3325 - 3342
  • [8] Two-Level Indexing for High-Dimensional Range Queries in Peer-to-Peer Networks
    Zhang, Lelin
    Wang, Zhiyong
    Feng, Dagan
    2009 IEEE INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2009), 2009, : 117 - 121
  • [9] Survey on Exact kNN Queries over High-Dimensional Data Space
    Ukey, Nimish
    Yang, Zhengyi
    Li, Binghao
    Zhang, Guangjian
    Hu, Yiheng
    Zhang, Wenjie
    SENSORS, 2023, 23 (02)
  • [10] Towards Efficient Evaluation of ABAC Policies using High-Dimensional Indexing Techniques
    Paul, Proteet
    Sural, Shamik
    2021 THIRD IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2021), 2021, : 243 - 251