Efficient indexing of high-dimensional data through dimensionality reduction

被引:10
|
作者
Goh, CH [1 ]
Lim, A [1 ]
Ooi, BC [1 ]
Tan, KL [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore 119260, Singapore
关键词
R-trees; dimensionally-reduced R-trees; Hilbert curve; high dimensional space; linear space;
D O I
10.1016/S0169-023X(99)00031-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The performance of the R-tree indexing method is known to deteriorate rapidly when the dimensionality of data increases. In this paper, we present a technique for dimensionality reduction by grouping d distinct attributes into k disjoint clusters and mapping each cluster to a linear space. The resulting k-dimensional space (which may be much smaller than d) can then be indexed using an R-tree efficiently. We present algorithms for decomposing a query region on the native rt-dimensional space to corresponding query regions in the k-dimensional space, as well as search and update operations for the "dimensionally-reduced" R-tree. Experiments using real data sets for point, region, and OLAP queries were conducted. The results indicate that there is potential for significant performance gains over a naive strategy in which an R-tree index is created on the native d-dimensional space. (C) 2000 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:115 / 130
页数:16
相关论文
共 50 条
  • [1] An adaptive and efficient dimensionality reduction algorithm for high-dimensional indexing
    Jin, H
    Ooi, BC
    Shen, HT
    Yu, C
    Zhou, AY
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 87 - 98
  • [2] Effective indexing and searching with dimensionality reduction in high-dimensional space
    Jeong, Seungdo
    Kim, Sang-Wook
    Choi, Byung-Uk
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2016, 31 (04): : 291 - 302
  • [3] An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
    Shen, Heng Tao
    Zhou, Xiaofang
    Zhou, Aoying
    [J]. VLDB JOURNAL, 2007, 16 (02): : 219 - 234
  • [4] An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
    Heng Tao Shen
    Xiaofang Zhou
    Aoying Zhou
    [J]. The VLDB Journal, 2007, 16 : 219 - 234
  • [5] Efficient Dimensionality Reduction for High-Dimensional Network Estimation
    Celik, Safiye
    Logsdon, Benjamin A.
    Lee, Su-In
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 1953 - 1961
  • [6] Dimensionality reduction for visualizing high-dimensional biological data
    Malepathirana, Tamasha
    Senanayake, Damith
    Vidanaarachchi, Rajith
    Gautam, Vini
    Halgamuge, Saman
    [J]. BIOSYSTEMS, 2022, 220
  • [7] Dimensionality Reduction for Registration of High-Dimensional Data Sets
    Xu, Min
    Chen, Hao
    Varshney, Pramod K.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (08) : 3041 - 3049
  • [8] A dimensionality reduction method for efficient search of high-dimensional databases
    Aghbari, Z
    Kaneko, K
    Makinouchi, A
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06): : 1032 - 1041
  • [9] An Efficient Dimensionality Reduction Approach for Small-sample Size and High-dimensional Data Modeling
    Qiu, Xintao
    Fu, Dongmei
    Fu, Zhenduo
    [J]. JOURNAL OF COMPUTERS, 2014, 9 (03) : 576 - 580
  • [10] A Scalable Unsupervised Feature Merging Approach to Efficient Dimensionality Reduction of High-dimensional Visual Data
    Liu, Lingqiao
    Wang, Lei
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3008 - 3015