An adaptive and dynamic dimensionality reduction method for high-dimensional indexing

被引:17
|
作者
Shen, Heng Tao [1 ]
Zhou, Xiaofang [1 ]
Zhou, Aoying [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
来源
VLDB JOURNAL | 2007年 / 16卷 / 02期
关键词
high-dimensional indexing; dimensionality reduction; correlated clustering; subspace; projection;
D O I
10.1007/s00778-005-0167-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The notorious "dimensionality curse" is a well-known phenomenon for any multi-dimensional indexes attempting to scale up to high dimensions. One well-known approach to overcome degradation in performance with respect to increasing dimensions is to reduce the dimensionality of the original dataset before constructing the index. However, identifying the correlation among the dimensions and effectively reducing them are challenging tasks. In this paper, we present an adaptive Multi-level Mahalanobis-based Dimensionality Reduction (MMDR) technique for high-dimensional indexing. Our MMDR technique has four notable features compared to existing methods. First, it discovers elliptical clusters for more effective dimensionality reduction by using only the low-dimensional subspaces. Second, data points in the different axis systems are indexed using a single B+-tree. Third, our technique is highly scalable in terms of data size and dimension. Finally, it is also dynamic and adaptive to insertions. An extensive performance study was conducted using both real and synthetic datasets, and the results show that our technique not only achieves higher precision, but also enables queries to be processed efficiently.
引用
收藏
页码:219 / 234
页数:16
相关论文
共 50 条
  • [1] An adaptive and dynamic dimensionality reduction method for high-dimensional indexing
    Heng Tao Shen
    Xiaofang Zhou
    Aoying Zhou
    [J]. The VLDB Journal, 2007, 16 : 219 - 234
  • [2] An adaptive and efficient dimensionality reduction algorithm for high-dimensional indexing
    Jin, H
    Ooi, BC
    Shen, HT
    Yu, C
    Zhou, AY
    [J]. 19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 87 - 98
  • [3] Effective indexing and searching with dimensionality reduction in high-dimensional space
    Jeong, Seungdo
    Kim, Sang-Wook
    Choi, Byung-Uk
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2016, 31 (04): : 291 - 302
  • [4] Efficient indexing of high-dimensional data through dimensionality reduction
    Goh, CH
    Lim, A
    Ooi, BC
    Tan, KL
    [J]. DATA & KNOWLEDGE ENGINEERING, 2000, 32 (02) : 115 - 130
  • [5] A dimensionality reduction method for efficient search of high-dimensional databases
    Aghbari, Z
    Kaneko, K
    Makinouchi, A
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (06): : 1032 - 1041
  • [6] Adaptive Indexing in High-Dimensional Metric Spaces
    Lampropoulos, Konstantinos
    Zardbani, Fatemeh
    Mamoulis, Nikos
    Karras, Panagiotis
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2525 - 2537
  • [7] A hybrid dimensionality reduction method for outlier detection in high-dimensional data
    Guanglei Meng
    Biao Wang
    Yanming Wu
    Mingzhe Zhou
    Tiankuo Meng
    [J]. International Journal of Machine Learning and Cybernetics, 2023, 14 : 3705 - 3718
  • [8] A hybrid dimensionality reduction method for outlier detection in high-dimensional data
    Meng, Guanglei
    Wang, Biao
    Wu, Yanming
    Zhou, Mingzhe
    Meng, Tiankuo
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (11) : 3705 - 3718
  • [9] Adaptive Cluster Distance Bounding for High-Dimensional Indexing
    Ramaswamy, Sharadh
    Rose, Kenneth
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (06) : 815 - 830
  • [10] A sparse grid based method for generative dimensionality reduction of high-dimensional data
    Bohn, Bastian
    Garcke, Jochen
    Griebel, Michael
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 1 - 17