Toward efficient multifeature query processing

被引:7
|
作者
Jagadish, HV [1 ]
Ooi, BC
Shen, HT
Tan, KL
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Natl Univ Singapore, Dept Comp Sci, Sch Comp, Singapore 117543, Singapore
[3] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
关键词
multifeature; indexing; query processing; high-dimensional; weighted query;
D O I
10.1109/TKDE.2006.51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many advanced applications, data are described by multiple high-dimensional features. Moreover, different queries may weight these features differently; some may not even specify all the features. In this paper, we propose our solution to support efficient query processing in these applications. We devise a novel representation that compactly captures f features into two components: The first component is a 2D vector that reflects a distance range ( minimum and maximum values) of the f features with respect to a reference point ( the center of the space) in a metric space and the second component is a bit signature, with two bits per dimension, obtained by analyzing each feature's descending energy histogram. This representation enables two levels of filtering: The first component prunes away points that do not share similar distance ranges, while the bit signature filters away points based on the dimensions of the relevant features. Moreover, the representation facilitates the use of a single index structure to further speed up processing. We employ the classical B+-tree for this purpose. We also propose a KNN search algorithm that exploits the access orders of critical dimensions of highly selective features and partial distances to prune the search space more effectively. Our extensive experiments on both real-life and synthetic data sets show that the proposed solution offers significant performance advantages over sequential scan and retrieval methods using single and multiple VA-files.
引用
收藏
页码:350 / 362
页数:13
相关论文
共 50 条
  • [31] Efficient Query Processing with Reduced Implicate Tries
    Neil V. Murray
    Erik Rosenthal
    [J]. Journal of Automated Reasoning, 2007, 38 : 155 - 172
  • [32] Efficient maximal reverse skyline query processing
    Farnoush Banaei-Kashani
    Parisa Ghaemi
    Bahman Movaqar
    Seyed Jalal Kazemitabar
    [J]. GeoInformatica, 2017, 21 : 549 - 572
  • [33] Efficient parallel query processing by graph ranking
    Dereniowski, D
    Kubale, M
    [J]. FUNDAMENTA INFORMATICAE, 2006, 69 (03) : 273 - 285
  • [34] Efficient query processing with reduced implicate tries
    Murray, Neil V.
    Rosenthal, Erik
    [J]. Journal of Automated Reasoning, 2007, 38 (1-3): : 155 - 172
  • [35] Efficient query processing with reduced implicate tries
    Murray, Neil V.
    Rosenthal, Erik
    [J]. JOURNAL OF AUTOMATED REASONING, 2007, 38 (1-3) : 155 - 172
  • [36] Efficient Processing of the Skyline-CL Query
    Huang, Zhenhua
    Zhang, Juan
    Tian, Chunqi
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2016, 41 (08) : 2801 - 2811
  • [37] Combining efficient XML compression with query processing
    Skibinski, Przemyslaw
    Swacha, Jakub
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4690 : 330 - +
  • [38] Efficient maximal reverse skyline query processing
    Banaei-Kashani, Farnoush
    Ghaemi, Parisa
    Movaqar, Bahman
    Kazemitabar, Seyed Jalal
    [J]. GEOINFORMATICA, 2017, 21 (03) : 549 - 572
  • [39] Efficient Query Processing for Scalable Web Search
    Tonellotto, Nicola
    Macdonald, Craig
    Ounis, Iadh
    [J]. FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2018, 12 (4-5): : 319 - 500
  • [40] Efficient query processing on uncertain graph databases
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
    [J]. Jisuanji Xuebao, 2009, 10 (2066-2079): : 2066 - 2079