Indexing Metric Uncertain Data for Range Queries

被引:8
|
作者
Chen, Lu [1 ]
Gao, Yunjun [1 ,2 ]
Li, Xinhan [1 ]
Jensen, Christian S. [3 ]
Chen, Gang [1 ]
Zheng, Baihua [4 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou, Zhejiang, Peoples R China
[2] Zhejiang Univ, Innovat Joint Res Ctr Cyber Phys Soc Syst, Hangzhou, Zhejiang, Peoples R China
[3] Aalborg Univ, Dept Comp Sci, Aalborg, Denmark
[4] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
关键词
Range query; Uncertain data; Metric space; Index structure; NEAREST-NEIGHBOR SEARCH;
D O I
10.1145/2723372.2723728
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Range queries in metric spaces have applications in many areas such as multimedia retrieval, computational biology, and location-based services, where metric uncertain data exists in different forms, resulting from equipment limitations, high-throughput sequencing technologies, privacy preservation, or others. In this paper, we represent metric uncertain data by using an object-level model and a bi-level model, respectively. Two novel indexes, the uncertain pivot B+ -tree (UPB-tree) and the uncertain pivot B+- forest (UPB-forest), are proposed accordingly in order to support probabilistic range queries w.r.t. a wide range of uncertain data types and similarity metrics. Both index structures use a small set of effective pivots chosen based on a newly defined criterion, and employ the B+ -tree(s) as the underlying index. By design, they are easy to be integrated into any existing DBMS. In addition, we present efficient metric probabilistic range query algorithms, which utilize the validation and pruning techniques based on our derived probability lower and upper bounds. Extensive experiments with both real and synthetic data sets demonstrate that, compared against existing state-of-the-art indexes for metric uncertain data, the UPB-tree and UPB-forest incur much lower construction costs, consume smaller storage spaces, and can support more efficient metric probabilistic range queries.
引用
收藏
页码:951 / 965
页数:15
相关论文
共 50 条
  • [31] On high dimensional indexing of uncertain data
    Aggarwal, Charu C.
    Yu, Philip S.
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1460 - +
  • [32] A Novel Indexing Method for Spatial-Keyword Range Queries
    Tampakis, Panagiotis
    Spyrellis, Dimitris
    Doulkeridis, Christos
    Pelekis, Nikos
    Kalyvas, Christos
    Vlachou, Akrivi
    PROCEEDINGS OF 17TH INTERNATIONAL SYMPOSIUM ON SPATIAL AND TEMPORAL DATABASES, SSTD 2021, 2021, : 54 - 63
  • [33] Indexing range sum queries in spatio-temporal databases
    Cho, Hyung-Ju
    Chung, Chin-Wan
    INFORMATION AND SOFTWARE TECHNOLOGY, 2007, 49 (04) : 324 - 331
  • [34] A comparison of selectivity estimators for range queries on metric attributes
    Blohsfeld, B
    Korus, D
    Seeger, B
    SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 239 - 250
  • [35] Perils of Combining Parallel Distance Computations with Metric and Ptolemaic Indexing in kNN Queries
    Krulis, Martin
    Kirchhoff, Steffen
    Yaghob, Jakub
    SIMILARITY SEARCH AND APPLICATIONS, 2014, 8821 : 127 - 138
  • [36] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    Chen, Yi-Fei
    Qin, Xiao-Lin
    Liu, Liang
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (05) : 982 - 998
  • [37] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    陈逸菲
    秦小麟
    刘亮
    JournalofComputerScience&Technology, 2010, 25 (05) : 982 - 998
  • [38] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    Yi-Fei Chen
    Xiao-Lin Qin
    Liang Liu
    Journal of Computer Science and Technology, 2010, 25 : 982 - 998
  • [39] RRSi: indexing XML data for proximity twig queries
    Patrick K. L. Ng
    Vincent T. Y. Ng
    Knowledge and Information Systems, 2008, 17 : 193 - 216
  • [40] RRSi: indexing XML data for proximity twig queries
    Ng, Patrick K. L.
    Ng, Vincent T. Y.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 17 (02) : 193 - 216