Data Acquisition for Probabilistic Nearest-Neighbor Query

被引:5
|
作者
Lin, Yu-Chieh [1 ]
Yang, De-Nian [1 ,2 ]
Shuai, Hong-Han [3 ]
Chen, Ming-Syan [1 ,4 ]
机构
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[3] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan
[4] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
关键词
Uncertainty; algorithm design and analysis; query processing; nearest neighbor searches;
D O I
10.1109/TKDE.2013.2297916
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Management of uncertain data in spatial queries has drawn extensive research interests to consider the granularity of devices and noises in the collection and the delivery of data. Most previous works usually model and handle uncertain data to find the required results directly. However, it is more difficult for users to obtain useful insights when data uncertainty dramatically increases. In this case, users are usually willing to invest more resources to improve the result by reducing the data uncertainty in order to obtain more interesting observations with the existing schemes. In light of this important need, this paper formulates a new problem of selecting a given number of uncertain data objects for acquiring their attribute values to improve the result of the Probabilistic k-Nearest-Neighbor (k-PNN) query. We prove that better query results are guaranteed to be returned with data acquisition, and we devise several algorithms to maximize the expected improvement. We first explore the optimal single-object acquisition for 1-PNN to examine the fundamental problem structure and then propose an efficient algorithm that discovers crucial properties to simplify the probability derivation in varied situations. We extend the proposed algorithm to achieve the optimal multi-object acquisition for 1-PNN by deriving an upper bound to facilitate efficient pruning of unnecessary sets of objects. Moreover, for data acquisition of k-PNN, we extract the k-PNN answers with sufficiently large probabilities to trim the search space and properly exploit the result of single-object acquisition for estimating the gain from multi-object acquisition. The experimental results demonstrate that the probability of k-PNN can be significantly improved even with only a small number of objects for data acquisition.
引用
收藏
页码:410 / 427
页数:18
相关论文
共 50 条
  • [1] Probabilistic nearest-neighbor query on uncertain objects
    Kriegel, Hans-Peter
    Kunath, Peter
    Renz, Matthias
    [J]. ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 337 - +
  • [2] A PROBABILISTIC FILTER PROTOCOL FOR CONTINUOUS NEAREST-NEIGHBOR QUERY
    Zhu, Jianpeng
    Jin, Jian
    Wang, Ying
    [J]. 2009 IEEE YOUTH CONFERENCE ON INFORMATION, COMPUTING AND TELECOMMUNICATION, PROCEEDINGS, 2009, : 399 - +
  • [3] Range nearest-neighbor query
    Hu, HB
    Lee, DL
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (01) : 78 - 91
  • [4] Probabilistic nearest neighbor query processing on distributed uncertain data
    Amagata, Daichi
    Sasaki, Yuya
    Hara, Takahiro
    Nishio, Shojiro
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2016, 34 (02) : 259 - 287
  • [5] Probabilistic nearest neighbor query processing on distributed uncertain data
    Daichi Amagata
    Yuya Sasaki
    Takahiro Hara
    Shojiro Nishio
    [J]. Distributed and Parallel Databases, 2016, 34 : 259 - 287
  • [6] Efficient Nearest-Neighbor Query and Clustering of Planar Curves
    Aronov, Boris
    Filtser, Omrit
    Horton, Michael
    Katz, Matthew J.
    Sheikhan, Khadijeh
    [J]. ALGORITHMS AND DATA STRUCTURES, WADS 2019, 2019, 11646 : 28 - 42
  • [7] MapReduce Algorithms for the K Group Nearest-Neighbor Query
    Moutafis, Panagiotis
    Garcia-Garcia, Francisco
    Mavrommatis, George
    Vassilakopoulos, Michael
    Corral, Antonio
    Iribarne, Luis
    [J]. SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 448 - 455
  • [8] Efficient Processing of Probabilistic Group Nearest Neighbor Query on Uncertain Data
    Li, Jiajia
    Wang, Botao
    Wang, Guoren
    Bi, Xin
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 436 - 450
  • [9] Efficient Probabilistic Reverse Nearest Neighbor Query Processing on Uncertain Data
    Bernecker, Thomas
    Emrich, Tobias
    Kriegel, Hans-Peter
    Renz, Matthias
    Zankl, Stefan
    Zuefle, Andreas
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (10): : 669 - 680
  • [10] Probabilistic verifiers: Evaluating Constrained Nearest-Neighbor queries over uncertain data
    Cheng, Reynold
    Chen, Jinchuan
    Mokbel, Mohamed
    Chow, Chi-Yin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 973 - +