Efficient Range Queries over Uncertain Strings

Authors
Dai, Dongbo [1]
Xie, Jiang [1, 3]
Zhang, Huiran [1]
Dong, Jiaqi [2]
Affiliations
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Univ Calif Irvine, Dept Math, Irvine, CA USA
Source
SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, SSDBM 2012 | 2012 / Vol. 7338
Keywords
uncertain strings; range query; filtering
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Edit-distance-based string range queries are used extensively in data integration, keyword search, biological function prediction, and many other applications. In the presence of uncertainty, however, answering range queries is more challenging than in deterministic scenarios, since there are exponentially many possible worlds to consider. This work extends existing filtering techniques tailored for deterministic strings to uncertain settings. We first design a probabilistic q-gram filtering method that works both efficiently and effectively. Another filtering technique, frequency-distance-based filtering, is also adapted to work with uncertain strings. To achieve further speed-up, we combine two state-of-the-art approaches, based on cumulative distribution functions and local perturbation, to improve the lower and upper bounds. Comprehensive experimental results show that, in uncertain settings, our filter-based scheme is more efficient than existing methods that leverage only cumulative distribution functions or local perturbation.
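For orientation, the two filters named in the abstract can be sketched in their classical deterministic form. This is not the paper's probabilistic method, which the record does not describe in detail. It is a minimal illustration of the count (q-gram) filter and the frequency-distance lower bound that the paper says it extends to uncertain strings. All function names here are hypothetical.

```python
from collections import Counter


def edit_distance(s, t):
    """Standard dynamic-programming (Levenshtein) edit distance."""
    prev = list(range(len(t) + 1))
    for i, a in enumerate(s, 1):
        cur = [i]
        for j, b in enumerate(t, 1):
            cur.append(min(prev[j] + 1,              # delete a
                           cur[j - 1] + 1,           # insert b
                           prev[j - 1] + (a != b)))  # substitute or match
        prev = cur
    return prev[-1]


def qgrams(s, q):
    """Multiset of all length-q substrings of s (empty if len(s) < q)."""
    return Counter(s[i:i + q] for i in range(len(s) - q + 1))


def passes_count_filter(s, t, k, q=2):
    """Count filter: strings within edit distance k share at least
    max(|s|, |t|) - q + 1 - k*q q-grams (counted as multisets)."""
    common = sum((qgrams(s, q) & qgrams(t, q)).values())
    return common >= max(len(s), len(t)) - q + 1 - k * q


def frequency_distance(s, t):
    """Lower bound on edit distance from character frequency vectors."""
    cs, ct = Counter(s), Counter(t)
    surplus_s = sum((cs - ct).values())  # characters s has in excess
    surplus_t = sum((ct - cs).values())  # characters t has in excess
    return max(surplus_s, surplus_t)


def range_query(query, strings, k, q=2):
    """Return strings within edit distance k of query, applying the cheap
    filters first and verifying survivors with the exact distance."""
    results = []
    for s in strings:
        if abs(len(s) - len(query)) > k:
            continue  # length filter
        if frequency_distance(query, s) > k:
            continue  # frequency-distance filter
        if not passes_count_filter(query, s, k, q):
            continue  # q-gram count filter
        if edit_distance(query, s) <= k:
            results.append(s)
    return results


print(range_query("kitten", ["sitting", "mitten", "banana", "kitchen"], 2))
```

Both filters only discard candidates that cannot satisfy the threshold, so the exact edit distance is computed only for the survivors. In the uncertain setting the abstract describes, the same bounds would have to hold across possible worlds, and this sketch does not attempt that.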
Pages: 75-95
Page count: 21