Efficient Range Queries over Uncertain Strings

被引:0
|
作者
Dai, Dongbo [1 ]
Xie, Jiang [1 ,3 ]
Zhang, Huiran [1 ]
Dong, Jiaqi [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[3] Univ Calif Irvine, Dept Math, Irvine, CA USA
关键词
uncertain strings; range query; filtering;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edit distance based string range query is used extensively in the data integration, keyword search, biological function prediction and many others. In the presence of uncertainty, however, answering range queries is more challenging than those in deterministic scenarios since there are exponentially many possible worlds to be considered. This work extends existing filtering techniques tailored for deterministic strings to uncertain settings. We first design probabilistic q-gram filtering method that can work both efficiently and effectively. Another filtering technique, frequency distance based filtering, is also adapted to work with uncertain strings. To achieve further speed-up, we combined two state-of-the-art approaches based on cumulative distribution functions and local perturbation to improve lower bounds and upper bounds. Comprehensive experiment results show that our filter-based scheme, in the uncertain settings, is more efficient than existing methods only leveraging cumulative distribution functions or local perturbation.
引用
收藏
页码:75 / 95
页数:21
相关论文
共 50 条
  • [1] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    Chen, Yi-Fei
    Qin, Xiao-Lin
    Liu, Liang
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (05) : 982 - 998
  • [2] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    陈逸菲
    秦小麟
    刘亮
    JournalofComputerScience&Technology, 2010, 25 (05) : 982 - 998
  • [3] Uncertain Distance-Based Range Queries over Uncertain Moving Objects
    Yi-Fei Chen
    Xiao-Lin Qin
    Liang Liu
    Journal of Computer Science and Technology, 2010, 25 : 982 - 998
  • [4] Range queries on uncertain data
    Li, Jian
    Wang, Haitao
    THEORETICAL COMPUTER SCIENCE, 2016, 609 : 32 - 48
  • [5] Range Queries on Uncertain Data
    Li, Jian
    Wang, Haitao
    ALGORITHMS AND COMPUTATION, ISAAC 2014, 2014, 8889 : 326 - 337
  • [6] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Xiaoyong Li
    Yijie Wang
    Jie Yu
    Telecommunication Systems, 2015, 60 : 225 - 237
  • [7] An efficient scheme for probabilistic skyline queries over distributed uncertain data
    Li, Xiaoyong
    Wang, Yijie
    Yu, Jie
    TELECOMMUNICATION SYSTEMS, 2015, 60 (02) : 225 - 237
  • [8] GDPS: An Efficient Approach for Skyline Queries over Distributed Uncertain Data
    Li, Xiaoyong
    Wang, Yijie
    Li, Xiaoling
    Wang, Xiaowei
    yu, Jie
    BIG DATA RESEARCH, 2014, 1 (01) : 23 - 36
  • [9] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) : 1448 - 1462
  • [10] Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data
    Ding, Xiaofeng
    Jin, Hai
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,