A practical approach for efficiently answering top-k relational queries

被引:6
|
作者
Ayanso, Anteneh
Goes, Paulo B.
Mehta, Kumar
机构
[1] George Mason Univ, Sch Management, Decis Sci & Management Informat Syst, Fairfax, VA 22030 USA
[2] Brock Univ, Dept Finance Operat & Informat Syst, St Catharines, ON L2S 3A1, Canada
[3] Univ Connecticut, Dept Operat & Informt Management, Storrs, CT 06269 USA
关键词
similarity search; top-k query; uncertainty modeling; RDBMSs;
D O I
10.1016/j.dss.2007.04.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An increasing number of application areas now rely on obtaining the "best matches" to a given query as opposed to exact matches sought by traditional transactions. This type of exploratory querying (also called top-k querying) can significantly improve the performance of web-based applications such as consumer reviews, price comparisons and recommendations for products/services. Due to the lack of support for specialized indexes and/or data structures in relational database management systems (RDBMSs), recent research has focused on utilizing summary statistics (histograms) maintained by RDBMSs for translating the top-k request into a traditional range query. Because the RDBMS query engines are already optimized for execution of range queries, such approach has both practical as well as efficiency advantages. In this paper, we review the strengths and weaknesses of common histogram construction techniques with regard to their structural characteristics, accuracy in approximating the true distribution of the underlying data, and implications for top-k retrieval. We also present our top-k retrieval strategy (Query-Level Optimal Cost Strategy - QLOCS) and demonstrate its "histogram-independent" performance. Based on comparative experimental and statistical analyses with the best-known histogram-based strategy in the literature, we show that QLOCS is not only more efficient but also provides more consistent performance across commonly used histogram types in RDBMSs. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:326 / 349
页数:24
相关论文
共 50 条
  • [1] Answering Top-k Keyword Queries on Relational Databases
    Thein, Myint Myint
    Thwin, Mie Mie Su
    [J]. INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2012, 2 (03) : 36 - 57
  • [2] Efficiently answering probabilistic threshold top-k queries on uncertain data
    Hua, Ming
    Pei, Jian
    Zhang, Wenjie
    Lin, Xuemin
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1403 - +
  • [3] CrowdK: Answering top-k queries with crowdsourcing
    Lee, Jongwuk
    Lee, Dongwon
    Hwang, Seung-won
    [J]. INFORMATION SCIENCES, 2017, 399 : 98 - 120
  • [4] Answering Top-k Similar Region Queries
    Sheng, Chang
    Zheng, Yu
    Hsu, Wynne
    Lee, Mong Li
    Xie, Xing
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, PROCEEDINGS, 2010, 5981 : 186 - +
  • [5] Answering Top-k Exemplar Trajectory Queries
    Wang, Sheng
    Bao, Zhifeng
    Culpepper, J. Shane
    Sellis, Timos
    Sanderson, Mark
    Qin, Xiaolin
    [J]. 2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 597 - 608
  • [6] Efficiently answering top-k frequent term queries in temporal-categorical range
    He, Zhenying
    Wang, Lu
    Lu, Chang
    Jing, Yinan
    Zhang, Kai
    Han, Weili
    Li, Jianxin
    Liu, Chengfei
    Wang, X. Sean
    [J]. INFORMATION SCIENCES, 2021, 574 : 238 - 258
  • [7] Answering Why-not Questions on Top-k Queries
    He, Zhian
    Lo, Eric
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 750 - 761
  • [8] Answering Why-Not Questions on Top-K Queries
    He, Zhian
    Lo, Eric
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (06) : 1300 - 1315
  • [9] Answering Top-k Representative Queries on Graph Databases
    Ranu, Sayan
    Minh Hoang
    Singh, Ambuj
    [J]. SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 1163 - 1174
  • [10] Efficiently answer top-k queries on typed intervals
    Xu, Jianqiu
    Lu, Hua
    [J]. INFORMATION SYSTEMS, 2017, 71 : 164 - 181