Efficient Prediction of Difficult Keyword Queries over Databases

被引:2
|
作者
Cheng, Shiwen [1 ]
Termehchy, Arash [2 ]
Hristidis, Vagelis [1 ]
机构
[1] Univ Calif Riverside, Riverside, CA 92521 USA
[2] Oregon State Univ, Corvallis, OR 97331 USA
基金
美国国家科学基金会;
关键词
Query performance; query effectiveness; keyword query; robustness; databases; PERFORMANCE; MODEL;
D O I
10.1109/TKDE.2013.140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyword queries on databases provide easy access to data, but often suffer from low ranking quality, i.e., low precision and/or recall, as shown in recent benchmarks. It would be useful to identify queries that are likely to have low ranking quality to improve the user satisfaction. For instance, the system may suggest to the user alternative queries for such hard queries. In this paper, we analyze the characteristics of hard queries and propose a novel framework to measure the degree of difficulty for a keyword query over a database, considering both the structure and the content of the database and the query results. We evaluate our query difficulty prediction model against two effectiveness benchmarks for popular keyword search ranking methods. Our empirical results show that our model predicts the hard queries with high accuracy. Further, we present a suite of optimizations to minimize the incurred time overhead.
引用
收藏
页码:1507 / 1520
页数:14
相关论文
共 50 条
  • [1] Efficient processing of keyword queries over graph databases for finding effective answers
    Park, Chang-Sup
    Lim, Sungchae
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (01) : 42 - 57
  • [2] Efficient Match-Based Candidate Network Generation for Keyword Queries Over Relational Databases
    de Oliveira, Pericles Silva
    da Silva, Altigran
    de Moura, Edleno
    de Freitas, Rosiane
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (04) : 1735 - 1750
  • [3] CLASCN:: Candidate network selection for efficient top-k keyword queries over databases
    Zhang, Jun
    Peng, Zhao-Hui
    Wang, Shan
    Nie, Hui-Jing
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2007, 22 (02) : 197 - 207
  • [4] CLASCN: Candidate Network Selection for Efficient Top-k Keyword Queries over Databases
    Jun Zhang
    Zhao-Hui Peng
    Shan Wang
    Hui-Jing Nie
    [J]. Journal of Computer Science and Technology, 2007, 22 : 197 - 207
  • [5] Supporting Schema References in Keyword Queries Over Relational Databases
    Martins, Paulo
    da Silva, Altigran Soares
    Afonso, Ariel
    Cavalcanti, Joao
    de Moura, Edleno
    [J]. IEEE ACCESS, 2023, 11 : 92365 - 92390
  • [6] Analyzing Temporal Keyword Queries for Interactive Search over Temporal Databases
    Gao, Qiao
    Lee, Mong Li
    Ling, Tok Wang
    Dobbie, Gillian
    Zeng, Zhong
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2018, PT I, 2018, 11029 : 355 - 371
  • [7] Cluster Based Prediction of Keyword Query Over Databases
    Geethanjali, K.
    Suresh, K.
    Raju, R. Kanaka
    [J]. COMPUTER COMMUNICATION, NETWORKING AND INTERNET SECURITY, 2017, 5 : 253 - 260
  • [8] Ranking Based Prediction of Keyword Over Big Databases
    Kothawade, Abhijeet
    Bagul, Jagdish
    Harak, Milan
    Patil, Bharati
    [J]. 2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 899 - 903
  • [9] Combining user and database perspective for solving keyword queries over relational databases
    Bergamaschi, Sonia
    Guerra, Francesco
    Interlandi, Matteo
    Trillo-Lado, Raquel
    Velegrakis, Yannis
    [J]. INFORMATION SYSTEMS, 2016, 55 : 1 - 19
  • [10] Keyword Queries by Matching Synonyms in Relational Databases
    Huang, Dingfang
    Xie, Dong
    Liu, Heyun
    [J]. Proceedings of the 2016 6th International Conference on Applied Science, Engineering and Technology (ICASET), 2016, 77 : 203 - 208