Evaluating refined queries in top-k retrieval systems

被引:22
|
作者
Chakrabarti, K
Ortega-Binderberger, M
Mehrotra, S
Porkaew, K
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] IBM Silicon Valley Lab, San Jose, CA 95141 USA
[3] Univ Calif Irvine, Sch Informat & Comp Sci, Irvine, CA 92697 USA
基金
美国国家科学基金会;
关键词
multidimensional indexing; k-nearest neighbor search; similarity queries; query refinement; relevance feedback;
D O I
10.1109/TKDE.2004.1269602
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many applications, users specify target values for certain attributes/features without requiring exact matches to these values in return. Instead, the result is typically a ranked list of "top k" objects that best match the specified feature values. User subjectivity is an important aspect of such queries, i.e., which objects are relevant to the user and which are not depends on the, perception of the user. Due to the subjective nature of top-k queries, the answers returned by the system to an user query often do not satisfy the users need right away, either be,cause the weights and the distance functions associated with the features do not accurately capture the users perception or because the specified target values do not fully capture her information need or both. In such cases, the user would like to refine the query and resubmit it in order to-get back a better set of answers. While there has been a lot of research on query refinement models, there is no work that we are aware of on supporting refinement of top-k queries efficiently in a database system. Done naively, each "refined" query can be treated as a "starting" query and evaluated from scratch. This paper explores alternative approaches that significantly improve the cost of evaluating refined queries by exploiting the observation that the refined queries are not modified drastically from one iteration to another. Our experiments over a real-life multimedia data set show that the proposed techniques Save more than 80 percent of the execution cost of refined queries over the naive approach and is more than an order of magnitude faster than a simple sequential scan.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 50 条
  • [31] Top-k Dominating Queries on Incomplete Data
    Miao, Xiaoye
    Gao, Yunjun
    Zheng, Baihua
    Chen, Gang
    Cui, Huiyong
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1500 - 1501
  • [32] CrowdK: Answering top-k queries with crowdsourcing
    Lee, Jongwuk
    Lee, Dongwon
    Hwang, Seung-won
    [J]. INFORMATION SCIENCES, 2017, 399 : 98 - 120
  • [33] Processing top-k queries from samples
    Cohen, Edith
    Grossaug, Nadav
    Kaplan, Haim
    [J]. COMPUTER NETWORKS, 2008, 52 (14) : 2605 - 2622
  • [34] Approximate top-k queries in sensor networks
    Patt-Shamir, Boaz
    Shafrir, Allon
    [J]. STRUCTURAL INFORMATION AND COMMUNICATION COMPLEXITY, PROCEEDINGS, 2006, 4056 : 319 - +
  • [35] Top-k differential queries in graph databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhövd, Christof
    Lehner, Wolfgang
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8716 : 112 - 115
  • [36] Top-k Differential Queries in Graph Databases
    Vasilyeva, Elena
    Thiele, Maik
    Bornhoevd, Christof
    Lehner, Wolfgang
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 112 - 125
  • [37] Efficient processing of distributed top-k queries
    Yu, HL
    Li, HG
    Wu, P
    Agrawal, D
    El Abbadi, A
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, 3588 : 65 - 74
  • [38] Top-k Queries with Contextual Fuzzy Preferences
    Bosc, Patrick
    Pivert, Olivier
    Mokhtari, Amine
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 847 - 854
  • [39] Continuous Top-k Dominating Queries in Subspaces
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    [J]. PCI 2008: 12TH PAN-HELLENIC CONFERENCE ON INFORMATICS, PROCEEDINGS, 2008, : 31 - 35
  • [40] Reverse spatial top-k keyword queries
    Ahmed, Pritom
    Eldawy, Ahmed
    Hristidis, Vagelis
    Tsotras, Vassilis J.
    [J]. VLDB JOURNAL, 2023, 32 (03): : 501 - 524