On the Complexity of Query Result Diversification

被引:12
|
作者
Deng, Ting [1 ,2 ]
Fan, Wenfei [3 ,4 ]
机构
[1] Beihang Univ, SKLSDE, RCBD, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[3] Univ Edinburgh, Sch Informat, Lab Fdn Comp Sci, Edinburgh, Midlothian, Scotland
[4] Beihang Univ, Beijing, Peoples R China
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2014年 / 39卷 / 02期
基金
英国工程与自然科学研究理事会;
关键词
Design; Algorithms; Theory; Result diversification; relevance; diversity; recommender systems; database queries; combined complexity; data complexity; counting problems;
D O I
10.1145/2602136
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query result diversification is a bi-criteria optimization problem for ranking query results. Given a database D, a query Q, and a positive integer k, it is to find a set of k tuples from Q(D) such that the tuples are as relevant as possible to the query, and at the same time, as diverse as possible to each other. Subsets of Q(D) are ranked by an objective function defined in terms of relevance and diversity. Query result diversification has found a variety of applications in databases, information retrieval, and operations research. This article investigates the complexity of result diversification for relational queries. (1) We identify three problems in connection with query result diversification, to determine whether there exists a set of k tuples that is ranked above a bound with respect to relevance and diversity, to assess the rank of a given k-element set, and to count how many k-element sets are ranked above a given bound based on an objective function. (2) We study these problems for a variety of query languages and for the three objective functions proposed in Gollapudi and Sharma [2009]. We establish the upper and lower bounds of these problems, all matching, for both combined complexity and data complexity. (3) We also investigate several special settings of these problems, identifying tractable cases. Moreover, (4) we reinvestigate these problems in the presence of compatibility constraints commonly found in practice, and provide their complexity in all these settings.
引用
收藏
页数:46
相关论文
共 50 条
  • [21] From Query Complexity to Computational Complexity
    Dobzinski, Shahar
    Vondrak, Jan
    STOC'12: PROCEEDINGS OF THE 2012 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2012, : 1107 - 1116
  • [22] The Query Complexity of Certification
    Blanc, Guy
    Koch, Caleb
    Lange, Jane
    Tan, Li-Yang
    PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022, : 623 - 636
  • [23] On the query complexity of sets
    Beigel, R
    Gasarch, W
    Kummer, M
    Martin, G
    McNicholl, T
    Stephan, F
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 1996, 1996, 1113 : 206 - 217
  • [24] Query Complexity in Expectation
    Kaniewski, Jedrzej
    Lee, Troy
    de Wolf, Ronald
    AUTOMATA, LANGUAGES, AND PROGRAMMING, PT I, 2015, 9134 : 761 - 772
  • [25] Learning for Search Result Diversification
    Zhu, Yadong
    Lan, Yanyan
    Guo, Jiafeng
    Cheng, Xueqi
    Niu, Shuzi
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 293 - 302
  • [26] Aggregated Search Result Diversification
    Santos, Rodrygo L. T.
    Macdonald, Craig
    Ounis, Iadh
    ADVANCES IN INFORMATION RETRIEVAL THEORY, 2011, 6931 : 250 - 261
  • [27] Search Result Diversification in Flickr
    Negi, Sumit
    Jaju, Abhimanyu
    Chaudhury, Santanu
    2016 8TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORKS (COMSNETS), 2016,
  • [28] Result diversification for tweet search
    Ozsoy, Makbule Gulcin
    Onal, Kezban Dilek
    Altingovde, Ismail Sengor
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8787 : 78 - 89
  • [29] A Survey on Search Result Diversification
    Dou Z.-C.
    Qin X.-B.
    Wen J.-R.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (12): : 2591 - 2613
  • [30] Result Diversification for Tweet Search
    Ozsoy, Makbule Gulcin
    Onal, Kezban Dilek
    Altingovde, Ismail Sengor
    WEB INFORMATION SYSTEMS ENGINEERING, PT II, 2014, 8787 : 78 - 89