Maximal Intersection Queries in Randomized Input Models

被引:0
|
作者
Benjamin Hoffmann
Mikhail Lifshits
Yury Lifshits
Dirk Nowotka
机构
[1] Universität Stuttgart,
[2] St. Petersburg State University,undefined
[3] California Institute of Technology,undefined
来源
关键词
Nearest neighbor problem; Randomized input models; Zipf’s law; Maximal intersection problem; Algorithms for large data sets;
D O I
暂无
中图分类号
学科分类号
摘要
Consider a family of sets and a single set, called the query set. How can one quickly find a member of the family which has a maximal intersection with the query set? Time constraints on the query and on a possible preprocessing of the set family make this problem challenging. Such maximal intersection queries arise in a wide range of applications, including web search, recommendation systems, and distributing on-line advertisements. In general, maximal intersection queries are computationally expensive. We investigate two well-motivated distributions over all families of sets and propose an algorithm for each of them. We show that with very high probability an almost optimal solution is found in time which is logarithmic in the size of the family. Moreover, we point out a threshold phenomenon on the probabilities of intersecting sets in each of our two input models which leads to the efficient algorithms mentioned above.
引用
收藏
页码:104 / 119
页数:15
相关论文
共 50 条
  • [42] PolyCard: A learned cardinality estimator for intersection queries on spatial polygons
    Ji, Yuchen
    Amagata, Daichi
    Sasaki, Yuya
    Hara, Takahiro
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2025,
  • [43] A cost model for spatial intersection queries on RI-trees
    Kriegel, HP
    Pfeifle, M
    Pötke, M
    Seidl, T
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 331 - 338
  • [44] EFFICIENT NON-INTERSECTION QUERIES ON AGGREGATED GEOMETRIC DATA
    Gupta, Prosenjit
    Janardan, Ravi
    Smid, Michiel
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2009, 19 (06) : 479 - 506
  • [45] Efficient non-intersection queries on aggregated geometric data
    Gupta, P
    Janardan, R
    Smid, M
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2005, 3595 : 544 - 553
  • [46] GUARD FILES - STABBING AND INTERSECTION QUERIES ON FAT SPATIAL OBJECTS
    NIEVERGELT, J
    WIDMAYER, P
    COMPUTER JOURNAL, 1993, 36 (02): : 107 - 116
  • [47] A cost model for interval intersection queries on RI-Trees
    Kriegel, HP
    Pfeifle, M
    Pötke, M
    Seidl, T
    14TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2002, : 131 - 141
  • [48] Maximal input and feedback in production and comprehension
    Vigliocco, G
    Hartsuiker, RJ
    TWENTY-FIRST CENTURY PSYCHOLINGUISTICS: FOUR CORNERSTONES, 2005, : 209 - 228
  • [49] On intersection of maximal orthogonally k-starshaped polygons
    Oleg, T
    GEOMETRIAE DEDICATA, 1999, 78 (03) : 271 - 278
  • [50] Formal models of Web queries
    Mendelzon, AO
    Milo, T
    INFORMATION SYSTEMS, 1998, 23 (08) : 615 - 637