Optimal aggregation algorithms for middleware

被引:981
|
作者
Fagin, R
Lotem, A
Naor, M
机构
[1] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[3] Weizmann Inst Sci, Dept Comp Sci & Appl Math, IL-76100 Rehovot, Israel
关键词
D O I
10.1016/S0022-0000(03)00026-6
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Assume that each object in a database has m grades, or scores, one for each of m attributes. For example, an object can have a color grade, that tells how red it is, and a shape grade, that tells how round it is. For each attribute, there is a sorted list, which lists each object and its grade under that attribute, sorted by grade (highest grade first). Each object is assigned an overall grade, that is obtained by combining the attribute grades using a fixed monotone aggregation function, or combining rule, such as min or average. To determine the top k objects, that is, k objects with the highest overall grades, the naive algorithm must access every object in the database, to find its grade under each attribute. Fagin has given an algorithm ("Fagin's Algorithm", or FA) that is much more efficient. For some monotone aggregation functions, FA is optimal with high probability in the worst case. We analyze an elegant and remarkably simple algorithm ("the threshold algorithm", or TA) that is optimal in a much stronger sense than FA. We show that TA is essentially optimal, not just for some monotone aggregation functions, but for all of them, and not just in a high-probability worst-case sense, but over every database. Unlike FA, which requires large buffers (whose size may grow unboundedly as the database size grows), TA requires only a small, constant-size buffer. TA allows early stopping, which yields, in a precise sense, an approximate version of the top k answers. We distinguish two types of access: sorted access (where the middleware system obtains the grade of an object in some sorted list by proceeding through the list sequentially from the top), and random access (where the middleware system requests the grade of object in a list, and obtains it in one step). We consider the scenarios where random access is either impossible, or expensive relative to sorted access, and provide algorithms that are essentially optimal for these cases as well. (C) 2003 Elsevier Science (USA). All rights reserved.
引用
收藏
页码:614 / 656
页数:43
相关论文
共 50 条
  • [1] Optimal Score Aggregation Algorithms
    Fagin, Ronald
    [J]. PODS'16: PROCEEDINGS OF THE 35TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2016, : 55 - 55
  • [2] Middleware for financial risk data aggregation
    King, A. J.
    Orani, A.
    Parr, F. N.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2014, 58 (04)
  • [3] Data aggregation and routing in Wireless Sensor Networks: Optimal and heuristic algorithms
    Al-Karaki, Jamal N.
    Ul-Mustafa, Raza
    Kamal, Ahmed E.
    [J]. COMPUTER NETWORKS, 2009, 53 (07) : 945 - 960
  • [4] Methodologies for optimization of distributed algorithms and middleware
    Singh, Gurdip
    Kolesnikov, Valeriy
    Das, Sanghamitra
    [J]. 2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 2470 - 2474
  • [5] Two algorithms for finding optimal solutions of the Kemeny rank aggregation problem for full rankings
    D'Ambrosio, Antonio
    Amodio, Sonia
    Iorio, Carmela
    [J]. ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2015, 8 (02) : 198 - 213
  • [6] Algon: From Interchangeable Distributed Algorithms to Interchangeable Middleware
    Renaud, Karen
    Bishop, Judith
    Lo, Johnny
    Worrall, Basil
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2005, 114 : 65 - 85
  • [7] Study of security algorithms to secure IOT data in middleware
    Pallavi, K. N.
    Kumar, Ravi, V
    Kulal, Pooja
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT 2018), 2018, : 305 - 308
  • [8] A ROBUST MIDDLEWARE ARCHITECTURE FOR INTELLIGENT DATA AGGREGATION AND VIDEO SURVEILLANCE
    Mohamed, Tamer
    Shehata, Mohamed
    Badawy, Wael
    [J]. IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 383 - 384
  • [9] EVOLVING OPTIMAL FUZZY-CONNECTIVE-BASED HIERARCHICAL AGGREGATION NETWORKS USING GENETIC ALGORITHMS
    Su, Chao-Ton
    Wang, Fang-Fang
    Yang, Taho
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (1B): : 617 - 631
  • [10] Optimal flow aggregation
    Suri, S
    Sandholm, T
    Warkhede, PR
    [J]. ALGORITHM THEORY - SWAT 2000, 2000, 1851 : 462 - 475