Lower Bounds for Processing Data with Few Random Accesses to External Memory

被引:7
|
作者
Grohe, Martin [1 ]
Hernich, Andre [2 ]
Schweikardt, Nicole [2 ]
机构
[1] Humboldt Univ, Inst Informat, D-10099 Berlin, Germany
[2] Goethe Univ Frankfurt, Inst Informat, D-60054 Frankfurt, Germany
关键词
Theory; Languages; Complexity; data streams; real-time data; query processing; query optimization; semi-structured data; XML; COMPLEXITY;
D O I
10.1145/1516512.1516514
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider a scenario where we want to query a large dataset that is stored in external memory and does not fit into main memory. The most constrained resources in such a situation are the size of the main memory and the number of random accesses to external memory. We note that sequentially streaming data from external memory through main memory is much less prohibitive. We propose an abstract model of this scenario in which we restrict the size of the main memory and the number of random accesses to external memory, but admit arbitrary sequential access. A distinguishing feature of our model is that it allows the usage of unlimited external memory for storing intermediate results, such as several hard disks that can be accessed in parallel. In this model, we prove lower bounds for the problem of sorting a sequence of strings (or numbers), the problem of deciding whether two given sets of strings are equal, and two closely related decision problems. Intuitively, our results say that there is no algorithm for the problems that uses internal memory space bounded by N1-epsilon and at most o( log N) random accesses to external memory, but unlimited "streaming access", both for writing to and reading from external memory. (Here, N denotes the size of the input and e is an arbitrary constant greater than 0.) We even permit randomized algorithms with one-sided bounded error. We also consider the problem of evaluating database queries and prove similar lower bounds for evaluating relational algebra queries against relational databases and XQuery and XPath queries against XML-databases.
引用
收藏
页数:58
相关论文
共 50 条
  • [1] Tight lower bounds for query processing on streaming and external memory data
    Grohe, Martin
    Koch, Christoph
    Schweikardt, Nicole
    THEORETICAL COMPUTER SCIENCE, 2007, 380 (1-2) : 199 - 217
  • [2] Tight lower bounds for query processing on streaming and external memory data
    Grohe, M
    Koch, C
    Schweikardt, N
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2005, 3580 : 1076 - 1088
  • [3] Lower bounds for external memory dictionaries
    Brodal, GS
    Fagerberg, R
    PROCEEDINGS OF THE FOURTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2003, : 546 - 554
  • [4] A Data Structure for a Sequence of String Accesses in External Memory
    Ciriani, Valentina
    Ferragina, Paolo
    Luccio, Fabrizio
    Muthukrishnan, S.
    ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (01)
  • [5] Lower Bounds in the Asymmetric External Memory Model
    Jacob, Riko
    Sitchinava, Nodari
    PROCEEDINGS OF THE 29TH ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES (SPAA'17), 2017, : 247 - 254
  • [6] Lower Bounds for External Memory Integer Sorting via Network Coding
    Farhadi, Alireza
    Hajiaghayi, Mohammad Taghi
    Larsen, Kasper Green
    Shi, Elaine
    COMMUNICATIONS OF THE ACM, 2020, 63 (10) : 97 - 105
  • [7] LOWER BOUNDS FOR EXTERNAL MEMORY INTEGER SORTING VIA NETWORK CODING
    Farhadi, Alireza
    Hajiaghayi, Mohammadtaghi
    Larsen, Kasper Green
    Shi, Elaine
    SIAM JOURNAL ON COMPUTING, 2023, 52 (02) : 87 - 111
  • [8] Lower Bounds for External Memory Integer Sorting via Network Coding
    Farhadi, Alireza
    Hajiaghayi, MohammadTaghi
    Larsen, Kasper Green
    Shi, Elaine
    PROCEEDINGS OF THE 51ST ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '19), 2019, : 997 - 1008
  • [9] Optimization of data accesses in reflective memory systems
    Ryou, Myung Seon
    Lee, Chang Won
    Park, Ki Woung
    Song, Seung-Whan
    Choi, Changho
    TENCON 2006 - 2006 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2006, : 1344 - +
  • [10] Lower bounds for transactional memory
    Ravi, Srivatsan, 2017, European Association for Theoretical Computer Science (2017):