The Cost of Cache-Oblivious Searching

被引:3
|
作者
Bender, Michael A. [2 ,3 ]
Brodal, Gerth Stolting [4 ]
Fagerberg, Rolf [5 ]
Ge, Dongdong [6 ]
He, Simai [7 ]
Hu, Haodong [8 ]
Iacono, John [9 ]
Lopez-Ortiz, Alejandro [1 ]
机构
[1] Univ Waterloo, Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[2] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
[3] Tokutek Inc, Lexington, MA USA
[4] Aarhus Univ, MADALGO Ctr Mass Data Algorithm, Ctr Danish Natl Res Fdn, Dept Comp Sci, DK-8000 Aarhus C, Denmark
[5] Univ So Denmark, Dept Math & Comp Sci, DK-5230 Odense M, Denmark
[6] Shanghai Jiao Tong Univ, Dept Management Sci & Engn, Antai Sch Econ & Management, Shanghai 200052, Peoples R China
[7] Chinese Univ Hongkong, Dept Syst Engn & Engn Management, Hong Kong, Hong Kong, Peoples R China
[8] Microsoft, Networking & Device Connect Windows Div, Redmond, WA 98052 USA
[9] Polytech Univ, Dept Comp & Informat Sci, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
Cache-oblivious B-tree; Cache-oblivious searching; van Emde Boas layout; PARALLEL MEMORY; ALGORITHMS; MODEL;
D O I
10.1007/s00453-010-9394-0
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper gives tight bounds on the cost of cache-oblivious searching. The paper shows that no cache-oblivious search structure can guarantee a search performance of fewer than lg elog (B) N memory transfers between any two levels of the memory hierarchy. This lower bound holds even if all of the block sizes are limited to be powers of 2. The paper gives modified versions of the van Emde Boas layout, where the expected number of memory transfers between any two levels of the memory hierarchy is arbitrarily close to [lg e+O(lg lg B/lg B)]log (B) N+O(1). This factor approaches lg ea parts per thousand 1.443 as B increases. The expectation is taken over the random placement in memory of the first element of the structure. Because searching in the disk-access machine (DAM) model can be performed in log (B) N+O(1) block transfers, this result establishes a separation between the (2-level) DAM model and cache-oblivious model. The DAM model naturally extends to k levels. The paper also shows that as k grows, the search costs of the optimal k-level DAM search structure and the optimal cache-oblivious search structure rapidly converge. This result demonstrates that for a multilevel memory hierarchy, a simple cache-oblivious structure almost replicates the performance of an optimal parameterized k-level DAM structure.
引用
收藏
页码:463 / 505
页数:43
相关论文
共 50 条
  • [21] Cache-oblivious databases: Limitations and opportunities
    He, Bingsheng
    Luo, Qiong
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (02):
  • [22] Cache-oblivious B-trees
    Bender, MA
    Demaine, ED
    Farach-Colton, M
    SIAM JOURNAL ON COMPUTING, 2005, 35 (02) : 341 - 358
  • [23] On the limits of cache-oblivious rational permutations
    Silvestri, Francesco
    THEORETICAL COMPUTER SCIENCE, 2008, 402 (2-3) : 221 - 233
  • [24] Cache-Oblivious R-Trees
    Arge, Lars
    de Berg, Mark
    Haverkort, Herman
    ALGORITHMICA, 2009, 53 (01) : 50 - 68
  • [25] Cache-oblivious planar shortest paths
    Jampala, H
    Zeh, N
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2005, 3580 : 563 - 575
  • [26] Optimal cache-oblivious implicit dictionaries
    Franceschini, G
    Grossi, R
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2003, 2719 : 316 - 331
  • [27] Cache-oblivious algorithms and data structures
    Brodal, GS
    ALGORITHM THEORY- SWAT 2004, 2004, 3111 : 3 - 13
  • [28] Optimal Cache-Oblivious Mesh Layouts
    Michael A. Bender
    Bradley C. Kuszmaul
    Shang-Hua Teng
    Kebin Wang
    Theory of Computing Systems, 2011, 48 : 269 - 296
  • [29] Cache-Oblivious R-Trees
    Lars Arge
    Mark de Berg
    Herman Haverkort
    Algorithmica, 2009, 53 : 50 - 68
  • [30] Cache-Oblivious Dynamic Programming for Bioinformatics
    Chowdhury, Rezaul Alam
    Le, Hai-Son
    Ramachandran, Vijaya
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (03) : 495 - 510