DI-MMAP—a scalable memory-map runtime for out-of-core data-intensive applications

被引:0
|
作者
Brian Van Essen
Henry Hsieh
Sasha Ames
Roger Pearce
Maya Gokhale
机构
[1] Center for Applied Scientific Computing,Lawrence Livermore National Laboratory
[2] University of California,Department of Computer Science
[3] Texas A&M University,Department of Computer Science and Engineering
来源
Cluster Computing | 2015年 / 18卷
关键词
Data-intensive; Memory-map runtime; Memory architecture; NVRAM;
D O I
暂无
中图分类号
学科分类号
摘要
We present DI-MMAP, a high-performance runtime that memory-maps large external data sets into an application’s address space and shows significantly better performance than the Linux mmap system call. Our implementation is particularly effective when used with high performance locally attached Flash arrays on highly concurrent, latency-tolerant data-intensive HPC applications. We describe the kernel module and show performance results on a benchmark test suite, a new bioinformatics metagenomic classification application, and on a level-asynchronous Breadth-First Search (BFS) graph traversal algorithm. Using DI-MMAP, the metagenomics classification application performs up to 4× better than standard Linux mmap. A fully external memory configuration of BFS executes up to 7.44× faster than traditional mmap. Finally, we demonstrate that DI-MMAP shows scalable out-of-core performance for BFS traversal in main memory constrained scenarios. Such scalable memory constrained performance would allow a system with a fixed amount of memory to solve a larger problem as well as provide memory QoS guarantees for systems running multiple data-intensive applications.
引用
收藏
页码:15 / 28
页数:13
相关论文
共 4 条
  • [1] DI-MMAP-a scalable memory-map runtime for out-of-core data-intensive applications
    Van Essen, Brian
    Hsieh, Henry
    Ames, Sasha
    Pearce, Roger
    Gokhale, Maya
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (01): : 15 - 28
  • [2] DI-MMAP: A High Performance Memory-Map Runtime for Data-Intensive Applications
    Van Essen, Brian
    Hsieh, Henry
    Ames, Sasha
    Gokhale, Maya
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 731 - 735
  • [3] Tools for improving the out-of-core performance of data and computation intensive applications
    Valsalam, VK
    Reese, DS
    PROCEEDINGS OF 1999 SYMPOSIUM ON PERFORMANCE EVALUATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS, 1999, : 89 - 96
  • [4] Improving Performance of Structured-memory, Data-Intensive Applications on Multi-core Platforms via a Space-Filling Curve Memory Layout
    Bethel, E. Wes
    Camp, David
    Donofrio, David
    Howison, Mark
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 565 - 574