ReCALL: Reordered Cache Aware Locality based Graph Processing

被引:0
|
作者
Lakhotia, Kartik [1 ]
Singapura, Shreyas [1 ]
Kannan, Rajgopal [2 ]
Prasanna, Viktor [1 ]
机构
[1] Univ Southern Calif, Ming Hsieh Dept Elect Engn, Los Angeles, CA 90007 USA
[2] US Army, Res Lab, Arlington, VA USA
基金
美国国家科学基金会;
关键词
Graph Analytics; Big Data; Cache Performance; Data Layout; Graph Reordering;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Sparse graph processing generates highly irregular Memory Access Patterns (MAP) which lack locality and result in poor cache performance. In this paper, we propose a novel graph ordering algorithm that addresses this problem. We observe that existing reordering algorithms primarily try to improve cache line utilization by enhancing spatial locality. They are oblivious to cache data reuse which reflects the temporal locality that MAP can possess. Our premise is that peak efficiency can be achieved by a graph order for which the resulting MAP exhibit both spatial and temporal locality. Therefore, we first introduce a new metric Profit, that quantifies cache data reuse leading to a heuristic pH that enhances temporal locality in the MAP of graph algorithms. Then we define a notion of dynamically matching MAP with cache contents in a way that jointly maximizes both cache data reuse and cache line utilization. To perform this joint optimization, we develop a Block Reordering algorithm which utilizes pH to rearrange blocks of consecutive nodes with high spatial locality. We evaluate our algorithm using 8 real world datasets and 4 representative graph algorithms. Experimental results show that graphs obtained by Block Reordering can achieve upto 2.3x speedup over the original graph order and consistently outperform the existing state of the art reordering technique by 20% to 25% reduction in cache misses.
引用
收藏
页码:273 / 282
页数:10
相关论文
共 50 条
  • [1] A locality aware cache diffusion system
    Casey, John
    Zhou, Wanlei
    [J]. JOURNAL OF SUPERCOMPUTING, 2010, 52 (01): : 1 - 22
  • [2] A locality aware cache diffusion system
    John Casey
    Wanlei Zhou
    [J]. The Journal of Supercomputing, 2010, 52 : 1 - 22
  • [3] Locality Aware Skip Graph
    Hassanzadeh-Nazarabadi, Yahya
    Kupcu, Alptekin
    Ozkasap, Oznur
    [J]. 2015 IEEE 35th International Conference on Distributed Computing Systems Workshops (ICDCSW), 2015, : 105 - 111
  • [4] Spatial Locality-Aware Cache Partitioning for Effective Cache Sharing
    Gupta, Saurabh
    Zhou, Huiyang
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 150 - 159
  • [5] Reuse locality aware cache partitioning for last-level cache
    Shen, Fanfan
    He, Yanxiang
    Zhang, Jun
    Li, Qingan
    Li, Jianhua
    Xu, Chao
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2019, 74 : 319 - 330
  • [6] Locality-aware cache random replacement policies
    Benedicte, Pedro
    Hernandez, Carles
    Abella, Jaume
    Cazorla, Francisco J.
    [J]. JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 93 : 48 - 61
  • [7] Locality-aware Thread Block Design in Single and Multi-GPU Graph Processing
    Fan, Quan
    Chen, Zizhong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, ARCHITECTURE AND STORAGE (NAS), 2021, : 148 - 151
  • [8] Locality-Aware Vertex Scheduling for GPU-based Graph Computation
    Park, Hyunsun
    Ahn, Junwhan
    Park, Eunhyeok
    Yoo, Sungjoo
    [J]. 2015 IFIP/IEEE INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2015, : 195 - 200
  • [9] DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors
    Holtryd, Nadja
    Manivannan, Madhavan
    Stenstrom, Per
    Pericas, Miquel
    [J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM IPDPS 2020, 2020, : 578 - 589
  • [10] A Locality-Aware Write Filter Cache for Energy Reduction of STTRAM-Based L1 Data Cache
    Kong, Joonho
    [J]. JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, 2016, 16 (01) : 80 - 90