A Quantitative Study of Locality in GPU Caches for Memory-Divergent Workloads

被引:3
|
作者
Lal, Sohan [1 ,2 ]
Varma, Bogaraju Sharatchandra [3 ]
Juurlink, Ben [2 ]
机构
[1] Tech Univ Hamburg, Hamburg, Germany
[2] Tech Univ Berlin, Berlin, Germany
[3] Ulster Univ, Jordanstown, North Ireland
关键词
Data locality; GPU caches; Memory divergence;
D O I
10.1007/s10766-022-00729-2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
GPUs are capable of delivering peak performance in TFLOPs, however, peak performance is often difficult to achieve due to several performance bottlenecks. Memory divergence is one such performance bottleneck that makes it harder to exploit locality, cause cache thrashing, and high miss rate, therefore, impeding GPU performance. As data locality is crucial for performance, there have been several efforts to exploit data locality in GPUs. However, there is a lack of quantitative analysis of data locality, which could pave the way for optimizations. In this paper, we quantitatively study the data locality and its limits in GPUs at different granularities. We show that, in contrast to previous studies, there is a significantly higher inter-warp locality at the L1 data cache for memory-divergent workloads. We further show that about 50% of the cache capacity and other scarce resources such as NoC bandwidth are wasted due to data over-fetch caused by memory divergence. While the low spatial utilization of cache lines justifies the sectored-cache design to only fetch those sectors of a cache line that are needed during a request, our limit study reveals the lost spatial locality for which additional memory requests are needed to fetch the other sectors of the same cache line. The lost spatial locality presents opportunities for further optimizing the cache design.
引用
收藏
页码:189 / 216
页数:28
相关论文
共 50 条
  • [31] A Quantitative Study of Memory System Interference in Chip Multiprocessor Architectures
    Jahre, Magnus
    Grannaes, Marius
    Natvig, Lasse
    HPCC: 2009 11TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2009, : 622 - 629
  • [32] Memory for the quantitative and qualitative aspects of labour pain: a preliminary study
    Terry, R
    Gijsbers, K
    JOURNAL OF REPRODUCTIVE AND INFANT PSYCHOLOGY, 2000, 18 (02) : 143 - 152
  • [33] Quantitative evaluation study on metal magnetic memory of welding cracks
    Di, Xin-Jie
    Li, Wu-Shen
    Bai, Shi-Wu
    Liu, Fang-Ming
    Xue, Zhen-Kui
    Cailiao Gongcheng/Journal of Materials Engineering, 2006, (07): : 56 - 60
  • [34] A METHODOLOGICAL STUDY OF THE PREPARATION OF CONNECTED VERBAL STIMULI FOR QUANTITATIVE MEMORY EXPERIMENTS
    LEVITT, EE
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1956, 52 (01): : 33 - 38
  • [35] Originality of divergent thinking is associated with working memory ? related brain activity: Evidence from a large sample study
    Takeuchi, Hikaru
    Taki, Yasuyuki
    Nouchi, Rui
    Yokoyama, Ryoichi
    Kotozaki, Yuka
    Nakagawa, Seishu
    Sekiguchi, Atsushi
    Iizuka, Kunio
    Hanawa, Sugiko
    Araki, Tsuyoshi
    Miyauchi, Carlos Makoto
    Sakaki, Kohei
    Sassa, Yuko
    Nozawa, Takayuki
    Ikeda, Shigeyuki
    Yokota, Susumu
    Magistro, Daniele
    Kawashima, Ryuta
    NEUROIMAGE, 2020, 216
  • [36] Quantitative study of metal magnetic memory signal versus local stress concentration
    Wang, Z. D.
    Yao, K.
    Deng, B.
    Ding, K. Q.
    NDT & E INTERNATIONAL, 2010, 43 (06) : 513 - 518
  • [37] Quantitative study of magnetic memory signal characteristic affected by external magnetic field
    Liu, Bin
    He, Luyao
    Zhang, Hai
    Sfarra, Stefano
    Fernandes, Henrique
    Perilli, Stefano
    Ren, Jian
    MEASUREMENT, 2019, 131 : 730 - 736
  • [38] The Coherence of the Working Memory Study Between Deep Neural Networks and Neurophysiology: Insights From Distinguishing Topographical Electroencephalogram Data Under Different Workloads
    Ming, Yurui
    Lin, Chin-Teng
    IEEE SYSTEMS MAN AND CYBERNETICS MAGAZINE, 2021, 7 (04): : 24 - 30
  • [39] Experimental and numerical study of the centripetal behavior of the divergent bracing frame equipped with rotational friction damper and shape memory bolts
    Esfahani, Ali Naseri
    Zareei, Seyed Alireza
    Birzhandi, Mohammad Sadegh
    Zafarani, Mohammad Mahdi
    CONSTRUCTION AND BUILDING MATERIALS, 2024, 438
  • [40] Memory Performance and Quantitative Neuroimaging Software in Mild Cognitive Impairment: A Concurrent Validity Study
    Umfleet, Laura Glass
    Butts, Alissa M.
    Janecek, Julie K.
    Reiter, Katherine
    Agarwal, Mohit
    Brett, Benjamin L.
    Ryan, Joseph J.
    Reuss, James
    Klein, Andrew
    Correro, Anthony N., II
    Franczak, Malgorzata
    JOURNAL OF THE INTERNATIONAL NEUROPSYCHOLOGICAL SOCIETY, 2020, 26 (10) : 954 - 962