Scalable Adaptive NUMA-Aware Lock

被引:10
|
作者
Zhang, Mingzhe [1 ]
Chen, Haibo [2 ]
Cheng, Luwei [3 ]
Lau, Francis C. M. [1 ]
Wang, Cho-Li [1 ]
机构
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Parallel & Distributed Syst, Minhang 200240, Qu, Peoples R China
[3] Facebook, Menlo Pk, CA 94025 USA
关键词
Delegation lock; adaptive synchronization;
D O I
10.1109/TPDS.2016.2630695
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scalable locking is a key building block for scalable multi-threaded software. Its performance is especially critical in multi-socket, multi-core machines with non-uniform memory access (NUMA). Previous schemes such as in-place locks and delegation locks only perform well under a certain level of contention, and often require non-trivial tuning for a particular configuration. Besides, in large NUMA systems, current delegation locks cannot perform satisfactorily due to lack of optimized NUMA policies. In this work, we propose SANL, a locking scheme that can deliver high performance under various contention levels by adaptively switching between in-place locks and delegation locks. To optimize the performance of delegation locks, we introduce a new NUMA policy that jointly considers node distances and server utilization when choosing lock servers. We have implemented SANL and evaluated it with four popular multi-threaded applications (Memcached, Berkeley DB, Phoenix2 and SPLASH-2), on a 40-core Intel machine and a 64-core AMD machine. The comparison results with seven other representative locking schemes show that SANL outperforms them in most contention situations. For example, in one group test, SANL is 3.7 times faster than RCL lock and 17 times faster than POSIX mutex.
引用
收藏
页码:1754 / 1769
页数:16
相关论文
共 50 条
  • [31] NUMA-aware image compositing on multi-GPU platform
    Wang, Pan
    Cheng, Zhiquan
    Martin, Ralph
    Liu, Huahai
    Cai, Xun
    Li, Sikun
    [J]. VISUAL COMPUTER, 2013, 29 (6-8): : 639 - 649
  • [32] A NUMA-aware Trajectory Store for Travel-Time Estimation
    Waury, Robert
    Jensen, Christian
    Torp, Kristian
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 209 - 218
  • [33] HydraFS: an efficient NUMA-aware in-memory file system
    Wu, Ting
    Chen, Xianzhang
    Liu, Kai
    Xiao, Chunhua
    Liu, Zhixiang
    Zhuge, Qingfeng
    Sha, Edwin H. -M.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 705 - 724
  • [34] Evaluation of NUMA-Aware Scheduling in Warehouse-Scale Clusters
    Wu, Richard
    Zhang, Xiao
    Kong, Xiangling
    Chen, Yangyi
    Jnagal, Rohit
    Hagmann, Robert
    [J]. 2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 475 - 477
  • [35] NUMA-aware CPU core allocation in cooperating dynamic applications
    Dokulil, Jiri
    Benkner, Siegfried
    [J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 950 - 957
  • [36] Optimal Design of NUMA-aware Persistent Memory Storage Engine
    Tu, Yao-Feng
    Chen, He-Dui
    Wang, Han-Yi
    Yan, Zong-Shuai
    Kong, Lu
    Chen, Bing
    [J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 891 - 908
  • [37] HydraFS: an efficient NUMA-aware in-memory file system
    Ting Wu
    Xianzhang Chen
    Kai Liu
    Chunhua Xiao
    Zhixiang Liu
    Qingfeng Zhuge
    Edwin H.-M. Sha
    [J]. Cluster Computing, 2020, 23 : 705 - 724
  • [38] NUMA-Aware Graph Mining Techniques for Performance and Energy Efficiency
    Frasca, Michael
    Madduri, Kamesh
    Raghavan, Padma
    [J]. 2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
  • [39] NUMA-aware image compositing on multi-GPU platform
    Pan Wang
    Zhiquan Cheng
    Ralph Martin
    Huahai Liu
    Xun Cai
    Sikun Li
    [J]. The Visual Computer, 2013, 29 : 639 - 649
  • [40] Reducing Cache Coherence Traffic with a NUMA-Aware Runtime Approach
    Caheny, Paul
    Alvarez, Lluc
    Derradji, Said
    Valero, Mateo
    Moreto, Miquel
    Casas, Marc
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (05) : 1174 - 1187