Scalable Adaptive NUMA-Aware Lock

被引：10

作者：

Zhang, Mingzhe ^{[1
]}

Chen, Haibo ^{[2
]}

Cheng, Luwei ^{[3
]}

Lau, Francis C. M. ^{[1
]}

Wang, Cho-Li ^{[1
]}

机构：

[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

[2] Shanghai Jiao Tong Univ, Inst Parallel & Distributed Syst, Minhang 200240, Qu, Peoples R China

[3] Facebook, Menlo Pk, CA 94025 USA

来源：

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS | 2017年 / 28卷 / 06期

关键词：

Delegation lock; adaptive synchronization;

D O I：

10.1109/TPDS.2016.2630695

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Scalable locking is a key building block for scalable multi-threaded software. Its performance is especially critical in multi-socket, multi-core machines with non-uniform memory access (NUMA). Previous schemes such as in-place locks and delegation locks only perform well under a certain level of contention, and often require non-trivial tuning for a particular configuration. Besides, in large NUMA systems, current delegation locks cannot perform satisfactorily due to lack of optimized NUMA policies. In this work, we propose SANL, a locking scheme that can deliver high performance under various contention levels by adaptively switching between in-place locks and delegation locks. To optimize the performance of delegation locks, we introduce a new NUMA policy that jointly considers node distances and server utilization when choosing lock servers. We have implemented SANL and evaluated it with four popular multi-threaded applications (Memcached, Berkeley DB, Phoenix2 and SPLASH-2), on a 40-core Intel machine and a 64-core AMD machine. The comparison results with seven other representative locking schemes show that SANL outperforms them in most contention situations. For example, in one group test, SANL is 3.7 times faster than RCL lock and 17 times faster than POSIX mutex.

引用

页码：1754 / 1769

页数：16

共 50 条

[31] NUMA-aware image compositing on multi-GPU platform
Wang, Pan
Cheng, Zhiquan
Martin, Ralph
Liu, Huahai
Cai, Xun
Li, Sikun
[J]. VISUAL COMPUTER, 2013, 29 (6-8): : 639 - 649
[32] A NUMA-aware Trajectory Store for Travel-Time Estimation
Waury, Robert
Jensen, Christian
Torp, Kristian
[J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 209 - 218
[33] HydraFS: an efficient NUMA-aware in-memory file system
Wu, Ting
Chen, Xianzhang
Liu, Kai
Xiao, Chunhua
Liu, Zhixiang
Zhuge, Qingfeng
Sha, Edwin H. -M.
[J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (02): : 705 - 724
[34] Evaluation of NUMA-Aware Scheduling in Warehouse-Scale Clusters
Wu, Richard
Zhang, Xiao
Kong, Xiangling
Chen, Yangyi
Jnagal, Rohit
Hagmann, Robert
[J]. 2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 475 - 477
[35] NUMA-aware CPU core allocation in cooperating dynamic applications
Dokulil, Jiri
Benkner, Siegfried
[J]. 2020 IEEE 34TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2020), 2020, : 950 - 957
[36] Optimal Design of NUMA-aware Persistent Memory Storage Engine
Tu, Yao-Feng
Chen, He-Dui
Wang, Han-Yi
Yan, Zong-Shuai
Kong, Lu
Chen, Bing
[J]. Ruan Jian Xue Bao/Journal of Software, 2022, 33 (03): : 891 - 908
[37] HydraFS: an efficient NUMA-aware in-memory file system
Ting Wu
Xianzhang Chen
Kai Liu
Chunhua Xiao
Zhixiang Liu
Qingfeng Zhuge
Edwin H.-M. Sha
[J]. Cluster Computing, 2020, 23 : 705 - 724
[38] NUMA-Aware Graph Mining Techniques for Performance and Energy Efficiency
Frasca, Michael
Madduri, Kamesh
Raghavan, Padma
[J]. 2012 INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC), 2012,
[39] NUMA-aware image compositing on multi-GPU platform
Pan Wang
Zhiquan Cheng
Ralph Martin
Huahai Liu
Xun Cai
Sikun Li
[J]. The Visual Computer, 2013, 29 : 639 - 649
[40] Reducing Cache Coherence Traffic with a NUMA-Aware Runtime Approach
Caheny, Paul
Alvarez, Lluc
Derradji, Said
Valero, Mateo
Moreto, Miquel
Casas, Marc
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (05) : 1174 - 1187

← 1 2 3 4 5 →