BLQ: Light-Weight Locality-Aware Runtime for Blocking-Less Queuing

Cited by: 0
Authors
Wu, Qinzhe [1]
Li, Ruihao [1]
Beard, Jonathan [2, 3]
John, Lizy [1]
Affiliations
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Arm, Waltham, MA USA
[3] Google LLC, Mountain View, CA USA
Keywords
Message Queue; Parallel Processing; Runtime; PARALLELISM; TASK; OPENMP
DOI
10.1145/3640537.3641568
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Message queues are widely used in parallel processing systems for worker thread synchronization. When there is a throughput mismatch between the upstream and downstream tasks, the message queue buffer is often either empty or full. Polling on an empty or full queue hurts the performance of upstream or downstream threads, since those polling cycles could have been spent on other computation. A non-blocking queue is an alternative that allows polling cycles to be spared for other tasks of the application's choice. However, application programmers should not have to bear this burden, because a good decision about what to do upon blocking has to take a great deal of runtime environment information into consideration. This paper proposes Blocking-Less Queuing Runtime (BLQ), a systematic solution capable of finding the proper strategies at (or before) blocking, as well as lightening the programmers' burden. BLQ collects a set of solutions, including yielding, advanced dynamic queue buffer resizing, and resource-aware task scheduling. The evaluation on high-end servers shows that a set of diverse parallel queuing workloads could reduce blocking and lower cache misses with BLQ. BLQ outperforms the baseline runtime considerably (with up to 3.8x peak speedup).
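To make the yielding idea in the abstract concrete, here is a minimal sketch in Python. It is not BLQ's implementation or API: the names `NonBlockingQueue`, `try_push`, `try_pop`, `producer`, `consumer`, and `simulate` are all hypothetical, and cooperative generators stand in for worker threads so the run is deterministic. It only illustrates the general pattern of a queue operation that reports failure immediately, letting the task hand the core to other work instead of polling.

```python
from collections import deque


class NonBlockingQueue:
    """Bounded FIFO whose operations return immediately instead of waiting."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._items = deque()

    def try_push(self, item):
        # Report failure on a full buffer rather than polling for space.
        if len(self._items) >= self.capacity:
            return False
        self._items.append(item)
        return True

    def try_pop(self):
        # Report failure on an empty buffer rather than polling for data.
        if not self._items:
            return False, None
        return True, self._items.popleft()


def producer(queue, values, stats):
    """Fast upstream task: pushes until the buffer is full, then yields."""
    for value in values:
        while not queue.try_push(value):
            stats["producer_yields"] += 1
            yield  # queue full: give the core to another task


def consumer(queue, expected, out, stats):
    """Slow downstream task: handles at most one item per scheduling turn."""
    while len(out) < expected:
        ok, value = queue.try_pop()
        if ok:
            out.append(value)
        else:
            stats["consumer_yields"] += 1
        yield  # yield after every attempt to model the slower stage


def simulate(capacity, count):
    """Run both tasks under a round-robin scheduler (an assumed policy)."""
    queue = NonBlockingQueue(capacity)
    out = []
    stats = {"producer_yields": 0, "consumer_yields": 0}
    ready = deque([
        producer(queue, range(count), stats),
        consumer(queue, count, out, stats),
    ])
    while ready:
        task = ready.popleft()
        try:
            next(task)
        except StopIteration:
            continue  # task finished; do not reschedule it
        ready.append(task)
    return out, stats


if __name__ == "__main__":
    items, counters = simulate(capacity=2, count=6)
    print(items, counters)
```

In this toy setup the upstream task outpaces the downstream one, so the buffer is repeatedly full and the producer yields rather than spinning. Choosing what to do at that point (yield, resize the buffer, or reschedule) is the decision the abstract says BLQ takes off the programmer's hands.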
Pages: 100-112
Page count: 13