Application-aware Resource Sharing using Software and Hardware Partitioning on Modern GPUs

被引:0
|
作者
Adufu, Theodora [1 ]
Ha, Jiwon [2 ]
Kim, Yoonhee [1 ]
机构
[1] Sookmyung Womens Univ, Dept Comp Sci, Seoul, South Korea
[2] Seoul Natl Univ, Dept Comp Sci, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Resource sharing; resource under-utilization; concurrency; hardware partitioning;
D O I
10.1109/NOMS59830.2024.10574996
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graphic Processing Units (GPUs) are known for the large computing capabilities they offer users compared to traditional CPUs. However, the issue of resource under-utilization is becoming more apparent as more and more applications are unable to saturate modern GPUs which have even higher processing capabilities. While concurrency mechanisms like hardware partitioning have resulted in better utilization compared to deployments without sharing, the issue of resource under-utilization still persists even in deployment scenarios where applications are executed on the smallest GPU partitions of modern GPUs. Software partitioning on the other hand, does not guarantee isolation during executions leading to issues of interference and consequently limiting the number of applications which can be run concurrently. Leveraging both software and hardware resource partitioning schemes in an effort to mitigate resource under-utilization issues is yet to be fully explored. In this paper, we evaluate the predictions of a proposed linear regression model relative to actual executions. The results of our experiments show that whilst our approach accurately estimates performance for sharing differently-sized GPU partitions among diverse applications based on each application's characteristics, it also improves utilization and reduces resource wastage.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Scalable Application-Aware Resource Management in Software Defined Networking
    Liu, Jiyang
    Zhu, Liang
    Sun, Weiqiang
    Hu, Weisheng
    [J]. 2015 17th International Conference on Transparent Optical Networks (ICTON), 2015,
  • [2] Application-aware NoC management in GPUs multitasking
    Zhen Xu
    Xia Zhao
    Zhiying Wang
    Canqun Yang
    [J]. The Journal of Supercomputing, 2019, 75 : 4710 - 4730
  • [3] Application-aware NoC management in GPUs multitasking
    Xu, Zhen
    Zhao, Xia
    Wang, Zhiying
    Yang, Canqun
    [J]. JOURNAL OF SUPERCOMPUTING, 2019, 75 (08): : 4710 - 4730
  • [4] Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps
    Arima, Eishi
    Kang, Minjoon
    Saba, Issa
    Weidendorfer, Josef
    Trinitis, Carsten
    Schulz, Martin
    [J]. 51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS PROCEEDINGS, ICPP 2022, 2022,
  • [5] Application-Aware Diagnosis of Runtime Hardware Faults
    Pellegrini, Andrea
    Bertacco, Valeria
    [J]. 2010 IEEE AND ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2010, : 487 - 492
  • [6] Resource and application-aware resource discovery in computing environments
    Norouzi, Mohammad
    Jannesari, Ali
    [J]. JOURNAL OF SUPERCOMPUTING, 2015, 71 (03): : 824 - 839
  • [7] Resource and application-aware resource discovery in computing environments
    Mohammad Norouzi
    Ali Jannesari
    [J]. The Journal of Supercomputing, 2015, 71 : 824 - 839
  • [8] Dynamic Application-Aware Resource Management Using Software-Defined Networking: Implementation Prospects and Challenges
    Zinner, Thomas
    Jarschel, Michael
    Blenk, Andreas
    Wamser, Florian
    Kellerer, Wolfgang
    [J]. 2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014,
  • [9] Application-aware adaptive partitioning for graph processing systems
    Le Merrer, Erwan
    Tredan, Gilles
    [J]. 2019 IEEE 27TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2019), 2019, : 235 - 240
  • [10] Hardware resource allocation for hardware/software partitioning in the LYCOS system
    Grode, J
    Knudsen, PV
    Madsen, J
    [J]. DESIGN, AUTOMATION AND TEST IN EUROPE, PROCEEDINGS, 1998, : 22 - 27