QoS-Aware Dynamic Resource Allocation for Spatial-Multitasking GPUs

Cited by: 0
Authors
Aguilera, Paula [1]
Morrow, Katherine [1]
Kim, Nam Sung [1]
Affiliations
[1] Univ Wisconsin, Madison, WI 53706 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
General-purpose computing on GPUs (GPGPU computing) is becoming widely adopted; however, some GPGPU applications fail to fully utilize GPU resources. In these cases, spatial multitasking better exploits the parallelism offered by GPUs by partitioning the GPU resources among simultaneously running applications. When one or more of these applications have quality-of-service (QoS) requirements, enough resources must be allocated to them to satisfy those requirements. The remaining resources can either be disabled to reduce power consumption or used to accelerate other applications. However, we observe that the amount of resources a QoS application needs to satisfy its performance requirement depends in part on the co-executing applications. In this paper, we propose a runtime technique to dynamically partition GPU resources between concurrently running applications, at least one of which has a QoS requirement. We demonstrate that the proposed technique can satisfy a 100% QoS requirement while also achieving either a 7 W reduction in power consumption or a 17.57% performance improvement for co-executing best-effort applications.
Pages: 726-731
Page count: 6