Adaptive Task-Oriented Resource Allocation for Large Dynamic Workflows on Opportunistic Resources

被引:0
|
作者
Phung, Thanh Son [1 ]
Thain, Douglas [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/IPDPS57955.2024.00034
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Dynamic workflow management systems offer a solution to the problem of distributing a local application by packaging individual computations and their dependencies on-the-fly into tasks executable on remote workers. Such independent task execution allows workers to be launched in an opportunistic manner to maximize the current pool of resources at any given time, either through opportunistic systems (e.g., HTCondor, AWS Spot Instances), or conventional systems (e.g., SLURM, SGE) with backfilling enabled, as opposed to monolithic or message-passing applications requiring a fixed block of non-preemptible workers. However, the dynamic nature of task generation presents a significant challenge in terms of resource management as tasks must be allocated with some unknown amount of resources pre-execution but are only observable at runtime. This in turn results in potentially huge resource waste per task as (1) users lack direct knowledge about the relationship between tasks and resources, and thus cannot correctly specify the amount of resources a task needs in advance, and (2) workflows and tasks may exhibit stochastic behaviors at runtime, which complicates the process of resource management. In this paper, we (1) argue for the need of an adaptive resource allocator capable of allocating tasks at runtime and adjusting to random fluctuations and abrupt changes in a dynamic workflow without requiring any prior knowledge, and (2) introduce Greedy Bucketing and Exhaustive Bucketing: two robust, online, general-purpose, and prior-free allocation algorithms capable of producing quality estimates of a task's resource consumption as the workflow runs. Our results show that a resource allocator equipped with either algorithm consistently outperforms 5 alternative allocation algorithms on 7 diverse workflows and incurs at most 1.6 ms overhead per allocation in the steady state.
引用
收藏
页码:300 / 311
页数:12
相关论文
共 50 条
  • [1] DRL-Driven Dynamic Resource Allocation for Task-Oriented Semantic Communication
    Zhang, Haijun
    Wang, Hongyu
    Li, Yabo
    Long, Keping
    Nallanathan, Arumugam
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (07) : 3992 - 4004
  • [2] Toward Intelligent Resource Allocation on Task-Oriented Semantic Communication
    Zhang, Haijun
    Wang, Hongyu
    Li, Yabo
    Long, Keping
    Leung, Victor C. M.
    [J]. IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 70 - 77
  • [3] Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications
    Liu, Chuanhong
    Guo, Caili
    Yang, Yang
    Jiang, Nan
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 769 - 782
  • [4] Resource allocation and congestion control in task-oriented distributed sensor networks
    Zhang, J
    Kulasekere, EC
    Premaratne, K
    Bauer, PH
    [J]. FIBER OPTIC SENSOR TECHNOLOGY AND APPLICATIONS 2001, 2001, 4578 : 220 - 228
  • [5] Adaptive Service Placement, Task Offloading and Bandwidth Allocation in Task-Oriented URLLC Edge Networks
    Dang Van Huynh
    Van-Dinh Nguyen
    Dobre, Octavia A.
    Khosravirad, Saeed R.
    Duong, Trung Q.
    [J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5755 - 5760
  • [6] Resource Allocation and Common Message Selection for Task-Oriented Semantic Information Transmission With RSMA
    Cheng, Yanyu
    Niyato, Dusit
    Du, Hongyang
    Kang, Jiawen
    Xiong, Zehui
    Miao, Chunyan
    Kim, Dong In
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 5557 - 5570
  • [7] Research on a dynamic resource organization method task-oriented for the loosely coupled system
    Zhao, Yang
    Yi, Xian-Qing
    Luo, Xue-Shan
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2014, 36 (07): : 1328 - 1333
  • [8] Task-Oriented Adaptive Maintenance Support System
    Huang, Ying
    Wang, Xingjun
    Gardoni, Mickael
    Coulibaly, Amadou
    [J]. JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2015, 15 (03)
  • [9] Opportunistic cooperation by dynamic resource allocation
    Gunduz, Deniz
    Erkip, Elza
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2007, 6 (04) : 1446 - 1454
  • [10] Task-oriented collaborative customer profit allocation mechanism
    Zhang, Xue-Feng
    Yang, Yu
    Yu, Guo-Dong
    Yang, Tao
    Xue, Cheng-Meng
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2015, 21 (01): : 13 - 20