Adaptive Task-Oriented Resource Allocation for Large Dynamic Workflows on Opportunistic Resources

被引：0

作者：

Phung, Thanh Son ^{[1
]}

Thain, Douglas ^{[1
]}

机构：

[1] Univ Notre Dame, Notre Dame, IN 46556 USA

来源：

PROCEEDINGS 2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS 2024 | 2024年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/IPDPS57955.2024.00034

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Dynamic workflow management systems offer a solution to the problem of distributing a local application by packaging individual computations and their dependencies on-the-fly into tasks executable on remote workers. Such independent task execution allows workers to be launched in an opportunistic manner to maximize the current pool of resources at any given time, either through opportunistic systems (e.g., HTCondor, AWS Spot Instances), or conventional systems (e.g., SLURM, SGE) with backfilling enabled, as opposed to monolithic or message-passing applications requiring a fixed block of non-preemptible workers. However, the dynamic nature of task generation presents a significant challenge in terms of resource management as tasks must be allocated with some unknown amount of resources pre-execution but are only observable at runtime. This in turn results in potentially huge resource waste per task as (1) users lack direct knowledge about the relationship between tasks and resources, and thus cannot correctly specify the amount of resources a task needs in advance, and (2) workflows and tasks may exhibit stochastic behaviors at runtime, which complicates the process of resource management. In this paper, we (1) argue for the need of an adaptive resource allocator capable of allocating tasks at runtime and adjusting to random fluctuations and abrupt changes in a dynamic workflow without requiring any prior knowledge, and (2) introduce Greedy Bucketing and Exhaustive Bucketing: two robust, online, general-purpose, and prior-free allocation algorithms capable of producing quality estimates of a task's resource consumption as the workflow runs. Our results show that a resource allocator equipped with either algorithm consistently outperforms 5 alternative allocation algorithms on 7 diverse workflows and incurs at most 1.6 ms overhead per allocation in the steady state.

引用

页码：300 / 311

页数：12

共 50 条

[1] DRL-Driven Dynamic Resource Allocation for Task-Oriented Semantic Communication
Zhang, Haijun
Wang, Hongyu
Li, Yabo
Long, Keping
Nallanathan, Arumugam
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (07) : 3992 - 4004
[2] Toward Intelligent Resource Allocation on Task-Oriented Semantic Communication
Zhang, Haijun
Wang, Hongyu
Li, Yabo
Long, Keping
Leung, Victor C. M.
[J]. IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 70 - 77
[3] Adaptable Semantic Compression and Resource Allocation for Task-Oriented Communications
Liu, Chuanhong
Guo, Caili
Yang, Yang
Jiang, Nan
[J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (03) : 769 - 782
[4] Resource allocation and congestion control in task-oriented distributed sensor networks
Zhang, J
Kulasekere, EC
Premaratne, K
Bauer, PH
[J]. FIBER OPTIC SENSOR TECHNOLOGY AND APPLICATIONS 2001, 2001, 4578 : 220 - 228
[5] Adaptive Service Placement, Task Offloading and Bandwidth Allocation in Task-Oriented URLLC Edge Networks
Dang Van Huynh
Van-Dinh Nguyen
Dobre, Octavia A.
Khosravirad, Saeed R.
Duong, Trung Q.
[J]. ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5755 - 5760
[6] Resource Allocation and Common Message Selection for Task-Oriented Semantic Information Transmission With RSMA
Cheng, Yanyu
Niyato, Dusit
Du, Hongyang
Kang, Jiawen
Xiong, Zehui
Miao, Chunyan
Kim, Dong In
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (06) : 5557 - 5570
[7] Research on a dynamic resource organization method task-oriented for the loosely coupled system
Zhao, Yang
Yi, Xian-Qing
Luo, Xue-Shan
[J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2014, 36 (07): : 1328 - 1333
[8] Task-Oriented Adaptive Maintenance Support System
Huang, Ying
Wang, Xingjun
Gardoni, Mickael
Coulibaly, Amadou
[J]. JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2015, 15 (03)
[9] Opportunistic cooperation by dynamic resource allocation
Gunduz, Deniz
Erkip, Elza
[J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2007, 6 (04) : 1446 - 1454
[10] Task-oriented collaborative customer profit allocation mechanism
Zhang, Xue-Feng
Yang, Yu
Yu, Guo-Dong
Yang, Tao
Xue, Cheng-Meng
[J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2015, 21 (01): : 13 - 20

← 1 2 3 4 5 →