Adversarial Deep Learning for Online Resource Allocation

被引：2

作者：

Du, Bingqian ^{[1
]}

Huang, Zhiyi ^{[1
]}

Wu, Chuan ^{[1
]}

机构：

[1] Univ Hong Kong, Pokfulam, Dept Comp Sci, Hong Kong, Peoples R China

来源：

ACM TRANSACTIONS ON MODELING AND PERFORMANCE EVALUATION OF COMPUTING SYSTEMS | 2021年 / 6卷 / 04期

关键词：

Neural networks; adversarial learning; online algorithm; PRIMAL-DUAL ALGORITHMS; AUCTIONS;

D O I：

10.1145/3494526

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Online algorithms are an important branch in algorithm design. Designing online algorithms with a bounded competitive ratio (in terms of worst-case performance) can be hard and usually relies on problem-specific assumptions. Inspired by adversarial training from Generative Adversarial Net and the fact that the competitive ratio of an online algorithm is based on worst-case input, we adopt deep neural networks (NNs) to learn an online algorithm for a resource allocation and pricing problem from scratch, with the goal that the performance gap between offline optimum and the learned online algorithm can be minimized for worst-case input. Specifically, we leverage two NNs as the algorithm and the adversary, respectively, and let them play a zero sum game, with the adversary being responsible for generating worst-case input while the algorithm learns the best strategy based on the input provided by the adversary. To ensure better convergence of the algorithm network (to the desired online algorithm), we propose a novel per-round update method to handle sequential decision making to break complex dependency among different rounds so that update can be done for every possible action instead of only sampled actions. To the best of our knowledge, our work is the first using deep NNs to design an online algorithm from the perspective of worst-case performance guarantee. Empirical studies show that our updating methods ensure convergence to Nash equilibrium and the learned algorithm outperforms state-of-the-art online algorithms under various settings.

引用

页数：25

共 50 条

[1] Deep Reinforcement Learning for Online Resource Allocation in Network Slicing
Cai, Yue
Cheng, Peng
Chen, Zhuo
Ding, Ming
Vucetic, Branka
Li, Yonghui
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7099 - 7116
[2] Deep learning for online computation offloading and resource allocation in NOMA
Niu, Juncui
Zhang, Shubin
Chi, Kaikai
Shen, Guanqun
Gao, Wei
COMPUTER NETWORKS, 2022, 216
[3] Online Resource Allocation with Personalized Learning
Zhalechian, Mohammad
Keyvanshokooh, Esmaeil
Shi, Cong
Van Oyen, Mark P.
OPERATIONS RESEARCH, 2022, 70 (04) : 2138 - 2161
[4] Online Learning for Network Resource Allocation
Salem T.S.
Performance Evaluation Review, 2023, 50 (03): : 20 - 23
[5] Federated Learning for Online Resource Allocation in Mobile Edge Computing: A Deep Reinforcement Learning Approach
Zheng, Jingjing
Li, Kai
Mhaisen, Naram
Ni, Wei
Tovar, Eduardo
Guizani, Mohsen
2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
[6] Predictive Resource Allocation with Deep Learning
Guo, Jia
Yang, Chenyang
2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
[7] RADEAN: A Resource Allocation Model Based on Deep Reinforcement Learning and Generative Adversarial Networks in Edge Computing
Yu, Zhaoyang
Zhao, Sinong
Su, Tongtong
Liu, Wenwen
Liu, Xiaoguang
Wang, Gang
Wang, Zehua
Leung, Victor C. M.
MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, MOBIQUITOUS 2023, PT I, 2024, 593 : 257 - 277
[8] Deep Reinforcement Learning for Online Resource Allocation in IoT Networks: Technology, Development, and Future Challenges
Cheng, Peng
Chen, Youjia
Ding, Ming
Chen, Zhuo
Liu, Sige
Chen, Yi-Ping Phoebe
IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (06) : 111 - 117
[9] Resource Allocation in URLLC with Online Learning for Mobile Users
Zhang, Jie
Sun, Chengjian
Yang, Chenyang
2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
[10] Online Learning Methods for Border Patrol Resource Allocation
Klima, Richard
Kiekintveld, Christopher
Lisy, Viliam
DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2014, 2014, 8840 : 340 - 349

← 1 2 3 4 5 →