Adversarial Deep Learning for Online Resource Allocation

被引:2
|
作者
Du, Bingqian [1 ]
Huang, Zhiyi [1 ]
Wu, Chuan [1 ]
机构
[1] Univ Hong Kong, Pokfulam, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Neural networks; adversarial learning; online algorithm; PRIMAL-DUAL ALGORITHMS; AUCTIONS;
D O I
10.1145/3494526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online algorithms are an important branch in algorithm design. Designing online algorithms with a bounded competitive ratio (in terms of worst-case performance) can be hard and usually relies on problem-specific assumptions. Inspired by adversarial training from Generative Adversarial Net and the fact that the competitive ratio of an online algorithm is based on worst-case input, we adopt deep neural networks (NNs) to learn an online algorithm for a resource allocation and pricing problem from scratch, with the goal that the performance gap between offline optimum and the learned online algorithm can be minimized for worst-case input. Specifically, we leverage two NNs as the algorithm and the adversary, respectively, and let them play a zero sum game, with the adversary being responsible for generating worst-case input while the algorithm learns the best strategy based on the input provided by the adversary. To ensure better convergence of the algorithm network (to the desired online algorithm), we propose a novel per-round update method to handle sequential decision making to break complex dependency among different rounds so that update can be done for every possible action instead of only sampled actions. To the best of our knowledge, our work is the first using deep NNs to design an online algorithm from the perspective of worst-case performance guarantee. Empirical studies show that our updating methods ensure convergence to Nash equilibrium and the learned algorithm outperforms state-of-the-art online algorithms under various settings.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] Deep Reinforcement Learning for Online Resource Allocation in Network Slicing
    Cai, Yue
    Cheng, Peng
    Chen, Zhuo
    Ding, Ming
    Vucetic, Branka
    Li, Yonghui
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (06) : 7099 - 7116
  • [2] Deep learning for online computation offloading and resource allocation in NOMA
    Niu, Juncui
    Zhang, Shubin
    Chi, Kaikai
    Shen, Guanqun
    Gao, Wei
    COMPUTER NETWORKS, 2022, 216
  • [3] Online Resource Allocation with Personalized Learning
    Zhalechian, Mohammad
    Keyvanshokooh, Esmaeil
    Shi, Cong
    Van Oyen, Mark P.
    OPERATIONS RESEARCH, 2022, 70 (04) : 2138 - 2161
  • [4] Online Learning for Network Resource Allocation
    Salem T.S.
    Performance Evaluation Review, 2023, 50 (03): : 20 - 23
  • [5] Federated Learning for Online Resource Allocation in Mobile Edge Computing: A Deep Reinforcement Learning Approach
    Zheng, Jingjing
    Li, Kai
    Mhaisen, Naram
    Ni, Wei
    Tovar, Eduardo
    Guizani, Mohsen
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [6] Predictive Resource Allocation with Deep Learning
    Guo, Jia
    Yang, Chenyang
    2018 IEEE 88TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2018,
  • [7] RADEAN: A Resource Allocation Model Based on Deep Reinforcement Learning and Generative Adversarial Networks in Edge Computing
    Yu, Zhaoyang
    Zhao, Sinong
    Su, Tongtong
    Liu, Wenwen
    Liu, Xiaoguang
    Wang, Gang
    Wang, Zehua
    Leung, Victor C. M.
    MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES, MOBIQUITOUS 2023, PT I, 2024, 593 : 257 - 277
  • [8] Deep Reinforcement Learning for Online Resource Allocation in IoT Networks: Technology, Development, and Future Challenges
    Cheng, Peng
    Chen, Youjia
    Ding, Ming
    Chen, Zhuo
    Liu, Sige
    Chen, Yi-Ping Phoebe
    IEEE COMMUNICATIONS MAGAZINE, 2023, 61 (06) : 111 - 117
  • [9] Resource Allocation in URLLC with Online Learning for Mobile Users
    Zhang, Jie
    Sun, Chengjian
    Yang, Chenyang
    2021 IEEE 93RD VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-SPRING), 2021,
  • [10] Online Learning Methods for Border Patrol Resource Allocation
    Klima, Richard
    Kiekintveld, Christopher
    Lisy, Viliam
    DECISION AND GAME THEORY FOR SECURITY, GAMESEC 2014, 2014, 8840 : 340 - 349