Adversarial Deep Learning for Online Resource Allocation

被引:2
|
作者
Du, Bingqian [1 ]
Huang, Zhiyi [1 ]
Wu, Chuan [1 ]
机构
[1] Univ Hong Kong, Pokfulam, Dept Comp Sci, Hong Kong, Peoples R China
关键词
Neural networks; adversarial learning; online algorithm; PRIMAL-DUAL ALGORITHMS; AUCTIONS;
D O I
10.1145/3494526
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online algorithms are an important branch in algorithm design. Designing online algorithms with a bounded competitive ratio (in terms of worst-case performance) can be hard and usually relies on problem-specific assumptions. Inspired by adversarial training from Generative Adversarial Net and the fact that the competitive ratio of an online algorithm is based on worst-case input, we adopt deep neural networks (NNs) to learn an online algorithm for a resource allocation and pricing problem from scratch, with the goal that the performance gap between offline optimum and the learned online algorithm can be minimized for worst-case input. Specifically, we leverage two NNs as the algorithm and the adversary, respectively, and let them play a zero sum game, with the adversary being responsible for generating worst-case input while the algorithm learns the best strategy based on the input provided by the adversary. To ensure better convergence of the algorithm network (to the desired online algorithm), we propose a novel per-round update method to handle sequential decision making to break complex dependency among different rounds so that update can be done for every possible action instead of only sampled actions. To the best of our knowledge, our work is the first using deep NNs to design an online algorithm from the perspective of worst-case performance guarantee. Empirical studies show that our updating methods ensure convergence to Nash equilibrium and the learned algorithm outperforms state-of-the-art online algorithms under various settings.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A Multi-Agent Learning Approach to Online Distributed Resource Allocation
    Zhang, Chongjie
    Lesser, Victor
    Shenoy, Prashant
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 361 - 366
  • [32] Online Learning for Load Balancing of Unknown Monotone Resource Allocation Games
    Bistritz, Ilai
    Bambos, Nicholas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [33] Adversarial reasoning and resource allocation: The LG approach
    Stilman, B
    Yakhnis, V
    Umanskiy, O
    Boyd, R
    Enabling Technologies for Simulation Science IX, 2005, 5805 : 177 - 188
  • [34] Deep resource allocation for a massively multiplayer online finance of tourism gamification in metaverse
    Chu, Chung-Hua
    INFORMATION TECHNOLOGY & TOURISM, 2023, 25 (04) : 565 - 583
  • [35] Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVs
    Li, Kai
    Emami, Yousef
    Ni, Wei
    Tovar, Eduardo
    Han, Zhu
    IEEE Networking Letters, 2020, 2 (03): : 106 - 110
  • [36] Deep resource allocation for a massively multiplayer online finance of tourism gamification in metaverse
    Chung-Hua Chu
    Information Technology & Tourism, 2023, 25 : 565 - 583
  • [37] Adversarial Online Learning with noise
    Resler, Alon
    Mansour, Yishay
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [38] Online Learning with Adversarial Delays
    Quanrud, Kent
    Khashabi, Daniel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [39] Online mobile learning resource recommendation method based on deep reinforcement learning
    Li, Pingyang
    Zhang, Juan
    INTERNATIONAL JOURNAL OF INNOVATION AND SUSTAINABLE DEVELOPMENT, 2025, 19 (01)
  • [40] Resource Allocation Using Deep Learning in Mobile Small Cell Networks
    Zafar, Saniya
    Jangsher, Sobia
    Al-Dweik, Arafat
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2022, 6 (03): : 1903 - 1915