Pricing Scheme based Nash Q-Learning Flow Control for Multi-user Network

被引:0
|
作者
Li, Xin [1 ]
Yu, Haibin [1 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
关键词
pricing scheme; Nash Q-learning; multi-user; ADMISSION CONTROLLER;
D O I
10.4028/www.scientific.net/KEM.467-469.847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the congestion problems with multi-user existing in high-speed networks, a pricing scheme based Nash Q-learning flow controller is proposed. It considers a network with a single service provider, and some non-cooperative users. The pricing scheme is introduced to the design of the reward function in the learning process of Q-learning. Because of the uncertainties and highly time-varying, it is not easy to accurately obtain the complete information for high-speed networks. The Nash Q-learning, which is independent of mathematic model, shows particular superiority. It obtains the Nash Q-values through trial-and-error and interaction with the environment to improve its behavior policy. By means of learning process, the proposed controller can learn to take the best actions to regulate source flow with the features of high quality of service. Simulation results show that the proposed controller can promote the performance of the networks and avoid the occurrence of congestion effectively.
引用
收藏
页码:847 / 852
页数:6
相关论文
共 50 条
  • [21] Multi-user Multi-service Rate Allocation Scheme using Nash Bargaining Solution
    Wang, Zelang
    Chen, Huifang
    Xie, Lei
    [J]. 2011 IEEE VEHICULAR TECHNOLOGY CONFERENCE (VTC FALL), 2011,
  • [22] Q-Learning Based Autonomous Control of the Auxiliary Power Network of a Ship
    Huotari, Janne
    Ritari, Antti
    Ojala, Risto
    Vepsalainen, Jari
    Tammi, Kari
    [J]. IEEE ACCESS, 2019, 7 : 152879 - 152890
  • [23] A Multi-User Collaborative Access Control Scheme Based on New Hash Chain
    Wang, Zetian
    Li, Yunfa
    Liu, Guanxu
    Zhang, Di
    [J]. ELECTRONICS, 2023, 12 (08)
  • [24] Q-learning control based on self-organizing RBF network
    Xu, Ming-Liang
    Xu, Wen-Bo
    [J]. Kongzhi yu Juece/Control and Decision, 2010, 25 (02): : 303 - 306
  • [25] Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm
    Wei, Qinglai
    Liu, Derong
    Song, Ruizhuo
    [J]. 2016 SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2016, : 125 - 130
  • [26] Dynamic Q-Learning for Intersection Traffic Flow Control Based on Agents
    Vista, Felipe P.
    Zhou, Xuan
    Ryu, Ji Hyoung
    Chong, Kil To
    [J]. ADVANCED SCIENCE LETTERS, 2014, 20 (01) : 120 - 123
  • [27] Formation Control of Multi-quadrotors Based on Deep Q-learning
    Mousavifard, Roujin
    Alipour, Khalil
    Najafqolian, Mohammad Amin
    Zarafshan, Payam
    [J]. 2022 10TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2022, : 172 - 177
  • [28] Network pricing optimization in multi-user and multimodal context with elastic demand
    Bellei, G
    Gentile, G
    Papola, N
    [J]. TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2002, 36 (09) : 779 - 798
  • [29] A multi-user reusing one slot scheme for GSM network
    School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China
    不详
    [J]. Beijing Ligong Daxue Xuebao, 2009, 9 (815-819):
  • [30] Pricing in agent economies using multi-agent Q-learning
    Tesauro, G
    Kephart, JO
    [J]. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2002, 5 (03) : 289 - 304