Multi-Agent Reinforcement Learning based Bit Allocation for Gaming Video Coding

被引:0
|
作者
Ren, Guangjie [1 ,2 ]
Liu, Zizheng [3 ]
Chen, Zhenzhong [1 ]
Liu, Shan [4 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan, Peoples R China
[2] Tencent, Shanghai, Peoples R China
[3] Tencent, Shenzhen, Peoples R China
[4] Tencent Amer, Palo Alto, CA USA
关键词
Quality Stability; ROI; bit allocation; reinforcement learning; VVC; gaming video; LEVEL; ALGORITHM; SCHEME;
D O I
10.1109/PCS60826.2024.10566434
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a multi-agent reinforcement learning based bit allocation method towards quality stability for gaming video coding in Versatile Video Coding (VVC). The bits allocated to regions-of-interests (ROI) are critical to obtain a subjectively optimal visual quality but also constrained subject to the frame-level bit budgets. A multi-objective partially observable stochastic game is formulated by combining the frame-level and ROI-level bit allocation process, which optimizes both the quality and fluctuation simultaneously. The proposed method is implemented in VVC and verified with gaming video. A multi-agent reinforcement learning method is utilized for training the agents and obtaining reasonable bit allocation actions. In comparison to the reference methods, the proposed method achieves a more consistent quality at both the frame-level and ROI-level, while improving the quality of ROI.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Reinforcement Learning based ROI Bit Allocation for Gaming Video Coding in VVC
    Ren, Guangjie
    Liu, Zizheng
    Chen, Zhenzhong
    Liu, Shan
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [2] Coding for Distributed Multi-Agent Reinforcement Learning
    Wang, Baoqian
    Xie, Junfei
    Atanasov, Nikolay
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 10625 - 10631
  • [3] Multi-objective optimization based perceptual bit allocation for gaming video coding in VVC
    Ren, Guangjie
    Liu, Feiyang
    Wang, Huairui
    Yang, Daiqin
    Wang, Tao
    Wang, Sihan
    Zhang, Yunfei
    SIGNAL PROCESSING, 2022, 198
  • [4] Multi-Agent Deep Reinforcement Learning Based Distributed Resource Allocation
    Urmonov, Odilbek
    Kim, HyungWon
    2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [5] A model of video coding based on multi-agent
    Tao, Yang
    Liu, Zhiming
    Peng, Yuxing
    AGENT COMPUTING AND MULTI-AGENT SYSTEMS, 2006, 4088 : 590 - 595
  • [6] Dynamic power allocation in IIoT based on multi-agent deep reinforcement learning
    Li, Fenglei
    Liu, Zhixin
    Zhang, Xinzhe
    Yang, Yi
    NEUROCOMPUTING, 2022, 505 : 10 - 18
  • [7] Spectrum allocation algorithm based on multi-agent reinforcement learning in smart grid
    Yan F.
    Lin X.
    Li Z.
    Xu X.
    Xia W.
    Shen L.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (09): : 12 - 24
  • [8] Review of multi-agent reinforcement learning based dynamic spectrum allocation method
    Song B.
    Ye W.
    Meng X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (11): : 3338 - 3351
  • [9] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
  • [10] A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation
    Koratamaddi, Prahlad
    Wadhwani, Karan
    Gupta, Mridul
    Sanjeevi, Sriram G.
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 410 - 410