Learning-Based QP Initialization for Versatile Video Coding

被引:0
|
作者
Zhang, Zhentao [1 ]
Zeng, Hongji [1 ]
Lin, Jielian [1 ,2 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou, Peoples R China
[2] Putian Univ, Sch Mech & Elect, Informat Engn, Putian, Fujian, Peoples R China
关键词
Bit rate control; residual network; video coding;
D O I
10.1561/116.20240029
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Versatile Video Coding (VVC) is a modern video compression standard designed to efficiently encode high definition video content, regardless of its diversity. It is expected to deliver superior compression performance compared to the previous standard, High Efficiency Video Coding (HEVC). However, the bit rate control problem for VVC can still be improved. To address this issue, a learning-based initial frame Quantization Parameter (QP) prediction algorithm has been proposed in this paper. This algorithm extracts information from image pixels and maps it to a feature matrix to reduce its additional cost. Furthermore, the problem of inaccurate determination of VVC QPs has been addressed by building a residual network to represent the frame complexity progressively and learning the optimal relationship between QPs and the target bit rate. Experimental results show that the proposed method reduces the control error from 10.74% to 7.19% compared to the original encoder.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Deep learning-based video quality enhancement for the new versatile video coding
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Olfa Ben Ahmed
    Fatma Ezahra Sayadi
    Neural Computing and Applications, 2022, 34 : 14135 - 14149
  • [2] Deep learning-based video quality enhancement for the new versatile video coding
    Bouaafia, Soulef
    Khemiri, Randa
    Messaoud, Seifeddine
    Ben Ahmed, Olfa
    Sayadi, Fatma Ezahra
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17): : 14135 - 14149
  • [3] Deep Learning-Based Intra Mode Derivation for Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Li, Na
    Jiang, Gangyi
    Kwong, Sam
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [4] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Wang, Shiqi
    Kwong, Sam
    Jin, Xin
    Qiao, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
  • [5] Learning-Based Multi-Stage Intra Partition for Versatile Video Coding
    Zeng, Hongji
    Zhao, Tiesong
    Feng, Weize
    Chen, Nan
    Lin, Jielian
    Wang, Xu
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [6] Multitask Learning-Based Early MTT Partition Decision for Versatile Video Coding
    Liu, Wu
    Li, Yue
    Nie, Mingxing
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT II, 2024, 14474 : 488 - 499
  • [7] Learning-based Multiview Video Coding
    Bai, Baochun
    Cheng, Li
    Lei, Cheng
    Boulanger, Pierre
    Harms, Janelle
    PCS: 2009 PICTURE CODING SYMPOSIUM, 2009, : 201 - +
  • [8] QP INITIALIZATION AND ADAPTIVE MAD PREDICTION FOR RATE CONTROL IN HEVC-BASED MULTI-VIEW VIDEO CODING
    Lim, Woong
    Bajic, Ivan V.
    Sim, Donggyu
    2013 IEEE 11TH IVMSP WORKSHOP: 3D IMAGE/VIDEO TECHNOLOGIES AND APPLICATIONS (IVMSP 2013), 2013,
  • [9] QP INITIALIZATION AND INTERVIEW MAD PREDICTION FOR RATE CONTROL IN HEVC-BASED MULTI-VIEW VIDEO CODING
    Lim, Woong
    Bajic, Ivan V.
    Sim, Donggyu
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2045 - 2049
  • [10] A LEARNING-BASED LOWCOMPLEXITY IN-LOOP FILTER FOR VIDEO CODING
    Liu, Chao
    Sun, Heming
    Katto, Jiro
    Zeng, Xiaoyang
    Fan, Yibo
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,