CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

被引:13
|
作者
Zhao, Hengrun [1 ]
Zheng, Bolun [1 ]
Yuan, Shanxin [2 ]
Zhang, Hua [3 ]
Yan, Chenggang [1 ]
Li, Liang [4 ]
Slabaugh, Gregory [5 ]
机构
[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 311305, Peoples R China
[2] Huawei Technol, Noahs Ark Lab, London N1C 4AG, England
[3] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou 311305, Peoples R China
[4] Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China
[5] Queen Mary Univ London, Digital Environm Res Inst DERI, London E1 4NS, England
关键词
Image coding; Quantization (signal); Streaming media; Bit rate; Image restoration; Transform coding; Video recording; Quality enhancement; CBR compressed video; dual-domain restoration; DECISION ALGORITHM; MODE DECISION; SIZE DECISION; HEVC;
D O I
10.1109/TCSVT.2021.3123621
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Constant bit rate (CBR) videos are widely used in streaming playback applications. However, the image quality of the CBR video is often unstable, especially for scenes with large motion. To this end, we design a new model to represent the distortion of High Efficiency Video Coding (HEVC) constant bit rate video, and propose a neural network for a constant bit rate video quality enhancement (CBREN). We propose a dual-domain restoration module (DRM) to jointly learn the prior knowledge in the pixel domain and the frequency domain. To address the degradation resulting from compression, we propose a two-step quantization degradation estimation strategy. The Inverse DCT (IDCT) Translation Unit (ITU) is used to constrain the quantization table of the constant bit rate video to a suitable range, and the Dynamic Alpha Unit (DAU) is used to fine-tune the quantization table according to the content of each frame. In order to effectively reduce the block distortion of different sizes produced in the compression process, we adopt a multi-scale network. Extensive experiments show that our approach can greatly enhance the quality of CBR compressed video. Moreover, our method can also be applied to constant quantization parameter (CQP) video enhancement tasks, and is certainly superior to existing methods.
引用
收藏
页码:4138 / 4149
页数:12
相关论文
共 50 条
  • [21] Foveated convolutional neural networks for video summarization
    Jiaxin Wu
    Sheng-hua Zhong
    Zheng Ma
    Stephen J. Heinen
    Jianmin Jiang
    Multimedia Tools and Applications, 2018, 77 : 29245 - 29267
  • [22] Deep Convolutional Neural Network for Decompressed Video Enhancement
    Lin, Rongqun
    Zhang, Yongbing
    Wang, Haoqian
    Wang, Xingzheng
    Dai, Qionghai
    2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 617 - 617
  • [23] Pyramid coding based rate control for constant bit rate video streaming
    Kumar, Venkata Phani M.
    Varma, K. C. Ravi Chandra
    Mahapatra, Sudipta
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (24) : 17247 - 17272
  • [24] Nonlinear predictive rate control for constant bit rate MPEG video coders
    Saw, YS
    Grant, PM
    Hannah, JM
    Mulgrew, B
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 2641 - 2644
  • [25] Pyramid coding based rate control for constant bit rate video streaming
    Venkata Phani Kumar M
    K. C. Ravi Chandra Varma
    Sudipta Mahapatra
    Multimedia Tools and Applications, 2016, 75 : 17247 - 17272
  • [26] Power consumption analysis of constant bit rate video transmission over 3G networks
    Ukhanova, Anna
    Belyaev, Evgeny
    Wang, Le
    Forchhammer, Soren
    COMPUTER COMMUNICATIONS, 2012, 35 (14) : 1695 - 1706
  • [27] Stereoscopic video quality assessment based on 3D convolutional neural networks
    Yang, Jiachen
    Zhu, Yinghao
    Ma, Chaofan
    Lu, Wen
    Meng, Qinggang
    NEUROCOMPUTING, 2018, 309 : 83 - 93
  • [28] RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Huaixuan
    Lan, Yuhai
    Dai, Tao
    Qiao, Ruizhi
    Xu, Ying
    Yao, Yao
    Xia, Shu-Tao
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 242 - 247
  • [29] MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks
    Cetinkaya, Ekrem
    Minh Nguyen
    Timmerer, Christian
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 465 - 472
  • [30] Frame layer bit allocation scheme for constant quality video
    Jiang, MQ
    Yi, XQ
    Ling, N
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1055 - 1058