CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

被引：13

作者：

Zhao, Hengrun ^{[1
]}

Zheng, Bolun ^{[1
]}

Yuan, Shanxin ^{[2
]}

Zhang, Hua ^{[3
]}

Yan, Chenggang ^{[1
]}

Li, Liang ^{[4
]}

Slabaugh, Gregory ^{[5
]}

机构：

[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 311305, Peoples R China

[2] Huawei Technol, Noahs Ark Lab, London N1C 4AG, England

[3] Hangzhou Dianzi Univ, Sch Comp Sci, Hangzhou 311305, Peoples R China

[4] Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China

[5] Queen Mary Univ London, Digital Environm Res Inst DERI, London E1 4NS, England

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 07期

关键词：

Image coding; Quantization (signal); Streaming media; Bit rate; Image restoration; Transform coding; Video recording; Quality enhancement; CBR compressed video; dual-domain restoration; DECISION ALGORITHM; MODE DECISION; SIZE DECISION; HEVC;

D O I：

10.1109/TCSVT.2021.3123621

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Constant bit rate (CBR) videos are widely used in streaming playback applications. However, the image quality of the CBR video is often unstable, especially for scenes with large motion. To this end, we design a new model to represent the distortion of High Efficiency Video Coding (HEVC) constant bit rate video, and propose a neural network for a constant bit rate video quality enhancement (CBREN). We propose a dual-domain restoration module (DRM) to jointly learn the prior knowledge in the pixel domain and the frequency domain. To address the degradation resulting from compression, we propose a two-step quantization degradation estimation strategy. The Inverse DCT (IDCT) Translation Unit (ITU) is used to constrain the quantization table of the constant bit rate video to a suitable range, and the Dynamic Alpha Unit (DAU) is used to fine-tune the quantization table according to the content of each frame. In order to effectively reduce the block distortion of different sizes produced in the compression process, we adopt a multi-scale network. Extensive experiments show that our approach can greatly enhance the quality of CBR compressed video. Moreover, our method can also be applied to constant quantization parameter (CQP) video enhancement tasks, and is certainly superior to existing methods.

引用

页码：4138 / 4149

页数：12

共 50 条

[21] Foveated convolutional neural networks for video summarization
Jiaxin Wu
Sheng-hua Zhong
Zheng Ma
Stephen J. Heinen
Jianmin Jiang
Multimedia Tools and Applications, 2018, 77 : 29245 - 29267
[22] Deep Convolutional Neural Network for Decompressed Video Enhancement
Lin, Rongqun
Zhang, Yongbing
Wang, Haoqian
Wang, Xingzheng
Dai, Qionghai
2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 617 - 617
[23] Pyramid coding based rate control for constant bit rate video streaming
Kumar, Venkata Phani M.
Varma, K. C. Ravi Chandra
Mahapatra, Sudipta
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (24) : 17247 - 17272
[24] Nonlinear predictive rate control for constant bit rate MPEG video coders
Saw, YS
Grant, PM
Hannah, JM
Mulgrew, B
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 2641 - 2644
[25] Pyramid coding based rate control for constant bit rate video streaming
Venkata Phani Kumar M
K. C. Ravi Chandra Varma
Sudipta Mahapatra
Multimedia Tools and Applications, 2016, 75 : 17247 - 17272
[26] Power consumption analysis of constant bit rate video transmission over 3G networks
Ukhanova, Anna
Belyaev, Evgeny
Wang, Le
Forchhammer, Soren
COMPUTER COMMUNICATIONS, 2012, 35 (14) : 1695 - 1706
[27] Stereoscopic video quality assessment based on 3D convolutional neural networks
Yang, Jiachen
Zhu, Yinghao
Ma, Chaofan
Lu, Wen
Meng, Qinggang
NEUROCOMPUTING, 2018, 309 : 83 - 93
[28] RESIDUAL FRAME FOR NOISY VIDEO CLASSIFICATION ACCORDING TO PERCEPTUAL QUALITY IN CONVOLUTIONAL NEURAL NETWORKS
Zhang, Huaixuan
Lan, Yuhai
Dai, Tao
Qiao, Ruizhi
Xu, Ying
Yao, Yao
Xia, Shu-Tao
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 242 - 247
[29] MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks
Cetinkaya, Ekrem
Minh Nguyen
Timmerer, Christian
MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 465 - 472
[30] Frame layer bit allocation scheme for constant quality video
Jiang, MQ
Yi, XQ
Ling, N
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1055 - 1058

← 1 2 3 4 5 →