One-for-All: An Efficient Variable Convolution Neural Network for In-Loop Filter of VVC

被引:26
|
作者
Huang, Zhijie [1 ]
Sun, Jun [1 ]
Guo, Xiaopeng [1 ]
Shang, Mingyu [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China
关键词
Encoding; Videos; Feature extraction; Convolution; Adaptation models; Visualization; Training; Variable; in-loop filter; attention; versatile video coding (VVC); CNN;
D O I
10.1109/TCSVT.2021.3089498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, many researches on convolution neural network (CNN) based in-loop filters have been proposed to improve coding efficiency. However, most existing CNN based filters tend to train and deploy multiple networks for various quantization parameters (QP) and frame types (FT), which drastically increases resources in training these models and the memory burdens for video codec. In this paper, we propose a novel variable CNN (VCNN) based in-loop filter for VVC, which can effectively handle the compressed videos with different QPs and FTs via a single model. Specifically, an efficient and flexible attention module is developed to recalibrate features according to QPs or FTs. Then we embed the module into the residual block so that these informative features can be continuously utilized in the residual learning process. To minimize the information loss in the learning process of the entire network, we utilize a residual feature aggregation module (RFA) for more efficient feature extraction. Based on it, an efficient network architecture VCNN is designed that can not only effectively reduce compression artifacts, but also can be adaptive to various QPs and FTs. To address training data imbalance on various QPs and FTs and improve the robustness of the model, a focal mean square error loss function is employed to train the proposed network. Then we integrate the VCNN into VVC as an additional tool of in-loop filters after the deblocking filter. Extensive experimental results show that our VCNN approach obtains on average 3.63%, 4.36%, 4.23%, 3.56% under all intra, low-delay P, low-delay, and random access configurations, respectively, which is even better than QP-Separate models.
引用
收藏
页码:2342 / 2355
页数:14
相关论文
共 50 条
  • [31] RESIDUAL CONVOLUTIONAL NEURAL NETWORK BASED IN-LOOP FILTER WITH INTRA AND INTER FRAMES PROCESSED RESPECTIVELY FOR AVS3
    Zhu, Han
    Xu, Xiaozhong
    Liu, Shan
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,
  • [32] Reference-based In-loop Filter with Robust Neural Feature Transfer for Video Coding
    Kim, Nayoung
    Webtoon, Naver
    Lee, Jung-kyung
    Kang, Je-won
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (01)
  • [33] Efficient In-Loop Filtering Based on Enhanced Deep Convolutional Neural Networks for HEVC
    Pan, Zhaoqing
    Yi, Xiaokai
    Zhang, Yun
    Jeon, Byeungwoo
    Kwong, Sam
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 5352 - 5366
  • [34] Learning an Efficient Convolution Neural Network for Pansharpening
    Guo, Yecai
    Ye, Fei
    Gong, Hao
    ALGORITHMS, 2019, 12 (01)
  • [35] Efficient Convolution Architectures for Convolutional Neural Network
    Wang, Jichen
    Lin, Jun
    Wang, Zhongfeng
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [36] One-for-All: Grouped Variation Network-Based Fractional Interpolation in Video Coding
    Liu, Jiaying
    Xia, Sifeng
    Yang, Wenhan
    Li, Mading
    Liu, Dong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2140 - 2151
  • [37] ONE-DIMENSIONAL FILTERING AND FILTER COEFFICIENT COMPRESSION FOR OPTIMAL POST-PROCESS/IN-LOOP FILTER COEFFICIENT
    Akbulut, Orhan
    Ertuerk, Sarp
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 653 - +
  • [38] Efficient Convolution Neural Networks for Object Tracking Using Separable Convolution and Filter Pruning
    Mao, Yuanhong
    He, Zhanzhuang
    Ma, Zhong
    Tang, Xuehan
    Wang, Zhuping
    IEEE ACCESS, 2019, 7 (106466-106474) : 106466 - 106474
  • [39] Efficient HW Design of Adaptive Loop Filter for 4k ASIC VVC Encoder
    Farhat, Ibrahim
    Hamidouche, Wassim
    Grill, Adrien
    Menard, Daniel
    Deforges, Olivier
    2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 1 - 5
  • [40] MULTI-MODAL/MULTI-SCALE CONVOLUTIONAL NEURAL NETWORK BASED IN-LOOP FILTER DESIGN FOR NEXT GENERATION VIDEO CODEC
    Kang, Jihong
    Kim, Sungjei
    Lee, Kyoung Mu
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 26 - 30