One-for-All: An Efficient Variable Convolution Neural Network for In-Loop Filter of VVC

被引:26
|
作者
Huang, Zhijie [1 ]
Sun, Jun [1 ]
Guo, Xiaopeng [1 ]
Shang, Mingyu [1 ]
机构
[1] Peking Univ, Wangxuan Inst Comp Technol, Beijing 100871, Peoples R China
关键词
Encoding; Videos; Feature extraction; Convolution; Adaptation models; Visualization; Training; Variable; in-loop filter; attention; versatile video coding (VVC); CNN;
D O I
10.1109/TCSVT.2021.3089498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, many researches on convolution neural network (CNN) based in-loop filters have been proposed to improve coding efficiency. However, most existing CNN based filters tend to train and deploy multiple networks for various quantization parameters (QP) and frame types (FT), which drastically increases resources in training these models and the memory burdens for video codec. In this paper, we propose a novel variable CNN (VCNN) based in-loop filter for VVC, which can effectively handle the compressed videos with different QPs and FTs via a single model. Specifically, an efficient and flexible attention module is developed to recalibrate features according to QPs or FTs. Then we embed the module into the residual block so that these informative features can be continuously utilized in the residual learning process. To minimize the information loss in the learning process of the entire network, we utilize a residual feature aggregation module (RFA) for more efficient feature extraction. Based on it, an efficient network architecture VCNN is designed that can not only effectively reduce compression artifacts, but also can be adaptive to various QPs and FTs. To address training data imbalance on various QPs and FTs and improve the robustness of the model, a focal mean square error loss function is employed to train the proposed network. Then we integrate the VCNN into VVC as an additional tool of in-loop filters after the deblocking filter. Extensive experimental results show that our VCNN approach obtains on average 3.63%, 4.36%, 4.23%, 3.56% under all intra, low-delay P, low-delay, and random access configurations, respectively, which is even better than QP-Separate models.
引用
收藏
页码:2342 / 2355
页数:14
相关论文
共 50 条
  • [41] Recursive Residual Convolutional Neural Network- Based In-Loop Filtering for Intra Frames
    Zhang, Shufang
    Fan, Zenghui
    Ling, Nam
    Jiang, Minqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 1888 - 1900
  • [42] Efficient Fast Convolution Architectures for Convolutional Neural Network
    Xu, Weihong
    Wang, Zhongfeng
    You, Xiaohu
    Zhang, Chuan
    2017 IEEE 12TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2017, : 904 - 907
  • [43] Joint Pixel and Frequency Feature Learning and Fusion via Channel-Wise Transformer for High-Efficiency Learned In-Loop Filter in VVC
    Kathariya, Birendra
    Li, Zhu
    Van der Auwera, Geert
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 4070 - 4083
  • [44] A High Efficient Architecture for Convolution Neural Network Accelerator
    Kong Anmin
    Zhao Bin
    2019 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2019), 2019, : 131 - 134
  • [45] Neural Network Based Multi-Level In-Loop Filtering for Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Li, Na
    Wu, Wenhui
    Wang, Shiqi
    Kwong, Sam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 12092 - 12096
  • [46] Analysis and Efficient Architecture Design for VC-1 Overlap Smoothing and In-Loop Deblocking Filter
    Lee, Yen-Lin
    Nguyen, Truong Q.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (12) : 1786 - 1796
  • [47] Hybrid video coding scheme based on VVC and spatio-temporal attention convolution neural network
    He, Gang
    Xu, Kepeng
    Wu, Chang
    Ma, Zijia
    Wen, Xing
    Sun, Ming
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1790 - 1793
  • [48] A Low Complexity Convolutional Neural Network with Fused CP Decomposition for In-Loop Filtering in Video Coding
    Shao, Tong
    Shingala, Jay N.
    Yin, Peng
    Arora, Arjun
    Shyam, Ajay
    McCarthy, Sean
    2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 238 - 247
  • [49] A Low Complexity Convolutional Neural Network with Fused CP Decomposition for In-Loop Filtering in Video Coding
    Shao, Tong
    Shingala, Jay N.
    Yin, Peng
    Arora, Arjun
    Shyam, Ajay
    McCarthy, Sean
    Data Compression Conference Proceedings, 2023, 2023-March : 238 - 247
  • [50] Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding
    Jia, Chuanmin
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Liu, Jiaying
    Pu, Shiliang
    Ma, Siwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3343 - 3356