A Switchable Deep Learning Approach for In-Loop Filtering in Video Coding

被引:42
|
作者
Ding, Dandan [1 ]
Kong, Lingyi [1 ]
Chen, Guangyao [1 ]
Liu, Zoe [2 ]
Fang, Yong [3 ]
机构
[1] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou 311121, Peoples R China
[2] Visionular Inc, Mountain View, CA 94040 USA
[3] Changan Univ, Sch Informat Engn, Xian 710064, Peoples R China
关键词
Encoding; Video coding; Feature extraction; Adaptation models; Tools; Training; Correlation; CNN; in-loop filter; video coding; enhancement;
D O I
10.1109/TCSVT.2019.2935508
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deep learning provides a great potential for in-loop filtering to improve both coding efficiency and subjective quality in video coding. State-of-the-art work focuses on network structure design and employs a single powerful network to solve all problems. In contrast, this paper proposes a deep learning based systematic approach that includes an effective Convolutional Neural Network (CNN) structure, a hierarchical training strategy, and a video codec oriented switchable mechanism. First, we propose a novel CNN structure, i.e., Squeeze-and-Excitation Filtering CNN (SEFCNN), as an optional in-loop filter. To capture the non-linear interaction between channels, the SEFCNN is comprised of two subnets, i.e., Feature EXtracting (FEX) subnet and Feature ENhancing (FEN) subnet. Then, we develop a hierarchical model training strategy to adapt the two subnets to different coding scenarios. For high-rate videos with small artifacts, we train a single global model using the FEX for all types of frames, whereas for low-rate videos with large artifacts, different models are trained using both FEX and FEN for different types of frames. Finally, we propose an adaptive enhancing mechanism which is switchable between the CNN-based and the conventional methods. We selectively apply the CNN model to some frames or some regions in a frame. Experimental results show that the proposed scheme outperforms state-of-the-art work in coding efficiency, while the computational complexity is acceptable after GPU acceleration.
引用
收藏
页码:1871 / 1887
页数:17
相关论文
共 50 条
  • [1] Deep Learning based Spatial-Temporal In-loop filtering for Versatile Video Coding
    Pham, Chi D. K.
    Fu, Chen
    Zhou, Jinjia
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1861 - 1865
  • [2] PTR-CNN for in-loop filtering in video coding
    Shao, Tong
    Liu, Tianqi
    Wu, Dapeng
    Tsai, Chia-Yang
    Lei, Zhijun
    Katsavounidis, Ioannis
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [3] Complexity Reduction of Learned In-Loop Filtering in Video Coding
    Bayliss, Woody
    Murn, Luka
    Izquierdo, Ebroul
    Zhang, Qianni
    Mrak, Marta
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 506 - 510
  • [4] Deep In-Loop Filtering via Multi-Domain Correlation Learning and Partition Constraint for Multiview Video Coding
    Peng, Bo
    Chang, Renjie
    Pan, Zhaoqing
    Li, Ge
    Ling, Nam
    Lei, Jianjun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (04) : 1911 - 1921
  • [5] Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering
    Yang, Mingyi
    Huo, Junyan
    Zhou, Xile
    Qiao, Wenhan
    Wan, Shuai
    Wang, Hao
    Yang, Fuzheng
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2851 - 2865
  • [6] ON INTRA VIDEO CODING AND IN-LOOP FILTERING FOR NEURAL OBJECT DETECTION NETWORKS
    Fischer, Kristian
    Herglotz, Christian
    Kaup, Andre
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1147 - 1151
  • [7] Constrained in-loop filtering for error resiliency in high efficiency video coding
    Lee, Jinho
    Kim, Hui Yong
    Lim, Sung-Chang
    Choi, Jin Soo
    [J]. OPTICAL ENGINEERING, 2013, 52 (07)
  • [8] A progressive CNN in-loop filtering approach for inter frame coding
    Ding, Dandan
    Kong, Lingyi
    Wang, Wenyu
    Zhu, Fengqing
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 94
  • [9] Adaptive Guided Image Filter for Improved In-Loop Filtering in Video Coding
    Chen, Chen
    Miao, Zexiang
    Zeng, Bing
    [J]. 2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [10] A LEARNING-BASED LOWCOMPLEXITY IN-LOOP FILTER FOR VIDEO CODING
    Liu, Chao
    Sun, Heming
    Katto, Jiro
    Zeng, Xiaoyang
    Fan, Yibo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2020,