CNN-based Bi-prediction Utilizing Spatial Information for Video Coding

Cited by: 2
Authors
Mao, Jue [1]
Yu, Hualong [1]
Gao, Xiaoding [1]
Yu, Lu [1]
Affiliation
[1] Zhejiang Univ, Inst Informat & Commun Engn, Zhejiang Prov Key Lab Informat Proc Commun & Netw, Hangzhou, Zhejiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
bi-prediction; CNN; pixel-wise combination; spatial neighboring pixels; weighted prediction
DOI
10.1109/iscas.2019.8702552
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
In video coding, slice-level and block-level weighted bi-prediction are used for scenes with temporal brightness variation. However, structured residuals remain when weighted bi-prediction is applied at the slice and block levels. Recently, CNN-based bi-prediction, in which a CNN model generates the bi-predictor from two reference blocks, has achieved remarkable success in reducing these structured residuals. Motivated by the high spatial correlation among pixels, this paper feeds the spatial neighboring pixels of both the current block and the two reference blocks to the proposed CNN model as additional information, further reducing the residual and yielding a more accurate bi-predictor. Moreover, a comparison of AMVP and merge/skip modes shows that CNN-based bi-prediction is more efficient for merge/skip mode than for AMVP mode. Experimental results show that the proposed method achieves an average BD-rate saving of 3.46% over HM 16.15 under the random access configuration.
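The contrast drawn in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify the network architecture or input layout, so the function names, the `margin` parameter, the three-channel stacking, the zero-filling of the not-yet-decoded current block, and the integer-pel motion vectors are all assumptions made for illustration. The first function shows conventional block-level weighted bi-prediction (one weight pair for the whole block); the others assemble the kind of input a CNN would need in order to see the spatial neighbors of the current block and of both reference blocks.

```python
import numpy as np

def weighted_bi_prediction(ref0, ref1, w0=0.5, w1=0.5, offset=0.0):
    """Block-level weighted bi-prediction: the same weights apply to every pixel."""
    pred = w0 * ref0.astype(np.float64) + w1 * ref1.astype(np.float64) + offset
    return np.clip(np.rint(pred), 0, 255).astype(np.uint8)

def extended_patch(frame, y, x, size, margin):
    """Return the size x size block at (y, x) plus `margin` rows above it
    and `margin` columns to its left (an L-shaped neighborhood)."""
    h, w = frame.shape
    if y < margin or x < margin or y + size > h or x + size > w:
        raise ValueError("patch does not fit inside the frame")
    return frame[y - margin:y + size, x - margin:x + size]

def build_cnn_input(cur_recon, ref0_frame, ref1_frame, pos, mv0, mv1, size, margin):
    """Stack three channels (hypothetical layout): two motion-compensated
    reference patches with neighbors, and the current block's neighbors."""
    y, x = pos
    # Integer-pel motion vectors are assumed; real codecs interpolate fractional ones.
    p0 = extended_patch(ref0_frame, y + mv0[0], x + mv0[1], size, margin)
    p1 = extended_patch(ref1_frame, y + mv1[0], x + mv1[1], size, margin)
    cur = extended_patch(cur_recon, y, x, size, margin).astype(np.float32)
    # The current block itself is not decoded yet; only its neighbors are known.
    cur[margin:, margin:] = 0.0
    return np.stack([p0.astype(np.float32), p1.astype(np.float32), cur], axis=0)
```

With an 8x8 block and a 4-pixel margin, `build_cnn_input` returns an array of shape `(3, 12, 12)`. A trained model would map this to a per-pixel bi-predictor, which is where it differs from the single weight pair used by `weighted_bi_prediction`.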
Pages: 5
Related papers
50 in total
  • [1] Convolutional Neural Network Based Bi-Prediction Utilizing Spatial and Temporal Information in Video Coding
    Mao, Jue
    Yu, Lu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 1856 - 1870
  • [2] Adaptive Weighted Bi-Prediction based on Template Similarity in Video Coding
    Mao, Jue
    Zhao, Yin
    Xu, Weiwei
    Yu, Lu
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [3] Generalized Bi-prediction Method for Future Video Coding
    Chen, Chun-Chi
    Xiu, Xiaoyu
    He, Yuwen
    Ye, Yan
    [J]. 2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
  • [4] CNN-based Super Resolution for Video Coding Using Decoded Information
    Lin, Chaoyi
    Li, Yue
    Zhang, Kai
    Zhang, Zhaobin
    Zhang, Li
    [J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [5] DEPTH-BASED WEIGHTED BI-PREDICTION FOR VIDEO PLUS DEPTH MAP CODING
    Shimizu, Shinya
    Kimata, Hideaki
    Sugimoto, Shiori
    Kojima, Akira
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1313 - 1316
  • [6] Bi-prediction Enhancement with Deep Frame Prediction Network for Versatile Video Coding
    Tao, Hao
    Qian, Jian
    Yu, Li
    Wang, Hongkui
    [J]. 2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 374 - 374
  • [7] CNN-Based Bi-Directional Motion Compensation for High Efficiency Video Coding
    Zhao, Zhenghui
    Wang, Shiqi
    Wang, Shanshe
    Zhang, Xinfeng
    Ma, Siwei
    Yang, Jiansheng
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,
  • [8] Enhanced Bi-Prediction With Convolutional Neural Network for High-Efficiency Video Coding
    Zhao, Zhenghui
    Wang, Shiqi
    Wang, Shanshe
    Zhang, Xinfeng
    Ma, Siwei
    Yang, Jiansheng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) : 3291 - 3301
  • [9] CNN-based Prediction for Lossless Coding of Photographic Images
    Schiopu, Ionut
    Liu, Yu
    Munteanu, Adrian
    [J]. 2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 16 - 20
  • [10] New Bi-prediction techniques for B pictures coding
    Ji, XY
    Zhao, DB
    Gao, W
    Huang, QM
    Ma, SW
    Lu, Y
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 101 - 104