Spatial-Temporal Inter-Layer Reference Frame Generation Network for Spatial SHVC

被引:0
|
作者
Wang, Shiwei [1 ]
Shen, Liquan [2 ]
Liu, Jingyue [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Key Lab Adv Display & Syst Applicat, Minist Educ, Shanghai 200072, Peoples R China
基金
中国国家自然科学基金;
关键词
Reference frame reconstruction; inter prediction; SHVC; SCALABLE EXTENSIONS; NEURAL-NETWORK; VIDEO; HEVC;
D O I
10.1109/TMM.2023.3308444
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the current spatial Scalable High Efficiency Video Coding (SHVC) standard, the main techniques involve exploiting the correlation between pixel values of different layers to achieve inter-layer prediction samples, allowing the enhancement layer (EL) to predict samples from the upsampled base layer (BL) frame and remove temporal redundancy. However, existing network-based methods cannot effectively handle multi-layer compressed images with different resolutions to generate reference frame in spatial SHVC. Meanwhile, spatial SHVC only uses traditional interpolation filters to upsample the BL frame for EL frame sample prediction, which cannot handle different structures and contents. Therefore, considering the high correlation of multi-scale distortion characteristics across different layers, this article proposes a spatial-temporal inter-layer reference frame generation network (ST-ILR) for spatial SHVC, which can generate a high-fidelity reference frame for efficient inter-prediction and insert it into the EL reference picture list. The proposed method consists of two modules: a multi-scale motion restoration (MMR) module and a guided multi-scale feature reconstruction (GMFR) module. The MMR model is designed to accurately predict the motion trend of the EL based on the BL motion information, while implicitly compensating for previous EL frames. This is achieved by dynamically modeling the current EL motion information from the BL, capturing compression downsampling differences of prior motion vectors across different layers. The GMFR module adaptively super-resolves compressed BL frames and selectively aggregates high-frequency information from aligned EL features to preserve precise spatial detail, fusing abundant features from different layers to achieve better ILR frame quality performance. Extensive experiments show that our network achieves a 13.6% BD-rate (Bjontegaard Delta Rate) reduction in random access configuration compared to the SHVC baseline, which offers state-of-the-art coding performance.
引用
收藏
页码:3235 / 3250
页数:16
相关论文
共 50 条
  • [1] Smoothed inter-layer residual prediction for SHVC
    Lee, Hahyun
    Lee, Jinho
    Kang, Jung Won
    Choi, Jin Soo
    [J]. 2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 606 - 607
  • [2] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
    Hu, Mengshun
    Jiang, Kui
    Liao, Liang
    Nie, Zhixiang
    Xiao, Jing
    Wang, Zheng
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2145 - 2153
  • [3] Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation
    Hu, Mengshun
    Jiang, Kui
    Liao, Liang
    Nie, Zhixiang
    Xiao, Jing
    Wang, Zheng
    [J]. MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia, 2022, : 2145 - 2153
  • [4] VRFCNN: Virtual Reference Frame Generation Network for Quality SHVC
    Ding, Qing
    Shen, Liquan
    Yang, Hao
    Dong, Xinchao
    Xu, Mai
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 2049 - 2053
  • [5] ICRS: inter-layer compression method combined with generation of a spatial image pyramid
    Yunzhou Zhang
    Mo Zhang
    Jinnian Wang
    Gang Zhang
    [J]. Multimedia Tools and Applications, 2016, 75 : 12077 - 12099
  • [6] ICRS: inter-layer compression method combined with generation of a spatial image pyramid
    Zhang, Yunzhou
    Zhang, Mo
    Wang, Jinnian
    Zhang, Gang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (19) : 12077 - 12099
  • [7] STGGAN: Spatial-temporal Graph Generation
    Zhang, Liming
    [J]. 27TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2019), 2019, : 608 - 609
  • [8] Spatial-Temporal Wireless Network Channels
    Chen, Yifan
    Mucchi, Lorenzo
    Wang, Rui
    [J]. 2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 2597 - 2602
  • [9] Fast Spatial-Temporal Transformer Network
    Escher, Rafael Molossi
    de Bem, Rodrigo Andrade
    Jorge Drews Jr, Paulo Lilles
    [J]. 2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 65 - 72
  • [10] Spatial-Temporal Interpolation of Reference Evapotranspiration for Pakistan
    Saeed Shah, Syed Muhammad
    El-Morshedy, M.
    Mansoor, Wahidullah
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022