Object Segmentation-Assisted Inter Prediction for Versatile Video Coding

Authors
Li, Zhuoyuan [1]
Yuan, Zikun [2]
Li, Li [1]
Liu, Dong [1]
Tang, Xiaohu [2]
Wu, Feng [1]
Affiliations
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & App, Hefei 230027, Peoples R China
[2] Southwest Jiaotong Univ, Informat Secur & Natl Comp Grid Lab, Chengdu 610031, Peoples R China
Keywords
Motion segmentation; Encoding; Video coding; Accuracy; Standards; Shape; Vectors; Inter prediction; Motion compensation; Motion estimation; Motion vector coding; Object segmentation; Partition estimation; VVC; Image
DOI
10.1109/TBC.2024.3434520
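Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809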
Abstract
Modern video coding standards widely adopt block-based inter prediction, which brings high compression efficiency. However, natural videos usually contain multiple moving objects of arbitrary shapes, resulting in complex motion fields that are difficult to represent compactly. The Versatile Video Coding (VVC) standard tackles this problem with more flexible block partitioning, but the more flexible partitions require more overhead bits to signal and still cannot take arbitrary shapes. To address this limitation, we propose an object segmentation-assisted inter prediction method (SAIP), in which objects in the reference frames are segmented by advanced segmentation techniques. With a proper indication, the object segmentation mask is translated from the reference frame to the current frame, where it serves as an arbitrary-shaped partition into different regions without any extra signaling. Using the segmentation mask, motion compensation is performed separately for each region, achieving higher prediction accuracy. The segmentation mask is also used to code the motion vectors of the different regions more efficiently. Moreover, the segmentation mask is taken into account in the joint rate-distortion optimization for motion estimation and partition estimation, so that the motion vectors of the different regions and the partition are derived more accurately. The proposed method is implemented in the VVC reference software, VTM version 12.0. Experimental results show that, on common test sequences, the proposed method achieves BD-rate reductions of up to 1.98%, 1.14%, and 0.79%, and of 0.82%, 0.49%, and 0.37% on average, under the Low-delay P, Low-delay B, and Random Access configurations, respectively.
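The region-wise motion compensation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single binary object mask, integer-pel motion vectors given as (dy, dx) offsets from a current pixel to its reference pixel, and border clamping. The names `fetch_clamped` and `region_wise_prediction` are hypothetical and chosen only for this illustration.

```python
def fetch_clamped(frame, y, x, mv):
    """Read frame at (y + mv[0], x + mv[1]), clamping coordinates to the frame."""
    h, w = len(frame), len(frame[0])
    yy = min(max(y + mv[0], 0), h - 1)
    xx = min(max(x + mv[1], 0), w - 1)
    return frame[yy][xx]


def region_wise_prediction(ref, ref_mask, mv_obj, mv_bg):
    """Predict the current frame from `ref` using one MV per region.

    Assumption for this sketch: the reference mask is carried into the
    current frame with the object MV, so no partition shape is signaled.
    """
    h, w = len(ref), len(ref[0])
    pred = [[0] * w for _ in range(h)]
    cur_mask = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # A current pixel is "object" if the object MV points at an
            # object pixel of the reference mask.
            is_obj = fetch_clamped(ref_mask, y, x, mv_obj) == 1
            cur_mask[y][x] = 1 if is_obj else 0
            # Each region is motion-compensated with its own vector.
            mv = mv_obj if is_obj else mv_bg
            pred[y][x] = fetch_clamped(ref, y, x, mv)
    return pred, cur_mask


# Toy data: a 2x2 object (value 9) on a flat background (value 1).
ref = [
    [1, 1, 1, 1, 1],
    [1, 9, 9, 1, 1],
    [1, 9, 9, 1, 1],
    [1, 1, 1, 1, 1],
]
ref_mask = [[1 if v == 9 else 0 for v in row] for row in ref]

# The object moves one pixel to the right; the background is static.
pred, cur_mask = region_wise_prediction(ref, ref_mask, mv_obj=(0, -1), mv_bg=(0, 0))
```

In this toy setup the pixels uncovered by the moving object are still predicted from the old object position, because the background vector is zero. A real codec would handle such regions through residual coding and the rate-distortion decisions mentioned in the abstract, which this sketch does not model.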
Pages: 18