Predicting split decisions in MPEG-2 to HEVC video transcoding

Cited by: 1
Authors
Shanableh, Tamer [1]
Hassan, Mahitab [2]
Affiliations
[1] American University of Sharjah, Department of Computer Science and Engineering, Sharjah, United Arab Emirates
[2] IBM Cloud, Dubai, United Arab Emirates
Source
SN Applied Sciences | 2020 / Volume 2 / Issue 06
Keywords
Video coding; Video transcoding; HEVC; Machine learning; H.264/AVC; INTER
DOI
10.1007/s42452-020-2909-7
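Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09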
Abstract
This paper proposes learning-based approaches for transcoding videos compressed in the Moving Picture Experts Group 2 (MPEG-2) format into the High Efficiency Video Coding (HEVC) format. In the training mode of the transcoder, mappings between extracted features and split decisions are computed; in the transcoding mode, the split decisions of the Coding Units (CUs) of the HEVC video are predicted. Two formulations are proposed for predicting split decisions: a multi-model solution and a multi-tier solution. In the former, multiple models are generated based on the total number of split flags in a coding unit. In the latter, split decisions are modelled at three different coding depths. The proposed solutions are evaluated in terms of excess bitrate, drop in PSNR, classification accuracy, model generation time and transcoding speedup. The multi-tier solution is shown to maintain the rate-distortion behaviour of full re-encoding at the expense of a lower gain in transcoding speedup. Compared with existing work, the proposed solutions offer a significant enhancement in rate-distortion performance and classification accuracy.
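
To illustrate the multi-tier idea described in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes a 64x64 coding tree unit with a minimum CU size of 8x8, so that split decisions arise at three depths (0, 1 and 2), and it assumes one binary predictor per depth. The names `predict_partition`, `toy_features`, `always_split` and `never_split` are hypothetical; the abstract does not specify the features or the classifiers, so the predictors here are placeholders.

```python
from typing import Callable, List, Tuple

# A tier predictor maps a feature vector to a split decision (True = split).
Predictor = Callable[[List[float]], bool]
# Hypothetical feature extractor: (x, y, size) -> feature vector.
FeatureFn = Callable[[int, int, int], List[float]]
CU = Tuple[int, int, int]  # (x, y, size) of a leaf coding unit


def predict_partition(x: int, y: int, size: int, depth: int,
                      predictors: List[Predictor],
                      features: FeatureFn,
                      min_size: int = 8) -> List[CU]:
    """Recursively predict the CU quadtree, using one predictor per depth."""
    # Stop when the CU cannot be split further or no predictor covers this depth.
    if size <= min_size or depth >= len(predictors):
        return [(x, y, size)]
    # Ask the predictor for this depth whether to split the current CU.
    if not predictors[depth](features(x, y, size)):
        return [(x, y, size)]
    half = size // 2
    leaves: List[CU] = []
    # Visit the four sub-CUs in raster order (top-left, top-right, ...).
    for dy in (0, half):
        for dx in (0, half):
            leaves += predict_partition(x + dx, y + dy, half, depth + 1,
                                        predictors, features, min_size)
    return leaves


# Placeholder feature extractor and predictors, for illustration only.
def toy_features(x: int, y: int, size: int) -> List[float]:
    return [float(size)]


def always_split(f: List[float]) -> bool:
    return True


def never_split(f: List[float]) -> bool:
    return False


if __name__ == "__main__":
    # Split at depth 0 only: the 64x64 unit becomes four 32x32 CUs.
    tiers = [always_split, never_split, never_split]
    print(predict_partition(0, 0, 64, 0, tiers, toy_features))
```

With the placeholder tiers above, only the depth-0 predictor votes to split, so the sketch returns four 32x32 CUs. In practice each tier would be a trained model; avoiding an exhaustive search over all partitions is the usual source of speedup in this kind of transcoder.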
Pages: 14