Predicting split decisions in MPEG-2 to HEVC video transcoding

Cited by: 1

Authors:

Shanableh, Tamer [1]

Hassan, Mahitab [2]

Affiliations:

[1] American University of Sharjah, Department of Computer Science and Engineering, Sharjah, United Arab Emirates

[2] IBM Cloud, Dubai, United Arab Emirates

Source:

SN Applied Sciences | 2020 / Volume 2 / Issue 06

Keywords:

Video coding; Video transcoding; HEVC; Machine learning; H.264/AVC; INTER

DOI:

10.1007/s42452-020-2909-7

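Chinese Library Classification:

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]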

Subject classification codes:

07; 0710; 09

Abstract:

This paper proposes learning-based approaches for transcoding videos compressed in the Moving Picture Experts Group 2 (MPEG-2) format into the High Efficiency Video Coding (HEVC) format. In the training mode of the transcoder, mappings between extracted features and split decisions are calculated. In the transcoding mode, the split decisions of the Coding Units of the HEVC video are then predicted. Two formulations are proposed for predicting split decisions: a multi-model solution and a multi-tier solution. In the former, multiple models are generated based on the total number of split flags in a coding unit. In the latter, split decisions are modelled at three different coding depths. The proposed solutions are evaluated in terms of excess bitrate, drop in PSNR, classification accuracy, model generation time and transcoding speedup. It is shown that the multi-tier solution maintains the rate-distortion behaviour of full re-encoding at the expense of a lower gain in transcoding speedup. Compared with existing work, the proposed solutions offer a significant enhancement in rate-distortion performance and classification accuracy.
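The multi-tier idea in the abstract (one split-decision model per coding depth, applied top-down through the quadtree) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' method: a 64x64 coding tree unit with modelled depths 0 to 2 (so the smallest predicted unit is 8x8), a toy nearest-centroid classifier standing in for whatever learner the paper uses, and the hypothetical names `CentroidModel`, `train_tiers`, `predict_leaves` and `feature_fn` (the last one represents features extracted from the MPEG-2 bitstream for a given block).

```python
from typing import Callable, Dict, List, Sequence, Tuple

Sample = Tuple[Sequence[float], bool]  # (feature vector, split flag)
Block = Tuple[int, int, int]           # (x, y, size) of a coding unit


class CentroidModel:
    """Toy nearest-centroid classifier; a stand-in for the real learner."""

    def __init__(self, samples: List[Sample]) -> None:
        self.centroids: Dict[bool, List[float]] = {}
        for label in (True, False):
            rows = [f for f, s in samples if s == label]
            if rows:
                # Per-feature mean of all training rows with this label.
                self.centroids[label] = [sum(c) / len(rows) for c in zip(*rows)]

    def predict(self, features: Sequence[float]) -> bool:
        def dist(label: bool) -> float:
            centroid = self.centroids[label]
            return sum((a - b) ** 2 for a, b in zip(features, centroid))

        return min(self.centroids, key=dist)


def train_tiers(samples_by_depth: Dict[int, List[Sample]]) -> Dict[int, CentroidModel]:
    """Training mode: fit one split/no-split model per coding depth."""
    return {d: CentroidModel(s) for d, s in samples_by_depth.items()}


def predict_leaves(
    models: Dict[int, CentroidModel],
    feature_fn: Callable[[int, int, int], Sequence[float]],
    x: int = 0,
    y: int = 0,
    size: int = 64,
    depth: int = 0,
) -> List[Block]:
    """Transcoding mode: walk the quadtree, asking the model of each depth."""
    # Stop when no model exists for this depth or the model says "no split".
    if depth not in models or not models[depth].predict(feature_fn(x, y, size)):
        return [(x, y, size)]
    half = size // 2
    leaves: List[Block] = []
    for dy in (0, half):
        for dx in (0, half):
            leaves += predict_leaves(models, feature_fn, x + dx, y + dy, half, depth + 1)
    return leaves
```

In this sketch a "no split" prediction at any depth ends the search for that block, which is where a transcoding speedup over full re-encoding would come from; a wrong prediction at depth 0 also propagates to every deeper level, which is one reason per-depth classification accuracy matters.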


Pages: 14
