Deterministic Annealing Based Transform Domain Temporal Predictor Design for Adaptive Video Coding

被引:2
|
作者
Vishwanath, Bharath [1 ]
Nanjundaswamy, Tejaswi [1 ]
Rose, Kenneth [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect Engn, Santa Barbara, CA 93106 USA
关键词
FILTER; MOTION; OPTIMIZATION;
D O I
10.1109/DCC.2019.00027
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Current video coders employ motion compensated pixel-to-pixel prediction, which largely ignores significant spatial correlations and the fact that true temporal correlations vary with spatial frequency. Earlier work from our lab proposed to first spatially decorrelate the block of pixels by performing temporal prediction in the transform domain, and to effectively account for both spatial and temporal correlations. To adapt to variations in video signal statistics, the encoder switches between a set of appropriately designed prediction modes. This setting critically depends on efficient offline learning of transform domain temporal prediction modes. Significant challenges include: i) issues of instability and mismatched statistics inherent to closed loop design; and ii) severe non-convexity of the cost function trapping the system in poor local minima. Statistics mismatch is tackled by an appropriate paradigm for system design in a stable open loop fashion, but which asymptotically mimics closed loop operation. The non-convexity is handled by deterministic annealing, a powerful non-convex optimization tool whose probabilistic formulation allows for direct optimization of the cost function with respect to the discrete set of prediction modes, and whose annealing schedule avoids poor local minima. Experimental results validate the method's efficacy.
引用
收藏
页码:192 / 200
页数:9
相关论文
共 50 条
  • [1] SPHERICAL VIDEO CODING WITH GEOMETRY AND REGION ADAPTIVE TRANSFORM DOMAIN TEMPORAL PREDICTION
    Vishwanath, Bharath
    Rose, Kenneth
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2043 - 2047
  • [2] Transform-domain Temporal Prediction in Video Coding with Spatially Adaptive Spectral Correlations
    Han, Jingning
    Melkote, Vinay
    Rose, Kenneth
    [J]. 2011 IEEE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2011,
  • [3] A DETERMINISTIC ANNEALING APPROACH TO SWITCHED PREDICTOR DESIGN FOR ADAPTIVE COMPRESSION SYSTEMS
    Vishwanath, Bharath
    Nanjundaswamy, Tejaswi
    Rose, Kenneth
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7150 - 7154
  • [4] TRANSFORM DOMAIN SLICE BASED DISTRIBUTED VIDEO CODING
    Elamin, A.
    Jeoti, Varun
    Belhouari, Samir
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2011, 6 (05) : 542 - 550
  • [5] Transform-domain distributed video coding with rate-distortion-based adaptive quantisation
    Chien, W. -J.
    Karam, L. J.
    [J]. IET IMAGE PROCESSING, 2009, 3 (06) : 340 - 354
  • [6] ADAPTIVE TRANSFORM CODING OF VIDEO SIGNALS
    NGAN, KN
    [J]. IEE PROCEEDINGS-F RADAR AND SIGNAL PROCESSING, 1982, 129 (01) : 28 - 40
  • [7] Transform Domain Temporal Prediction and Geodesic Motion Compensation in Spherical Video Coding
    Sivakumar, Kruthika Koratti
    Vishwanath, Bharath
    Rose, Kenneth
    [J]. 2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1127 - 1131
  • [8] A locally temporal adaptive transform scheme for sub-band video coding
    Escoda, OD
    Vandergheynst, P
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 645 - 648
  • [9] Adaptive lapped transform-based image and video coding
    Klausutis, TJ
    Madisetti, VK
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 117 - 128
  • [10] Adaptive deblocking filter for transform domain Wyner-Ziv video coding
    Martins, R.
    Brites, C.
    Ascenso, J.
    Pereira, F.
    [J]. IET IMAGE PROCESSING, 2009, 3 (06) : 315 - 328