Linear transform for simultaneous diagonalization of covariance and perceptual metric matrix in image coding

被引:25
|
作者
Epifanio, I
Gutiérrez, J
Malo, J
机构
[1] Univ Jaume 1, Dept Matemat, Castello 12071, Spain
[2] Univ Valencia, Dept Informat, E-46100 Burjassot, Spain
[3] Univ Valencia, Dept Opt, E-46100 Burjassot, Spain
关键词
image compression; transform coding; statistical redundancy; psychovisual redundancy; perceptual metric;
D O I
10.1016/S0031-3203(02)00325-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Two types of redundancies are contained in images: statistical redundancy and psychovisual redundancy. Image representation techniques for image coding should remove both redundancies in order to obtain good results. In order to establish an appropriate representation, the standard approach to transform coding only considers the statistical redundancy, whereas the psychovisual factors are introduced after the selection of the representation as a simple scalar weighting in the transform domain. In this work, we take into account the psychovisual factors in the definition of the representation together with the statistical factors, by means of the perceptual metric and the covariance matrix, respectively. In general the ellipsoids described by these matrices are not aligned. Therefore, the optimal basis for image representation should simultaneously diagonalize both matrices. This approach to the basis selection problem has several advantages in the particular application of image coding. As the transform domain is Euclidean (by definition), the quantizer design is highly simplified and at the same time, the use of scalar quantizers is truly justified. The proposed representation is compared to covariance-based representations such as the DCT and the KLT or PCA using standard JPEG-like and Max-Lloyd quantizers. (C) 2003 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1799 / 1811
页数:13
相关论文
共 21 条
  • [1] APPLICATION OF IMAGE COVARIANCE-MODELS TO TRANSFORM CODING
    CLARKE, RJ
    [J]. INTERNATIONAL JOURNAL OF ELECTRONICS, 1984, 56 (02) : 245 - 260
  • [2] A PERCEPTUAL METRIC FOR BLIND MEASUREMENT OF BLOCKING ARTIFACTS WITH APPLICATIONS IN TRANSFORM-BLOCK-BASED IMAGE AND VIDEO CODING
    Minoo, Koohyar
    Nguyen, Truong Q.
    [J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 3152 - 3155
  • [3] Compressive Sensing Image Coding with Perceptual Weighting Measuring Matrix
    Song, Yundong
    Wang, Yongfang
    Shang, Xiwu
    Zhang, Zhaoyang
    [J]. ADVANCES ON DIGITAL TELEVISION AND WIRELESS MULTIMEDIA COMMUNICATIONS, 2012, 331 : 264 - 270
  • [4] JPEG-BASED PERCEPTUAL IMAGE CODING WITH BLOCK-BASED IMAGE QUALITY METRIC
    Jin, Lina
    Egiazarian, Karen
    Kuo, C. -C. Jay
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1053 - 1056
  • [5] Perceptual Image Coding Based on Visibility Threshold Model in Wavelet Transform Domain
    韦志辉
    富煜清
    衡伟
    程时昕
    [J]. Journal of Southeast University(English Edition), 1998, (02) : 19 - 22
  • [6] Application of non-linear transform coding to image processing
    Hocke, Jens
    Barth, Erhardt
    Martinetz, Thomas
    [J]. HUMAN VISION AND ELECTRONIC IMAGING XVII, 2012, 8291
  • [7] PERCEPTUAL NOISE SHAPING IN DUAL-TREE COMPLEX WAVELET TRANSFORM FOR IMAGE CODING
    Zhu, Junwu
    Dansereau, Richard M.
    Joslin, Chris
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 237 - 240
  • [8] Exploration of linear discriminant analysis for transform coding in distributed image classification
    Xie, H
    Ortega, A
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 1624 - 1629
  • [9] Image Coding Using Periodic Walsh Piecewise-Linear Transform
    Belgassem, Fituri H.
    [J]. 2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 227 - 230
  • [10] A perceptual color image quality metric using adequate error pooling for coding scheme evaluation
    Le Callet, P
    Barba, D
    [J]. HUMAN VISION AND ELECTRONIC IMAGING VII, 2002, 4662 : 173 - 180