iWave: CNN-Based Wavelet-Like Transform for Image Compression

被引:53
|
作者
Ma, Haichuan [1 ]
Liu, Dong [1 ]
Xiong, Ruiqin [2 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
[2] Peking Univ, Sch Elect Engn & Comp Sci, Inst Digital Media, Beijing 100871, Peoples R China
关键词
Wavelet transforms; Image coding; Wavelet analysis; Task analysis; Kernel; Quantization (signal); Convolutional neural network (CNN); image compression; lifting scheme; wavelet transform; LIFTING SCHEME; CONSTRUCTION;
D O I
10.1109/TMM.2019.2957990
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Wavelet transform is a powerful tool for multiresolution time-frequency analysis. It has been widely adopted in many image processing tasks, such as denoising, enhancement, fusion, and especially compression. Wavelets lead to the successful image coding standard JPEG-2000. Traditionally, wavelets were designed from the signal processing theory with certain assumption on the signal, but natural images are not as ideal as assumed by the theory. How to design content-adaptive wavelets for natural images remains a difficulty. Inspired by the recent progress of convolutional neural network (CNN), we propose iWave as a framework for deriving wavelet-like transform that is more suitable for natural image compression. iWave adopts an update-first lifting scheme, where the prediction filter is a trained CNN, to achieve wavelet-like transform. The CNN can be embedded into a deep network that is analogous to an auto-encoder, which is trained end-to-end. The trained wavelet-like transform still possesses the lifting structure, which ensures perfect reconstruction, supports multiresolution analysis, and is more interpretable than the deep networks trained as "black boxes." We perform experiments to verify the generality as well as the speciality of iWave in comparison with JPEG-2000. When trained with a generic set of natural images and tested on the Kodak dataset, iWave achieves on average 4.4% and up to 14% BD-rate reductions. When trained and tested with a specific kind of textures, iWave provides as high as 27% BD-rate reduction.
引用
收藏
页码:1667 / 1679
页数:13
相关论文
共 50 条
  • [1] CNN-Based DCT-Like Transform for Image Compression
    Liu, Dong
    Ma, Haichuan
    Xiong, Zhiwei
    Wu, Feng
    [J]. MULTIMEDIA MODELING, MMM 2018, PT II, 2018, 10705 : 61 - 72
  • [2] End-to-End Optimized Versatile Image Compression With Wavelet-Like Transform
    Ma, Haichuan
    Liu, Dong
    Yan, Ning
    Li, Houqiang
    Wu, Feng
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1247 - 1263
  • [3] Improved Transform Structures For Learned Wavelet-like Fully Scalable Image Compression
    Li, Xinyue
    Naman, Aous
    Taubman, David
    [J]. 2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [4] Wavelet-like Transform with Subbands Fusion in Decoupled Structure for Deep Image Compression
    Ma, Ke
    Wu, Yaojun
    Zhang, Zhaobin
    Esenlik, Semih
    Sun, Xiaoyan
    Zhang, Kai
    Zhang, Li
    [J]. 2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [5] Still Image Coding Using a Wavelet-Like Transform
    Maalouf, Aldo
    Larabi, Mohamed-Chaker
    Fernandez-Maloigne, Christine
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1893 - 1896
  • [6] A CONTENT-ADAPTIVE WAVELET-LIKE TRANSFORM FOR ALIASING SUPPRESSION IN IMAGE AND VIDEO COMPRESSION
    Gan, Jonathan
    Taubman, David
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3821 - 3824
  • [7] aiWave: Volumetric Image Compression With 3-D Trained Affine Wavelet-Like Transform
    Xue, Dongmei
    Ma, Haichuan
    Li, Li
    Liu, Dong
    Xiong, Zhiwei
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (03) : 606 - 618
  • [8] Analog wavelet-like transform based on stimulated Brillouin scattering
    Zuo, Pengcheng
    Ma, Dong
    Chen, Yang
    [J]. OPTICS LETTERS, 2023, 48 (01) : 29 - 32
  • [9] CNN image compression and reconstruction based on non-orthogonal wavelet transform
    Mori, I
    Matsuyama, M
    Tanji, Y
    Tanaka, M
    [J]. PROCEEDINGS OF THE 2000 6TH IEEE INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS (CNNA 2000), 2000, : 83 - 86
  • [10] Robust Reversible Watermarking Scheme Based on Wavelet-Like Transform
    Mohammed, Rasha Thabit
    Khoo, Bee Ee
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013, : 354 - 359