Ensemble Learning-Based Rate-Distortion Optimization for End-to-End Image Compression

被引:26
|
作者
Wang, Yefei [1 ]
Liu, Dong [1 ]
Ma, Siwei [2 ]
Wu, Feng [1 ]
Gao, Wen [2 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
[2] Peking Univ, Natl Engn Lab Video Technol, Beijing 100871, Peoples R China
关键词
Image coding; Adaptation models; Entropy coding; Transforms; Rate-distortion; Optimization; Decoding; Ensemble learning; image compression; rate-distortion optimization; REGRESSION;
D O I
10.1109/TCSVT.2020.3000331
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
End-to-end image compression using trained deep networks as encoding/decoding models has been developed substantially in the recent years. Previous work is limited in using a single encoding/decoding model, whereas we explore the usage of multiple encoding/decoding models as an ensemble. We propose several methods to obtain multiple models. First, we adopt the boosting strategy to train multiple networks with diversity as an ensemble. Second, we train an ensemble of multiple probability distribution models to reduce the distribution gap for efficient entropy coding. Third, we present a geometric transform-based self-ensemble method. The multiple models can be regarded as the multiple coding modes, similar to those in non-deep video coding schemes. We further adopt block-level model/mode selection at the encoder side to pursue rate-distortion optimization, where we use hierarchical block partitioning to improve the adaptation ability. Compared with single-model end-to-end compression, our proposed method improves the compression efficiency significantly, leading to 21% BD-rate reduction on the Kodak dataset, without increasing the decoding complexity. On the other hand, when keeping the same compression efficiency, our method can use much simplified decoding models, where the floating-point operations are reduced by 70%.
引用
收藏
页码:1193 / 1207
页数:15
相关论文
共 50 条
  • [1] Learning True Rate-Distortion-Optimization for End-To-End Image Compression
    Brand, Fabian
    Fischer, Kristian
    Kopte, Alexander
    Kaup, Andre
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 443 - 443
  • [2] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [3] End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression
    Yilmaz, M. Akin
    Tekalp, A. Murat
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1311 - 1315
  • [4] DEFENDING AGAINST NOISE BY CHARACTERIZING THE RATE-DISTORTION FUNCTIONS IN END-TO-END NOISY IMAGE COMPRESSION
    Li, Binzhe
    Wang, Shurun
    Wang, Shiqi
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3727 - 3731
  • [5] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081
  • [6] EICNet: An End-to-End Efficient Learning-Based Image Compression Network
    Cheng, Ziyi
    [J]. IEEE Access, 2024, 12 : 142668 - 142676
  • [7] End-to-end rate-distortion optimized motion estimation
    Wan, Shuai
    Izquierdo, Ebroul
    Yang, Fuzheng
    Chang, Yilin
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 809 - +
  • [8] End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression
    Yilmaz, M. Akin
    Tekalp, A. Murat
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 974 - 983
  • [9] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
    Gao Pan
    Peng Qiang
    Wang Qionghua
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (02) : 277 - 283
  • [10] Error-Resilient Multi-view Video Coding Based on End-to-End Rate-Distortion Optimization
    GAO Pan
    PENG Qiang
    WANG Qionghua
    [J]. Chinese Journal of Electronics, 2016, 25 (02) : 277 - 283