Unveiling the Future of Human and Machine Coding: A Survey of End-to-End Learned Image Compression

被引:2
|
作者
Huang, Chen-Hsiu [1 ]
Wu, Ja-Ling [1 ]
机构
[1] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan
关键词
deep learning; image compression; video coding; neural compression; coding for machines; EFFICIENCY;
D O I
10.3390/e26050357
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
End-to-end learned image compression codecs have notably emerged in recent years. These codecs have demonstrated superiority over conventional methods, showcasing remarkable flexibility and adaptability across diverse data domains while supporting new distortion losses. Despite challenges such as computational complexity, learned image compression methods inherently align with learning-based data processing and analytic pipelines due to their well-suited internal representations. The concept of Video Coding for Machines has garnered significant attention from both academic researchers and industry practitioners. This concept reflects the growing need to integrate data compression with computer vision applications. In light of these developments, we present a comprehensive survey and review of lossy image compression methods. Additionally, we provide a concise overview of two prominent international standards, MPEG Video Coding for Machines and JPEG AI. These standards are designed to bridge the gap between data compression and computer vision, catering to practical industry use cases.
引用
收藏
页数:35
相关论文
共 50 条
  • [41] NN-based Embedment of Watermark in End-to-end Image Compression
    Lee, EunSeong
    Lee, Jongseok
    Seo, Young-Ho
    Sim, Donggyu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [42] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [43] End-to-end optimized image compression with the frequency-oriented transform
    Zhang, Yuefeng
    Lin, Kai
    MACHINE VISION AND APPLICATIONS, 2024, 35 (02)
  • [44] End-to-End Facial Image Compression with Integrated Semantic Distortion Metric
    He, Tianyu
    Chen, Zhibo
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [45] Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG
    Yuan, Xin
    Haimi-Cohen, Raziel
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2889 - 2904
  • [46] End-to-End Multispectral Image Compression Using Convolutional Neural Network
    Kong Fanqiang
    Zhou Yongbo
    Shen Qiu
    Wen Keyao
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (10):
  • [47] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081
  • [48] CPIPS: Learning to Preserve Perceptual Distances in End-to-End Image Compression
    Huang, Chen-Hsiu
    Wu, Ja-Ling
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1705 - 1711
  • [49] TRANSFORM SKIP INSPIRED END-TO-END COMPRESSION FOR SCREEN CONTENT IMAGE
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    Wu, Yaojun
    Li, Yue
    Li, Junru
    Wang, Shiqi
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3848 - 3852
  • [50] Reducing The Amortization Gap of Entropy Bottleneck In End-to-End Image Compression
    Balcilar, Muhammet
    Damodaran, Bharath
    Hellier, Pierre
    2022 PICTURE CODING SYMPOSIUM (PCS), 2022, : 115 - 119