End-to-end optimized image compression for machines, a study

被引:42
|
作者
Chamain, Lahiru D. [1 ,2 ]
Racape, Fabien [1 ]
Begaint, Jean [1 ]
Pushparaja, Akshay [1 ]
Feltman, Simon [1 ]
机构
[1] InterDigital AI Lab, 4410 El Camino Real, Los Altos, CA 94022 USA
[2] Univ Calif Davis, 1 Shields Ave, Davis, CA 95616 USA
关键词
D O I
10.1109/DCC50243.2021.00024
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately, conventional coding tools are challenging to specialize for machine tasks as they were originally designed for human perception. However, neural network based codecs can be jointly trained end-to-end with any convolutional neural network (CNN)-based task model. In this paper, we propose to study an end-to-end framework enabling efficient image compression for remote machine task analysis, using a chain composed of a compression module and a task algorithm that can be optimized end-to-end. We show that it is possible to significantly improve the task accuracy when fine-tuning jointly the codec and the task networks, especially at low bit-rates. Depending on training or deployment constraints, selective fine-tuning can be applied only on the encoder, decoder or task network and still achieve rate-accuracy improvements over an off-the-shelf codec and task network. Our results also demonstrate the flexibility of end-to-end pipelines for practical applications.
引用
收藏
页码:163 / 172
页数:10
相关论文
共 50 条
  • [31] Saliency Map-Guided End-to-End Image Coding for Machines
    Peng, Bo
    Lin, Tianxiang
    Jin, Dengchao
    Pan, Zhaoqing
    Lei, Jianjun
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1755 - 1759
  • [32] Compression of End-to-End Models
    Pang, Ruoming
    Sainath, Tara N.
    Prabhavalkar, Rohit
    Gupta, Suyog
    Wu, Yonghui
    Zhang, Shuyuan
    Chiu, Chung-cheng
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
  • [33] NN-based Embedment of Watermark in End-to-end Image Compression
    Lee, EunSeong
    Lee, Jongseok
    Seo, Young-Ho
    Sim, Donggyu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY, IWAIT 2023, 2023, 12592
  • [34] TRELLIS-CODED QUANTIZATION FOR END-TO-END LEARNED IMAGE COMPRESSION
    Suhring, Karsten
    Schafer, Michael
    Pfaff, Jonathan
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3306 - 3310
  • [35] END-TO-END LEARNED IMAGE COMPRESSION WITH FIXED POINT WEIGHT QUANTIZATION
    Sun, Heming
    Cheng, Zhengxue
    Takeuchi, Masaru
    Katto, Jiro
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3359 - 3363
  • [36] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [37] End-to-End Facial Image Compression with Integrated Semantic Distortion Metric
    He, Tianyu
    Chen, Zhibo
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [38] Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG
    Yuan, Xin
    Haimi-Cohen, Raziel
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (11) : 2889 - 2904
  • [39] End-to-End Multispectral Image Compression Using Convolutional Neural Network
    Kong Fanqiang
    Zhou Yongbo
    Shen Qiu
    Wen Keyao
    CHINESE JOURNAL OF LASERS-ZHONGGUO JIGUANG, 2019, 46 (10):
  • [40] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081