End-to-end optimized image compression for machines, a study

被引:42
|
作者
Chamain, Lahiru D. [1 ,2 ]
Racape, Fabien [1 ]
Begaint, Jean [1 ]
Pushparaja, Akshay [1 ]
Feltman, Simon [1 ]
机构
[1] InterDigital AI Lab, 4410 El Camino Real, Los Altos, CA 94022 USA
[2] Univ Calif Davis, 1 Shields Ave, Davis, CA 95616 USA
关键词
D O I
10.1109/DCC50243.2021.00024
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
An increasing share of image and video content is analyzed by machines rather than viewed by humans, and therefore it becomes relevant to optimize codecs for such applications where the analysis is performed remotely. Unfortunately, conventional coding tools are challenging to specialize for machine tasks as they were originally designed for human perception. However, neural network based codecs can be jointly trained end-to-end with any convolutional neural network (CNN)-based task model. In this paper, we propose to study an end-to-end framework enabling efficient image compression for remote machine task analysis, using a chain composed of a compression module and a task algorithm that can be optimized end-to-end. We show that it is possible to significantly improve the task accuracy when fine-tuning jointly the codec and the task networks, especially at low bit-rates. Depending on training or deployment constraints, selective fine-tuning can be applied only on the encoder, decoder or task network and still achieve rate-accuracy improvements over an off-the-shelf codec and task network. Our results also demonstrate the flexibility of end-to-end pipelines for practical applications.
引用
收藏
页码:163 / 172
页数:10
相关论文
共 50 条
  • [21] End-to-end Optimized Video Compression with MV-Residual Prediction
    Wu, XiangJi
    Zhang, Ziwen
    Feng, Jie
    Zhou, Lei
    Wu, Junmin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 611 - 614
  • [22] End-to-end image compression method based on perception metric
    Shuai Liu
    Yingcong Huang
    Huoxiang Yang
    Yongsheng Liang
    Wei Liu
    Signal, Image and Video Processing, 2022, 16 : 1803 - 1810
  • [23] End-to-end system consideration of the Galileo image compression system
    Cheung, K
    Tong, K
    Belongie, M
    IGARSS '96 - 1996 INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM: REMOTE SENSING FOR A SUSTAINABLE FUTURE, VOLS I - IV, 1996, : 1035 - 1038
  • [24] End-to-End Learning-Based Image Compression: A Review
    Chen Jimin
    Lin Zehao
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (22)
  • [25] End-to-End Learned Image Compression with Augmented Normalizing Flows
    Ho, Yung-Han
    Chan, Chih-Chun
    Peng, Wen-Hsiao
    Hang, Hsueh-Ming
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1931 - 1935
  • [26] An end-to-end spike-based image compression architecture
    Doutsi, Effrosyni
    Antonini, Marc
    Tsakalides, Panagiotis
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 818 - 820
  • [27] End-to-end image compression method based on perception metric
    Liu, Shuai
    Huang, Yingcong
    Yang, Huoxiang
    Liang, Yongsheng
    Liu, Wei
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (07) : 1803 - 1810
  • [28] Estimating the resize parameter in end-to-end learned image compression
    Chen, Li-Heng
    Bampis, Christos G.
    Li, Zhi
    Krasula, Lukas
    Bovik, Alan C.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2025, 135
  • [29] A Reference Resource Based End-to-End Image Compression Scheme
    Yin, Wenbin
    Fan, Xiaopeng
    Shi, Yunhui
    Zuo, Wangmeng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 534 - 544
  • [30] End-to-End Image Patch Quality Assessment for Image/Video With Compression Artifacts
    Tung Thanh Pham
    Xiem Van Hoang
    Nghia Trung Nguyen
    Duong Trieu Dinh
    Le Thanh Ha
    IEEE ACCESS, 2020, 8 : 215157 - 215172