Learned image compression via multiscale prior for machine recognition

Cited by: 0
Authors
Shi, Yuan [1]
Shen, Liquan [1, 2]
Wang, Qiang [1]
Affiliations
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[2] Key Lab Specialty Fiber Opt & Opt Access Networks, Joint Int Res Lab Specialty Fiber Opt & Adv Commun, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multiscale priors; image compression; pixel fidelity-driven; machine recognition-driven; interaction refinement module; machine vision perceptual loss; semantic variation weight; FUSION;
DOI
10.1117/1.JEI.32.1.013003
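Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract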
The conventional image compression framework is pixel fidelity-driven and can generate compressed images with considerable visual quality even at low bit rates. However, such methods emphasize the human visual experience and ignore the needs of machine recognition-driven tasks. To this end, we propose an image compression framework that uses multiscale prior information extracted from a machine perceptual model to improve the machine recognition accuracy of compressed images. Specifically, an interaction refinement module (IRM) is designed to let the multiscale priors interact with each other, adaptively retaining machine recognition-relevant features and strengthening their expression in the compact features. To further improve recognition accuracy, a machine vision perceptual loss is built on the semantic variation weight, which weights each layer by the degree of semantic variation between adjacent deep layers in the multiscale priors. This loss is used to optimize the semantic distortion of compressed images so that important semantic information is retained. Experimental results show that, compared with the compression methods BPG, WebP, Mentzer, NIC, IUWD, and RCIS, the proposed method improves Top-1 recognition accuracy by 10.9%, 19%, 11.6%, 12.9%, 6%, and 2.7%, respectively, at a low bit rate (0.2 bpp). In addition, performance improvements on other machine recognition networks and machine vision tasks show the versatility of the proposed method.
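The abstract does not give a formula for the machine vision perceptual loss, so the following is only a rough sketch of the general idea: a per-layer feature distortion weighted by how much the features change between adjacent layers. Everything concrete here is an assumption for illustration, not the authors' definition: the function names, the use of RMS activation as a stand-in for "semantic content", the absolute difference as the "variation degree", and mean squared error as the per-layer distortion. Feature maps are represented as flattened lists of floats.

```python
from math import sqrt


def rms_activation(feature):
    """Root-mean-square activation of one flattened feature map."""
    return sqrt(sum(v * v for v in feature) / len(feature))


def semantic_variation_weights(features):
    """Hypothetical weights: normalized change in RMS activation
    between adjacent layers (an assumed proxy, not the paper's formula)."""
    n = len(features)
    if n < 2:
        return [1.0 / n] * n  # nothing to compare against
    energy = [rms_activation(f) for f in features]
    # Variation of layer i relative to layer i-1.
    variation = [abs(energy[i] - energy[i - 1]) for i in range(1, n)]
    # The first layer has no predecessor, so it reuses the first variation.
    variation = [variation[0]] + variation
    total = sum(variation)
    if total == 0.0:
        return [1.0 / n] * n  # fall back to uniform weights
    return [v / total for v in variation]


def mse(a, b):
    """Mean squared error between two equally sized flattened feature maps."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def machine_perceptual_loss(feats_original, feats_compressed):
    """Weighted sum of per-layer feature distortions."""
    weights = semantic_variation_weights(feats_original)
    return sum(
        w * mse(fo, fc)
        for w, fo, fc in zip(weights, feats_original, feats_compressed)
    )


# Toy multiscale features from three layers of a (hypothetical) recognition model.
original = [[1.0, 1.0], [2.0, 2.0], [4.0, 4.0]]
compressed = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]
loss = machine_perceptual_loss(original, compressed)
```

In this toy setup, the deepest layer changes most relative to its predecessor, so its distortion receives the largest weight. In a real training pipeline the features would come from a pretrained recognition network and the loss would be computed with a differentiable tensor library.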
Pages: 18