Learned image compression via multiscale prior for machine recognition

Cited by: 0
Authors
Shi, Yuan [1]
Shen, Liquan [1,2]
Wang, Qiang [1]
Affiliations
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[2] Key Lab Specialty Fiber Opt & Opt Access Networks, Joint Int Res Lab Specialty Fiber Opt & Adv Commun, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multiscale priors; image compression; pixel fidelity-driven; machine recognition-driven; interaction refinement module; machine vision perceptual loss; semantic variation weight; FUSION;
DOI
10.1117/1.JEI.32.1.013003
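Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809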
Abstract
Conventional image compression frameworks are pixel fidelity-driven and can generate compressed images with considerable visual quality even at low bit rates. However, these methods emphasize the human visual experience and ignore the needs of machine recognition-driven tasks. To address this, we propose an image compression framework that uses multiscale prior information extracted from a machine perceptual model to improve the machine recognition accuracy of compressed images. Specifically, an interaction refinement module (IRM) is designed to let the multiscale priors interact with one another, adaptively retaining machine recognition-relevant features and enhancing their expression in the compact features. To further improve recognition accuracy, a machine vision perceptual loss is built on a semantic variation weight, which weights the degree of semantic variation between adjacent deep layers in the multiscale priors. This loss is used to optimize the semantic distortion of compressed images so that important semantic information is retained. Experimental results show that, compared with the compression methods BPG, WebP, Mentzer, NIC, IUWD, and RCIS, the proposed method improves Top-1 recognition accuracy by 10.9%, 19%, 11.6%, 12.9%, 6%, and 2.7%, respectively, at a low bit rate (0.2 bpp). In addition, performance improvements on other machine recognition networks and machine vision tasks demonstrate the versatility of the proposed method.
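The abstract does not give the formula for the machine vision perceptual loss, so the following is only a minimal sketch of one way such a loss could be set up, not the authors' implementation. It assumes (hypothetically) that each prior layer yields a feature map of shape (C, H, W) with the same channel count C, that the "semantic variation degree" between adjacent layers is measured as one minus the cosine similarity of their globally pooled features, and that each normalized weight scales the feature distortion of the deeper layer in the pair. The function names are illustrative.

```python
import numpy as np

def pooled(feature):
    """Global-average-pool a (C, H, W) feature map to a length-C vector."""
    return feature.mean(axis=(1, 2))

def semantic_variation_weights(features, eps=1e-8):
    """Assumed definition: 1 - cosine similarity between adjacent layers,
    normalized to sum to (approximately) 1. One weight per adjacent pair."""
    variations = []
    for shallow, deep in zip(features[:-1], features[1:]):
        a, b = pooled(shallow), pooled(deep)
        cos = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + eps))
        variations.append(1.0 - cos)
    variations = np.asarray(variations)
    return variations / (variations.sum() + eps)

def perceptual_loss(feats_orig, feats_comp):
    """Weighted sum of per-layer feature MSE between the original and the
    compressed image; weights come from the original image's features and
    are applied to the deeper layer of each adjacent pair (an assumption)."""
    weights = semantic_variation_weights(feats_orig)
    per_layer = [np.mean((fo - fc) ** 2)
                 for fo, fc in zip(feats_orig[1:], feats_comp[1:])]
    return float(np.dot(weights, per_layer))

# Toy stand-ins for multiscale features of a recognition network.
rng = np.random.default_rng(0)
feats = [rng.standard_normal((8, 4, 4)) for _ in range(4)]
noisy = [f + 0.1 * rng.standard_normal(f.shape) for f in feats]

weights = semantic_variation_weights(feats)
loss_same = perceptual_loss(feats, feats)
loss_noisy = perceptual_loss(feats, noisy)
```

In this sketch, layers whose pooled representation changes most relative to the previous layer contribute most to the loss; in a training setup the features would come from the recognition network rather than random arrays.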
Pages: 18