Learned image compression via multiscale prior for machine recognition

被引:0
|
作者
Shi, Yuan [1 ]
Shen, Liquan [1 ,2 ]
Wang, Qiang [1 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[2] Key Lab Specialty Fiber Opt & Opt Access Networks, Joint Int Res Lab Specialty Fiber Opt & Adv Commun, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
multiscale priors; image compression; pixel fidelity-driven; machine recognition-driven; interaction refinement module; machine vision perceptual loss; semantic variation weight; FUSION;
D O I
10.1117/1.JEI.32.1.013003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The conventional image compression framework is pixel fidelity-driven, which can generate compressed images with considerable visual quality even at low bit rates. However, these methods emphasize the human visual experience and ignore the need for machine recognition-driven tasks. To this end, we propose an image compression framework that utilizes multiscale prior information extracted from the machine perceptual model to improve the machine recognition accuracy of compressed images. Specifically, the interaction refinement module (IRM) is designed to interact multiscale prior information with each other, adaptively retaining machine recognition-relevant features to enhance its expression on compact features. To further improve the accuracy of machine recognition, machine vision perceptual loss is designed on semantic variation weight, which is the weight of semantic variation degree of deep adjacent layers in multiscale priors. Machine vision perceptual loss is used to optimize the semantic distortion of compressed images for retaining important semantic information. Experimental results show that compared with compression methods including BPG, WebP, Mentzer, NIC, IUWD, and RCIS, the Top-1 recognition accuracy of the proposed method is improved by 10.9%, 19%, 11.6%, 12.9%, 6%, and 2.7% at a lower bit rate (0.2 bpp). In addition, the performance improvement on other machine recognition networks and machine vision tasks shows the versatility of the proposed method.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Optimizing Multiscale SSIM for Compression via MLDS
    Charrier, Christophe
    Knoblauch, Kenneth
    Maloney, Laurence T.
    Bovik, Alan C.
    Moorthy, Anush K.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (12) : 4682 - 4694
  • [42] Reconstruction-Free Image Compression for Machine Vision via Knowledge Transfer
    Tu, Hanyue
    Li, Li
    Zhou, Wengang
    Li, Houqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (10)
  • [43] PERCEPTUAL LEARNED IMAGE COMPRESSION VIA END-TO-END JND-BASED OPTIMIZATION
    Pakdaman, Farhad
    Nami, Sanaz
    Gabbouj, Moncef
    2024 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2024, : 1146 - 1151
  • [44] THE IMPACT OF JPEG COMPRESSION ON PRIOR IMAGE NOISE
    Gardella, Marina
    Nikoukhah, Tina
    Li, Yanhao
    Bammey, Quentin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2689 - 2693
  • [45] IMAGE COMPRESSION AND RESTORATION INCORPORATING PRIOR KNOWLEDGE
    HALL, TJ
    DARLING, AM
    FIDDY, MA
    OPTICS LETTERS, 1982, 7 (10) : 467 - 468
  • [46] Multiscale color and texture invariants for image recognition
    Wanderley, JFC
    Fisher, MH
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 862 - 865
  • [47] Image compression algorithms based on multidirectional and multiscale transform
    Lian, Qiu-Sheng
    Chen, Shu-Zhen
    Kong, Ling-Fu
    Guangxue Jishu/Optical Technique, 2006, 32 (01): : 67 - 70
  • [48] Image Compression via Wavelets and Row Compression
    Hudachek-Buswell, Mary
    Stewart, Michael
    Belkasim, Saeid
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1547 - 1550
  • [49] Face Recognition using Support Vector Machine and Multiscale Directional Image Representation Methods: A comparative study
    da Costa, Daniel M. M.
    Peres, Sarajane M.
    Lima, Clodoaldo A. M.
    Mustaro, Pollyana
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [50] IMAGE-DEPENDENT LOCAL ENTROPY MODELS FOR LEARNED IMAGE COMPRESSION
    Minnen, David
    Toderici, George
    Singh, Saurabh
    Hwang, Sung Jin
    Covell, Michele
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 430 - 434