Learned image compression via multiscale prior for machine recognition

被引:0
|
作者
Shi, Yuan [1 ]
Shen, Liquan [1 ,2 ]
Wang, Qiang [1 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[2] Key Lab Specialty Fiber Opt & Opt Access Networks, Joint Int Res Lab Specialty Fiber Opt & Adv Commun, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
multiscale priors; image compression; pixel fidelity-driven; machine recognition-driven; interaction refinement module; machine vision perceptual loss; semantic variation weight; FUSION;
D O I
10.1117/1.JEI.32.1.013003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The conventional image compression framework is pixel fidelity-driven, which can generate compressed images with considerable visual quality even at low bit rates. However, these methods emphasize the human visual experience and ignore the need for machine recognition-driven tasks. To this end, we propose an image compression framework that utilizes multiscale prior information extracted from the machine perceptual model to improve the machine recognition accuracy of compressed images. Specifically, the interaction refinement module (IRM) is designed to interact multiscale prior information with each other, adaptively retaining machine recognition-relevant features to enhance its expression on compact features. To further improve the accuracy of machine recognition, machine vision perceptual loss is designed on semantic variation weight, which is the weight of semantic variation degree of deep adjacent layers in multiscale priors. Machine vision perceptual loss is used to optimize the semantic distortion of compressed images for retaining important semantic information. Experimental results show that compared with compression methods including BPG, WebP, Mentzer, NIC, IUWD, and RCIS, the Top-1 recognition accuracy of the proposed method is improved by 10.9%, 19%, 11.6%, 12.9%, 6%, and 2.7% at a lower bit rate (0.2 bpp). In addition, the performance improvement on other machine recognition networks and machine vision tasks shows the versatility of the proposed method.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Joint Denoising/Compression of Image Contours via Shape Prior and Context Tree
    Zheng, Amin
    Cheung, Gene
    Florencio, Dinei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3332 - 3344
  • [22] Hybrid-context-based multi-prior entropy modeling for learned lossless image compression
    Fu, Chuan
    Du, Bo
    Zhang, Liangpei
    PATTERN RECOGNITION, 2024, 155
  • [23] A Secure Learned Image Codec for Authenticity Verification via Self-Destructive Compression
    Huang, Chen-Hsiu
    Wu, Ja-Ling
    BIG DATA AND COGNITIVE COMPUTING, 2025, 9 (01)
  • [24] Multiscale image compression for satellite telecommunication systems
    Bagmanov, Valery H.
    Kharitonov, Svjatoslav V.
    Meshkov, Ivan K.
    Sultanov, Albert H.
    OPTICAL TECHNOLOGIES FOR TELECOMMUNICATIONS 2007, 2008, 7026
  • [25] Unveiling the Future of Human and Machine Coding: A Survey of End-to-End Learned Image Compression
    Huang, Chen-Hsiu
    Wu, Ja-Ling
    ENTROPY, 2024, 26 (05)
  • [26] Sparse representation with learned multiscale dictionary for image fusion
    Yin, Haitao
    NEUROCOMPUTING, 2015, 148 : 600 - 610
  • [27] HYPERSPECTRAL IMAGE REPRESENTATION USING LEARNED MULTISCALE DICTIONARIES
    Wu, Qian
    Zhang, Rong
    Xu, Dawei
    2014 6TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2014,
  • [28] Data compression for image recognition
    Chang, Y
    Kumar, D
    Mahalingam, N
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 399 - 402
  • [29] Learned Image Compression With Separate Hyperprior Decoders
    Zan, Zhao
    Liu, Chao
    Sun, Heming
    Zeng, Xiaoyang
    Fan, Yibo
    IEEE OPEN JOURNAL OF CIRCUITS AND SYSTEMS, 2021, 2 : 627 - 632
  • [30] Learned Image Compression with Frequency Domain Loss
    Lee, Soonbin
    Jeong, Jong-Beom
    Kim, Inae
    Ryu, Eun-Seok
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 1 - 4