A unified efficient deep image compression framework and its application on human-centric Task

被引:2
|
作者
Chen, Xueyuan [1 ]
Hu, Zhihao [1 ]
Lu, Guo [2 ]
Liu, Jiaheng [1 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
关键词
Image compression; Neural network; Auto-encoder; Gaussian mixture model;
D O I
10.1007/s11042-023-17696-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework (Lu et al. 2019) to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method (Minnen et al. 2018). Moreover, we also evaluate the performance of the human-centric task (i.e., face recognition) by using different coding strategies.
引用
收藏
页码:73407 / 73425
页数:19
相关论文
共 50 条
  • [41] UGC: Unified GAN Compression for Efficient Image-to-Image Translation
    Ren, Yuxi
    Wu, Jie
    Zhang, Peng
    Zhang, Manlin
    Xiao, Xuefeng
    He, Qian
    Wang, Rui
    Zheng, Min
    Pan, Xin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17235 - 17245
  • [42] A Novel Deep Progressive Image Compression Framework
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Lu, Guo
    Gao, Zhiyong
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [43] A Unified Algebraic Framework for Fuzzy Image Compression and Mathematical Morphology
    Russo, Ciro
    FUZZY LOGIC AND APPLICATIONS, 2009, 5571 : 205 - 212
  • [44] Deep Image Interpolation: A Unified Unsupervised Framework for Pansharpening
    Gao, Jianhao
    Li, Jie
    Su, Xin
    Jiang, Menghui
    Yuan, Qiangqiang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 608 - 617
  • [45] A Unified Deep Learning Framework for ssTEM Image Restoration
    Deng, Shiyu
    Huang, Wei
    Chen, Chang
    Fu, Xueyang
    Xiong, Zhiwei
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (12) : 3734 - 3746
  • [46] AN EFFICIENT FRAMEWORK FOR LOSSLESS COLOR IMAGE COMPRESSION
    Luo, Zhikui
    Wan, Yi
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 380 - 384
  • [47] Development of a Human-Centric Knowledge Management Framework Through the Integration Between PLM and MES
    Marongiu, Giovanni
    Bruno, Giulia
    Lombardi, Franco
    PRODUCT LIFECYCLE MANAGEMENT: LEVERAGING DIGITAL TWINS, CIRCULAR ECONOMY, AND KNOWLEDGE MANAGEMENT FOR SUSTAINABLE INNOVATION, PT II, PLM 2023, 2024, 702 : 119 - 129
  • [48] A multidimensional human-centric framework for environmental intelligence: Air pollution and noise in smart cities
    Bardoutsos, Andreas
    Filios, Gabriel
    Katsidimas, Ioannis
    Krousarlis, Thomas
    Nikoletseas, Sotiris
    Tzamalis, Pantelis
    16TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2020), 2020, : 155 - 164
  • [49] A human-centric framework for context-aware flowable services in cloud computing environments
    Zhu, Yishui
    Shtykh, Roman Y.
    Jin, Qun
    INFORMATION SCIENCES, 2014, 257 : 231 - 247
  • [50] You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception
    Jin, Sheng
    Li, Shuhuai
    Li, Tong
    Liu, Wentao
    Qian, Chen
    Luo, Ping
    COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 126 - 146