A unified efficient deep image compression framework and its application on human-centric Task

被引：2

作者：

Chen, Xueyuan ^{[1
]}

Hu, Zhihao ^{[1
]}

Lu, Guo ^{[2
]}

Liu, Jiaheng ^{[1
]}

机构：

[1] Beihang Univ, Beijing, Peoples R China

[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 83卷 / 29期

关键词：

Image compression; Neural network; Auto-encoder; Gaussian mixture model;

D O I：

10.1007/s11042-023-17696-6

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework (Lu et al. 2019) to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method (Minnen et al. 2018). Moreover, we also evaluate the performance of the human-centric task (i.e., face recognition) by using different coding strategies.

引用

页码：73407 / 73425

页数：19

共 50 条

[41] UGC: Unified GAN Compression for Efficient Image-to-Image Translation
Ren, Yuxi
Wu, Jie
Zhang, Peng
Zhang, Manlin
Xiao, Xuefeng
He, Qian
Wang, Rui
Zheng, Min
Pan, Xin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17235 - 17245
[42] A Novel Deep Progressive Image Compression Framework
Cai, Chunlei
Chen, Li
Zhang, Xiaoyun
Lu, Guo
Gao, Zhiyong
2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
[43] A Unified Algebraic Framework for Fuzzy Image Compression and Mathematical Morphology
Russo, Ciro
FUZZY LOGIC AND APPLICATIONS, 2009, 5571 : 205 - 212
[44] Deep Image Interpolation: A Unified Unsupervised Framework for Pansharpening
Gao, Jianhao
Li, Jie
Su, Xin
Jiang, Menghui
Yuan, Qiangqiang
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 608 - 617
[45] A Unified Deep Learning Framework for ssTEM Image Restoration
Deng, Shiyu
Huang, Wei
Chen, Chang
Fu, Xueyang
Xiong, Zhiwei
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (12) : 3734 - 3746
[46] AN EFFICIENT FRAMEWORK FOR LOSSLESS COLOR IMAGE COMPRESSION
Luo, Zhikui
Wan, Yi
PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 380 - 384
[47] Development of a Human-Centric Knowledge Management Framework Through the Integration Between PLM and MES
Marongiu, Giovanni
Bruno, Giulia
Lombardi, Franco
PRODUCT LIFECYCLE MANAGEMENT: LEVERAGING DIGITAL TWINS, CIRCULAR ECONOMY, AND KNOWLEDGE MANAGEMENT FOR SUSTAINABLE INNOVATION, PT II, PLM 2023, 2024, 702 : 119 - 129
[48] A multidimensional human-centric framework for environmental intelligence: Air pollution and noise in smart cities
Bardoutsos, Andreas
Filios, Gabriel
Katsidimas, Ioannis
Krousarlis, Thomas
Nikoletseas, Sotiris
Tzamalis, Pantelis
16TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2020), 2020, : 155 - 164
[49] A human-centric framework for context-aware flowable services in cloud computing environments
Zhu, Yishui
Shtykh, Roman Y.
Jin, Qun
INFORMATION SCIENCES, 2014, 257 : 231 - 247
[50] You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception
Jin, Sheng
Li, Shuhuai
Li, Tong
Liu, Wentao
Qian, Chen
Luo, Ping
COMPUTER VISION-ECCV 2024, PT XVIII, 2025, 15076 : 126 - 146

← 1 2 3 4 5 →