Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

被引：5

作者：

Zhang, Pingping ^{[1
]}

Wang, Shiqi ^{[1
,2
]}

Wang, Meng ^{[1
]}

Li, Jiguo ^{[3
]}

Wang, Xu ^{[4
,5
]}

Kwong, Sam ^{[1
,2
]}

机构：

[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China

[3] Univ Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China

[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[5] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518060, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023年 / 33卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Semantic image compression; cross-modality; scalable coding;

D O I：

10.1109/TCSVT.2023.3241225

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This article proposes the scalable cross-modality compression (SCMC) paradigm, in which the image compression problem is further cast into a representation task by hierarchically sketching the image with different modalities. Herein, we adopt the conceptual organization philosophy to model the overwhelmingly complicated visual patterns, based upon the semantic, structure, and signal level representation accounting for different tasks. The SCMC paradigm that incorporates the representation at different granularities supports diverse application scenarios, such as high-level semantic communication and low-level image reconstruction. The decoder, which enables the recovery of the visual information, benefits from the scalable coding based upon the semantic, structure, and signal layers. Qualitative and quantitative results demonstrate that the SCMC can convey accurate semantic and perceptual information of images, especially at low bitrates, and promising rate-distortion performance has been achieved compared to state-of-the-art methods. The code will be available online https://github.com/ppingzhang/SCMC.

引用

页码：4441 / 4445

页数：5

共 50 条

[1] Cross-modality semantic guidance for multi-label image classification
Huang, Jun
Wang, Dian
Hong, Xudong
Qu, Xiwen
Xue, Wei
INTELLIGENT DATA ANALYSIS, 2024, 28 (03) : 633 - 646
[2] Representation Learning for Cross-Modality Classification
van Tulder, Gijs
de Bruijne, Marleen
MEDICAL COMPUTER VISION AND BAYESIAN AND GRAPHICAL MODELS FOR BIOMEDICAL IMAGING, 2017, 10081 : 126 - 136
[3] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
Zeng, Guodong
Lerch, Till D.
Schmaranzer, Florian
Zheng, Guoyan
Burger, Juergen
Gerber, Kate
Tannast, Moritz
Siebenrock, Klaus
Gerber, Nicolas
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
[4] Boosting Cross-Modality Image Registration
Barbu, Adrian
Ionasec, Razvan
2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 89 - +
[5] CROSS-MODALITY TRANSFER OF SPATIAL INFORMATION
FISHBEIN, HD
DECKER, J
WILCOX, P
BRITISH JOURNAL OF PSYCHOLOGY, 1977, 68 (NOV) : 503 - 508
[6] Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation
Chen, Xu
Lian, Chunfeng
Wang, Li
Deng, Hannah
Kuang, Tianshu
Fung, Steve
Gateno, Jaime
Yap, Pew-Thian
Xia, James J.
Shen, Dinggang
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (01) : 274 - 285
[7] Cross-Modality Transfer Learning for Image-Text Information Management
Niu, Shuteng
Jiang, Yushan
Chen, Bowen
Wang, Jian
Liu, Yongxin
Song, Houbing
ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (01)
[8] Representation Learning Through Cross-Modality Supervision
Sankaran, Nishant
Mohan, Deen Dayal
Setlur, Srirangaraj
Govindaraju, Venugopal
Fedorishin, Dennis
2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 107 - 114
[9] Semantic Scalable Image Compression with Cross-Layer Priors
Tu, Hanyue
Li, Li
Zhou, Wengang
Li, Houqiang
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4044 - 4052
[10] MRI Cross-Modality Image-to-Image Translation
Yang, Qianye
Li, Nannan
Zhao, Zixu
Fan, Xingyu
Chang, Eric I-Chao
Xu, Yan
SCIENTIFIC REPORTS, 2020, 10 (01)

← 1 2 3 4 5 →