Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

被引:5
|
作者
Zhang, Pingping [1 ]
Wang, Shiqi [1 ,2 ]
Wang, Meng [1 ]
Li, Jiguo [3 ]
Wang, Xu [4 ,5 ]
Kwong, Sam [1 ,2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
[3] Univ Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China
[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[5] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic image compression; cross-modality; scalable coding;
D O I
10.1109/TCSVT.2023.3241225
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article proposes the scalable cross-modality compression (SCMC) paradigm, in which the image compression problem is further cast into a representation task by hierarchically sketching the image with different modalities. Herein, we adopt the conceptual organization philosophy to model the overwhelmingly complicated visual patterns, based upon the semantic, structure, and signal level representation accounting for different tasks. The SCMC paradigm that incorporates the representation at different granularities supports diverse application scenarios, such as high-level semantic communication and low-level image reconstruction. The decoder, which enables the recovery of the visual information, benefits from the scalable coding based upon the semantic, structure, and signal layers. Qualitative and quantitative results demonstrate that the SCMC can convey accurate semantic and perceptual information of images, especially at low bitrates, and promising rate-distortion performance has been achieved compared to state-of-the-art methods. The code will be available online https://github.com/ppingzhang/SCMC.
引用
收藏
页码:4441 / 4445
页数:5
相关论文
共 50 条
  • [1] Cross-modality semantic guidance for multi-label image classification
    Huang, Jun
    Wang, Dian
    Hong, Xudong
    Qu, Xiwen
    Xue, Wei
    INTELLIGENT DATA ANALYSIS, 2024, 28 (03) : 633 - 646
  • [2] Representation Learning for Cross-Modality Classification
    van Tulder, Gijs
    de Bruijne, Marleen
    MEDICAL COMPUTER VISION AND BAYESIAN AND GRAPHICAL MODELS FOR BIOMEDICAL IMAGING, 2017, 10081 : 126 - 136
  • [3] Semantic Consistent Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation
    Zeng, Guodong
    Lerch, Till D.
    Schmaranzer, Florian
    Zheng, Guoyan
    Burger, Juergen
    Gerber, Kate
    Tannast, Moritz
    Siebenrock, Klaus
    Gerber, Nicolas
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT III, 2021, 12903 : 201 - 210
  • [4] Boosting Cross-Modality Image Registration
    Barbu, Adrian
    Ionasec, Razvan
    2009 JOINT URBAN REMOTE SENSING EVENT, VOLS 1-3, 2009, : 89 - +
  • [5] CROSS-MODALITY TRANSFER OF SPATIAL INFORMATION
    FISHBEIN, HD
    DECKER, J
    WILCOX, P
    BRITISH JOURNAL OF PSYCHOLOGY, 1977, 68 (NOV) : 503 - 508
  • [6] Anatomy-Regularized Representation Learning for Cross-Modality Medical Image Segmentation
    Chen, Xu
    Lian, Chunfeng
    Wang, Li
    Deng, Hannah
    Kuang, Tianshu
    Fung, Steve
    Gateno, Jaime
    Yap, Pew-Thian
    Xia, James J.
    Shen, Dinggang
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (01) : 274 - 285
  • [7] Cross-Modality Transfer Learning for Image-Text Information Management
    Niu, Shuteng
    Jiang, Yushan
    Chen, Bowen
    Wang, Jian
    Liu, Yongxin
    Song, Houbing
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (01)
  • [8] Representation Learning Through Cross-Modality Supervision
    Sankaran, Nishant
    Mohan, Deen Dayal
    Setlur, Srirangaraj
    Govindaraju, Venugopal
    Fedorishin, Dennis
    2019 14TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2019), 2019, : 107 - 114
  • [9] Semantic Scalable Image Compression with Cross-Layer Priors
    Tu, Hanyue
    Li, Li
    Zhou, Wengang
    Li, Houqiang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4044 - 4052
  • [10] MRI Cross-Modality Image-to-Image Translation
    Yang, Qianye
    Li, Nannan
    Zhao, Zixu
    Fan, Xingyu
    Chang, Eric I-Chao
    Xu, Yan
    SCIENTIFIC REPORTS, 2020, 10 (01)