Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

被引:5
|
作者
Zhang, Pingping [1 ]
Wang, Shiqi [1 ,2 ]
Wang, Meng [1 ]
Li, Jiguo [3 ]
Wang, Xu [4 ,5 ]
Kwong, Sam [1 ,2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
[3] Univ Chinese Acad Sci, Inst Comp Technol, Beijing 100049, Peoples R China
[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[5] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic image compression; cross-modality; scalable coding;
D O I
10.1109/TCSVT.2023.3241225
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This article proposes the scalable cross-modality compression (SCMC) paradigm, in which the image compression problem is further cast into a representation task by hierarchically sketching the image with different modalities. Herein, we adopt the conceptual organization philosophy to model the overwhelmingly complicated visual patterns, based upon the semantic, structure, and signal level representation accounting for different tasks. The SCMC paradigm that incorporates the representation at different granularities supports diverse application scenarios, such as high-level semantic communication and low-level image reconstruction. The decoder, which enables the recovery of the visual information, benefits from the scalable coding based upon the semantic, structure, and signal layers. Qualitative and quantitative results demonstrate that the SCMC can convey accurate semantic and perceptual information of images, especially at low bitrates, and promising rate-distortion performance has been achieved compared to state-of-the-art methods. The code will be available online https://github.com/ppingzhang/SCMC.
引用
收藏
页码:4441 / 4445
页数:5
相关论文
共 50 条
  • [41] Agent-Driven Generative Semantic Communication With Cross-Modality and Prediction
    Yang, Wanting
    Xiong, Zehui
    Yuan, Yanli
    Jiang, Wenchao
    Quek, Tony Q. S.
    Debbah, Merouane
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (03) : 2233 - 2248
  • [42] IMPLICIT LEARNING - WITHIN-MODALITY AND CROSS-MODALITY TRANSFER OF TACIT KNOWLEDGE
    MANZA, L
    REBER, AS
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (06) : 499 - 499
  • [43] Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization
    Zhang, Litian
    Zhang, Xiaoming
    Pan, Junshu
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11676 - 11684
  • [44] From CNNs to GANs for cross-modality medical image estimation
    Fard, Azin Shokraei
    Reutens, David C.
    Vegh, Viktor
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [45] Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion
    Chen, Shi
    Zhang, Lefei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 741 - 753
  • [46] LOCAL CROSS-MODALITY IMAGE ALIGNMENT USING UNSUPERVISED LEARNING
    BERNANDER, O
    KOCH, C
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 427 : 573 - 575
  • [47] Disentangle domain features for cross-modality cardiac image segmentation
    Pei, Chenhao
    Wu, Fuping
    Huang, Liqin
    Zhuang, Xiahai
    Medical Image Analysis, 2021, 71
  • [48] Disentangle domain features for cross-modality cardiac image segmentation
    Pei, Chenhao
    Wu, Fuping
    Huang, Liqin
    Zhuang, Xiahai
    MEDICAL IMAGE ANALYSIS, 2021, 71
  • [49] Cross-Modality Data Augmentation for Aerial Object Detection with Representation Learning
    Wei, Chiheng
    Bai, Lianfa
    Chen, Xiaoyu
    Han, Jing
    REMOTE SENSING, 2024, 16 (24)
  • [50] Cross-modality image feature fusion diagnosis in breast cancer
    Jiang, Mingkuan
    Han, Lu
    Sun, Hang
    Li, Jing
    Bao, Nan
    Li, Hong
    Zhou, Shi
    Yu, Tao
    PHYSICS IN MEDICINE AND BIOLOGY, 2021, 66 (10):