Semantic-Aware Visual Decomposition for Image Coding

被引:3
|
作者
Chang, Jianhui [1 ]
Zhang, Jian [2 ]
Li, Jiguo [3 ]
Wang, Shiqi [4 ]
Mao, Qi [5 ]
Jia, Chuanmin [1 ]
Ma, Siwei [1 ]
Gao, Wen [1 ]
机构
[1] Peking Univ, Natl Engn Res Ctr Visual Technol, Sch Comp Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
基金
中国国家自然科学基金;
关键词
Image coding; Semantic-aware visual decomposition; Structure-texture; Coherency regularization; Extremely low bitrate;
D O I
10.1007/s11263-023-01809-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel image coding framework with semantic-aware visual decomposition towards extremely low bitrate compression. In particular, an input image is analyzed into a semantic map as structural representation and semantic-wise texture representation and further compressed into bitstreams at the encoder side. On the decoder side, the received bitstreams of dual-layer representations are decoded and reconstructed for target image synthesis with generative models. Moreover, the attention mechanism is introduced into the model architecture for texture representation modeling and a coherency regularization is proposed to further optimize the texture representation space by aligning the representation space with the source pixel space for higher synthesis quality. Besides, we also propose a cross-channel entropy module and control the quantization scale to facilitate rate-distortion optimization. Upon compressing the decomposed components into the bitstream, the simple yet effective representation philosophy benefits image compression in many aspects. First, in terms of compression performance, compact representations, and high visual synthesis quality can bring remarkable advantages. Second, the proposed framework yields a physically explainable bitstream composed of the structural segment and semantic-wise texture segments. Third and most importantly, subsequent vision tasks (e.g., content manipulation) can receive fundamental support from the semantic-aware visual decomposition and synthesis mechanism. Extensive experimental results demonstrate the superiority of the proposed framework towards efficient visual representation learning, high efficiency image compression (< 0.1 bpp), and intelligent visual applications (e.g., manipulation and analysis).
引用
收藏
页码:2333 / 2355
页数:23
相关论文
共 50 条
  • [41] Learning Semantic-Aware Local Features for Long Term Visual Localization
    Fan, Bin
    Zhou, Junjie
    Feng, Wensen
    Pu, Huayan
    Yang, Yuzhu
    Kong, Qingqun
    Wu, Fuchao
    Liu, Hongmin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4842 - 4855
  • [42] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [43] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [44] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [45] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74 (74)
  • [46] SDTP: Semantic-Aware Decoupled Transformer Pyramid for Dense Image Prediction
    Li, Zekun
    Liu, Yufan
    Li, Bing
    Feng, Bailan
    Wu, Kebin
    Peng, Chengwei
    Hu, Weiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6160 - 6173
  • [47] Semantic-Aware Feature Aggregation for Few-Shot Image Classification
    Fusheng Hao
    Fuxiang Wu
    Fengxiang He
    Qieshi Zhang
    Chengqun Song
    Jun Cheng
    Neural Processing Letters, 2023, 55 : 6595 - 6609
  • [48] Semantic-Aware Feature Aggregation for Few-Shot Image Classification
    Hao, Fusheng
    Wu, Fuxiang
    He, Fengxiang
    Zhang, Qieshi
    Song, Chengqun
    Cheng, Jun
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6595 - 6609
  • [49] ObjectBook Construction for Large-Scale Semantic-Aware Image Retrieval
    Zhang, Shiliang
    Tian, Qi
    Huang, Qingming
    Gao, Wen
    2011 IEEE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2011,
  • [50] SDPT: Semantic-Aware Dimension-Pooling Transformer for Image Segmentation
    Cao, Hu
    Chen, Guang
    Zhao, Hengshuang
    Jiang, Dongsheng
    Zhang, Xiaopeng
    Tian, Qi
    Knoll, Alois
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 15934 - 15946