An End-to-End Mutual Enhancement Network Toward Image Compression and Semantic Segmentation

被引:0
|
作者
Chen, Junru [1 ,2 ]
Yao, Chao [3 ]
Liu, Meiqin [1 ,2 ]
Zhao, Yao [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Learning-based compression; Video Coding for Machine; Semantic segmentation;
D O I
10.1007/978-3-030-88007-1_51
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image compression is to compress image data without compromising human vision feeling. However, the information loss through the image compression process may influence the following machine vision tasks, such as object detection and semantic segmentation. How to jointly consider the human vision and the machine vision to compress images for human and machine vision tasks is still an open problem. In this paper, we provide a multi-task framework for image compression and semantic segmentation. More specifically, an end-to-end mutual enhancement network is designed to efficiently compress the given image, and simultaneously segment the semantic information. Firstly, a uniform feature learning strategy is adopted to jointly learn the features for image compression and semantic segmentation in the encoder. Moreover, a multi-scale aggregation module in the encoder is employed to enhance the semantic features. Then, by transmitting the quantified features, both the decompressed image features and the learned semantic features can be reconstructed. Finally, we decode this information for the image compression task and the semantic segmentation task. On one hand, we can utilize the decompressed semantic features to implement semantic segmentation in the decoder. On the other hand, the quality of the decompressed image can be further improved depending on the obtained semantic segmentation map. Experimental results prove that our framework is effective to simultaneously support image compression and semantic segmentation, both in the subjective and objective evaluation.
引用
收藏
页码:623 / 635
页数:13
相关论文
共 50 条
  • [1] End-to-end dilated convolution network for document image semantic segmentation
    Xu Can-hui
    Shi Cao
    Chen Yi-nong
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2021, 28 (06) : 1765 - 1774
  • [2] Crowd Counting Using End-to-End Semantic Image Segmentation
    Khan, Khalil
    Khan, Rehan Ullah
    Albattah, Waleed
    Nayab, Durre
    Qamar, Ali Mustafa
    Habib, Shabana
    Islam, Muhammad
    [J]. ELECTRONICS, 2021, 10 (11)
  • [3] Object Bounding Transformed Network for End-to-End Semantic Segmentation
    Wang, Kuan-Chung
    Wang, Chien-Yao
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3217 - 3221
  • [4] End-to-end trainable network for superpixel and image segmentation
    Wang, Kai
    Li, Liang
    Zhang, Jiawan
    [J]. Pattern Recognition Letters, 2020, 140 : 135 - 142
  • [5] End-to-end trainable network for superpixel and image segmentation
    Wang, Kai
    Li, Liang
    Zhang, Jiawan
    [J]. PATTERN RECOGNITION LETTERS, 2020, 140 : 135 - 142
  • [6] End-to-End Facial Image Compression with Integrated Semantic Distortion Metric
    He, Tianyu
    Chen, Zhibo
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [7] Correction to: An end-to-end differential network learning method for semantic segmentation
    Tai Hu
    Ming Yang
    Wanqi Yang
    Aishi Li
    [J]. International Journal of Machine Learning and Cybernetics, 2019, 10 : 1925 - 1925
  • [8] A new end-to-end network model for medical image segmentation
    Chen, Hongyou
    Xu, Zengyong
    [J]. JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2021, 24 (02): : 207 - 213
  • [9] End-to-End Phoneme Recognition using Models from Semantic Image Segmentation
    Gao, Wei
    Hashemi-Sakhtsari, Ahmad
    McDonnell, Mark D.
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] An End-to-End Deep Learning Image Compression Framework Based on Semantic Analysis
    Wang, Cheng
    Han, Yifei
    Wang, Weidong
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (17):