An End-to-End Learning Architecture for Efficient Image Encoding and Deep Learning

被引:0
|
作者
Chamain, Lahiru D. [1 ]
Qi, Siyu [1 ]
Ding, Zhi [1 ]
机构
[1] Univ Calif Davis, Dept Elect & Comp Engn, Davis, CA 95616 USA
关键词
Grouping; end-to-end encoding; classification;
D O I
10.23919/EUSIPCO54536.2021.9616312
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Learning-based image/video codecs typically utilize the well known auto-encoder structure where the encoder transforms input data to a low-dimensional latent representation. Efficient latent encoding can reduce bandwidth needs during compression for transmission and storage. In this paper, we examine the effect of assigning high level coarse grouping labels to each latent vector. Designing coding profiles for each latent group can achieve high compression encoding. We show that such grouping can be learned via end-to-end optimization of the codec and the deep learning (DL) model to optimize rate-accuracy for a given data set. For cloud-based inference, source encoder can select a coding profile based on its learned grouping and encode the data features accordingly. Our test results on image classification show that significant performance improvement can be achieved with learned grouping over its non-grouping counterpart.
引用
收藏
页码:691 / 695
页数:5
相关论文
共 50 条
  • [1] An efficient end-to-end deep learning architecture for activity classification
    Amel Ben Mahjoub
    Mohamed Atri
    [J]. Analog Integrated Circuits and Signal Processing, 2019, 99 : 23 - 32
  • [2] An efficient end-to-end deep learning architecture for activity classification
    Ben Mahjoub, Amel
    Atri, Mohamed
    [J]. ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 99 (01) : 23 - 32
  • [3] An End-to-End Deep Learning Architecture for Graph Classification
    Zhang, Muhan
    Cui, Zhicheng
    Neumann, Marion
    Chen, Yixin
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4438 - 4445
  • [4] A deep learning network based end-to-end image composition
    Zhu, Xiaoyu
    Wang, Haodi
    Zhang, Zhiyi
    Wu, Xiuping
    Guo, Junqi
    Wu, Hao
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101
  • [5] End-to-End Learning of Deep Visual Representations for Image Retrieval
    Gordo, Albert
    Almazan, Jon
    Revaud, Jerome
    Larlus, Diane
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2017, 124 (02) : 237 - 254
  • [6] End-to-End Learning of Deep Visual Representations for Image Retrieval
    Albert Gordo
    Jon Almazán
    Jerome Revaud
    Diane Larlus
    [J]. International Journal of Computer Vision, 2017, 124 : 237 - 254
  • [7] An End-to-End Image Dehazing Method Based on Deep Learning
    Zhang, Yi
    Huang, Hongbing
    Liu, Junyi
    Fan, Chao
    Wang, Yanyan
    Cai, Qing
    Ruan, Yingying
    Gong, Xiaojin
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [8] A Multimodal End-to-End Deep Learning Architecture for Music Popularity Prediction
    Martin-Gutierrez, David
    Hernandez Penaloza, Gustavo
    Belmonte-Hernandez, Alberto
    Alvarez Garcia, Federico
    [J]. IEEE ACCESS, 2020, 8 : 39361 - 39374
  • [9] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [10] Spectrum Monitoring Based on End-to-End Learning by Deep Learning
    Mahdiyeh Rahmani
    Reza Ghazizadeh
    [J]. International Journal of Wireless Information Networks, 2022, 29 : 180 - 192