Transform Network Architectures for Deep Learning Based End-to-End Image/Video Coding in Subsampled Color Spaces

被引:10
|
作者
Egilmez, Hilmi E. [1 ]
Singh, Ankitesh K. [1 ]
Coban, Muhammed [1 ]
Karczewicz, Marta [1 ]
Zhu, Yinhao [1 ]
Yang, Yang [1 ]
Said, Amir [1 ]
Cohen, Taco S. [2 ]
机构
[1] Qualcomm Technol Inc, San Diego, CA 92121 USA
[2] Qualcomm Technol Netherlands BV, NL-1098 XH Amsterdam, Netherlands
关键词
Transforms; Encoding; Image coding; Standards; Image color analysis; Video coding; Quantization (signal); Deep learning; neural networks; transform network; data compression; image coding; video coding; color spaces; YUV; RGB;
D O I
10.1109/OJSP.2021.3092257
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most of the existing deep learning based end-to-end image/video coding (DLEC) architectures are designed for non-subsampled RGB color format. However, in order to achieve a superior coding performance, many state-of-the-art block-based compression standards such as High Efficiency Video Coding (HEVC/H.265) and Versatile Video Coding (VVC/H.266) are designed primarily for YUV 4:2:0 format, where U and V components are subsampled by considering the human visual system. This paper investigates various DLEC designs to support YUV 4:2:0 format by comparing their performance against the main profiles of HEVC and VVC standards under a common evaluation framework. Moreover, a new transform network architecture is proposed to improve the efficiency of coding YUV 4:2:0 data. The experimental results on YUV 4:2:0 datasets show that the proposed architecture significantly outperforms naive extensions of existing architectures designed for RGB format and achieves about 10% average BD-rate improvement over the intra-frame coding in HEVC.
引用
收藏
页码:441 / 452
页数:12
相关论文
共 50 条
  • [1] A deep learning network based end-to-end image composition
    Zhu, Xiaoyu
    Wang, Haodi
    Zhang, Zhiyi
    Wu, Xiuping
    Guo, Junqi
    Wu, Hao
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101
  • [2] An End-to-End Deep Generative Network for Low Bitrate Image Coding
    Pei, Yifei
    Liu, Ying
    Ling, Nam
    Ren, Yongxiong
    Liu, Lingzhi
    [J]. 2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [3] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [4] An End-to-End Video Steganography Network Based on a Coding Unit Mask
    Chai, Huanhuan
    Li, Zhaohong
    Li, Fan
    Zhang, Zhenzhen
    [J]. ELECTRONICS, 2022, 11 (07)
  • [5] An End-to-End Image Dehazing Method Based on Deep Learning
    Zhang, Yi
    Huang, Hongbing
    Liu, Junyi
    Fan, Chao
    Wang, Yanyan
    Cai, Qing
    Ruan, Yingying
    Gong, Xiaojin
    [J]. 2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [6] Learning-based End-to-End Video Compression Using Predictive Coding
    de Oliveira, Matheus C.
    Martins, Luiz G. R.
    Jung, Henrique Costa
    Guerin Jr, Nilson Donizete
    da Silva, Renam Castro
    Peixoto, Eduardo
    Macchiavello, Bruno
    Hung, Edson M.
    Testoni, Vanessa
    Freitas, Pedro Garcia
    [J]. 2021 34TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2021), 2021, : 160 - 167
  • [7] Satellite selection with an end-to-end deep learning network
    Huang, Panpan
    Rizos, Chris
    Roberts, Craig
    [J]. GPS SOLUTIONS, 2018, 22 (04)
  • [8] Satellite selection with an end-to-end deep learning network
    Panpan Huang
    Chris Rizos
    Craig Roberts
    [J]. GPS Solutions, 2018, 22
  • [9] Avoiding normalization uncertainties in deep learning architectures for end-to-end communication
    Bos, Simon
    Vinogradov, Evgenii
    Pollin, Sofie
    [J]. 2021 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING (IEEE MEDITCOM 2021), 2021, : 485 - 487
  • [10] End-to-end deep multispectral image compression based on interspectral prediction network
    Kong, Fanqiang
    Meng, Yuxin
    Li, Dan
    Hu, Kedi
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2022, 16 (03)