A Two-Level Auto-Encoder for Distributed Stereo Coding

被引:0
|
作者
Harel, Yuval [1 ]
Avidan, Shai [1 ]
机构
[1] Tel Aviv Univ, Sch Elect Engn, IL-69978 Tel Aviv, Israel
关键词
Image Compression; Deep Neural Networks; Distributed Stereo Coding; Computational Photography; COMPRESSION; VIDEO; INFORMATION;
D O I
10.1109/ICCP54855.2022.9887724
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a new technique for stereo image compression that is based on Distributed Source Coding (DSC). In our setting, two cameras transmit their image back to a processing unit. Naively doing so requires each camera to compress and transmit its image independently. However, the images are correlated because they observe the same scene, and our goal is to take advantage of this fact. In our solution, one camera, assume the left camera, sends its image to the processing unit, as before. The right camera, on the other hand, transmits its image conditioned on the left image, even though the two cameras do not communicate. The processing unit can then decode the right image, using the left image. The solution is based on a two level Auto-Encoder (AE). During training, the first level AE learns a standard single image compression code. The second level AE further compresses the code of the right image, conditioned on the code of the left image. During inference, the left camera uses the first level AE to transmit its image to the processing unit. The right camera, on the other hand, uses the encoders of both levels to transmit its code to the processing unit. The processing unit uses the top level decoder to recover the left image, and the decoders of both levels, as well as the recovered left image, to recover the right image. The system achieves state of the art results in image compression on several popular datasets.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] HRTF Representation with Convolutional Auto-encoder
    Chen, Wei
    Hu, Ruimin
    Wang, Xiaochen
    Li, Dengshi
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 605 - 616
  • [22] Auto-encoder based dimensionality reduction
    Wang, Yasi
    Yao, Hongxun
    Zhao, Sicheng
    NEUROCOMPUTING, 2016, 184 : 232 - 242
  • [23] Variational Auto-Encoder for text generation
    Hu, Haojin
    Liao, Mengfan
    Mao, Weiming
    Liu, Wei
    Zhang, Chao
    Jing, Yanmei
    PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 595 - 598
  • [24] Deep auto-encoder based clustering
    Song, Chunfeng
    Huang, Yongzhen
    Liu, Feng
    Wang, Zhenyu
    Wang, Liang
    INTELLIGENT DATA ANALYSIS, 2014, 18 : S65 - S76
  • [25] Cramer-Wold Auto-Encoder
    Knop, Szymon
    Spurek, Przemyslaw
    Tabor, Jacek
    Podolak, Igor
    Mazur, Marcin
    Jastrzebski, Stanislaw
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [26] CONTRASTIVE AUTO-ENCODER FOR PHONEME RECOGNITION
    Zheng, Xin
    Wu, Zhiyong
    Meng, Helen
    Cai, Lianhong
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [27] Cramer-wold auto-encoder
    Knop, Szymon
    Spurek, Przemyslaw
    Tabor, Jacek
    Podolak, Igor
    Mazur, Marcin
    Jastrzebski, Stanislaw
    1600, Microtome Publishing (21):
  • [28] Feature Selection Guided Auto-Encoder
    Wang, Shuyang
    Ding, Zhengming
    Fu, Yun
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2725 - 2731
  • [29] A two-stage intrusion detection system with auto-encoder and LSTMs
    Mushtaq, Earum
    Zameer, Aneela
    Umer, Muhammad
    Abbasi, Asima Akber
    APPLIED SOFT COMPUTING, 2022, 121
  • [30] Unsupervised object-level video summarization with online motion auto-encoder
    Zhang, Yujia
    Liang, Xiaodan
    Zhang, Dingwen
    Tan, Min
    Xing, Eric P.
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 376 - 385