A Two-Level Auto-Encoder for Distributed Stereo Coding

被引：0

作者：

Harel, Yuval ^{[1
]}

Avidan, Shai ^{[1
]}

机构：

[1] Tel Aviv Univ, Sch Elect Engn, IL-69978 Tel Aviv, Israel

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL PHOTOGRAPHY (ICCP) | 2022年

关键词：

Image Compression; Deep Neural Networks; Distributed Stereo Coding; Computational Photography; COMPRESSION; VIDEO; INFORMATION;

D O I：

10.1109/ICCP54855.2022.9887724

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We propose a new technique for stereo image compression that is based on Distributed Source Coding (DSC). In our setting, two cameras transmit their image back to a processing unit. Naively doing so requires each camera to compress and transmit its image independently. However, the images are correlated because they observe the same scene, and our goal is to take advantage of this fact. In our solution, one camera, assume the left camera, sends its image to the processing unit, as before. The right camera, on the other hand, transmits its image conditioned on the left image, even though the two cameras do not communicate. The processing unit can then decode the right image, using the left image. The solution is based on a two level Auto-Encoder (AE). During training, the first level AE learns a standard single image compression code. The second level AE further compresses the code of the right image, conditioned on the code of the left image. During inference, the left camera uses the first level AE to transmit its image to the processing unit. The right camera, on the other hand, uses the encoders of both levels to transmit its code to the processing unit. The processing unit uses the top level decoder to recover the left image, and the decoders of both levels, as well as the recovered left image, to recover the right image. The system achieves state of the art results in image compression on several popular datasets.

引用

页数：11

共 50 条

[21] HRTF Representation with Convolutional Auto-encoder
Chen, Wei
Hu, Ruimin
Wang, Xiaochen
Li, Dengshi
MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 605 - 616
[22] Auto-encoder based dimensionality reduction
Wang, Yasi
Yao, Hongxun
Zhao, Sicheng
NEUROCOMPUTING, 2016, 184 : 232 - 242
[23] Variational Auto-Encoder for text generation
Hu, Haojin
Liao, Mengfan
Mao, Weiming
Liu, Wei
Zhang, Chao
Jing, Yanmei
PROCEEDINGS OF 2020 IEEE 5TH INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE (ITOEC 2020), 2020, : 595 - 598
[24] Deep auto-encoder based clustering
Song, Chunfeng
Huang, Yongzhen
Liu, Feng
Wang, Zhenyu
Wang, Liang
INTELLIGENT DATA ANALYSIS, 2014, 18 : S65 - S76
[25] Cramer-Wold Auto-Encoder
Knop, Szymon
Spurek, Przemyslaw
Tabor, Jacek
Podolak, Igor
Mazur, Marcin
Jastrzebski, Stanislaw
JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
[26] CONTRASTIVE AUTO-ENCODER FOR PHONEME RECOGNITION
Zheng, Xin
Wu, Zhiyong
Meng, Helen
Cai, Lianhong
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[27] Cramer-wold auto-encoder
Knop, Szymon
Spurek, Przemyslaw
Tabor, Jacek
Podolak, Igor
Mazur, Marcin
Jastrzebski, Stanislaw
1600, Microtome Publishing (21):
[28] Feature Selection Guided Auto-Encoder
Wang, Shuyang
Ding, Zhengming
Fu, Yun
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2725 - 2731
[29] A two-stage intrusion detection system with auto-encoder and LSTMs
Mushtaq, Earum
Zameer, Aneela
Umer, Muhammad
Abbasi, Asima Akber
APPLIED SOFT COMPUTING, 2022, 121
[30] Unsupervised object-level video summarization with online motion auto-encoder
Zhang, Yujia
Liang, Xiaodan
Zhang, Dingwen
Tan, Min
Xing, Eric P.
PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 376 - 385

← 1 2 3 4 5 →