SimEnc: A High-Performance Similarity-Preserving Encryption Approach for Deduplication of Encrypted Docker Images

被引:0
|
作者
Sun, Tong [1 ,2 ]
Jiang, Bowen [1 ,2 ]
Li, Borui [3 ]
Lv, Jiamei [1 ,2 ]
Gao, Yi [1 ,2 ]
Dong, Wei [1 ,2 ]
机构
[1] Zhejiang Univ, State Key Lab Blockchain & Data Secur, Coll Comp Sci, Hangzhou, Peoples R China
[2] Zhejiang Univ, Sch Software Technol, Hangzhou, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Dhaka, Bangladesh
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Encrypted Docker images are becoming increasingly popular in Docker registries for privacy. As the Docker registry is tasked with managing an increasing number of images, it becomes essential to implement deduplication to conserve storage space. However, deduplication for encrypted images is difficult because deduplication exploits identical content, while encryption tries to make all contents look random. Existing state-of-the-art works try to decompress images and perform message-locked encryption (MLE) to deduplicate encrypted images. Unfortunately, our measurements uncover two limitations in current works: (i) even minor modifications to the image content can hinder MLE deduplication, (ii) decompressing image layers would increase the size of the storage for duplicate data, and significantly compromise user pull latency and deduplication throughput. In this paper, we propose SimEnc, a high-performance similarity-preserving encryption approach for deduplication of encrypted Docker images. SimEnc is the first work that integrates the semantic hash technique into MLE to extract semantic information among layers for improving the deduplication ratio. SimEnc builds on a fast similarity space selection mechanism for flexibility. Unlike existing works completely decompressing the layer, we explore a new similarity space by Huffman decoding that achieves a better deduplication ratio and performance. Experiments show that SimEnc outperforms both the state-of-the-art encrypted serverless platform and plaintext Docker registry, reducing storage consumption by up to 261.7% and 54.2%, respectively. Meanwhile, SimEnc can surpass them in terms of pull latency.
引用
收藏
页码:615 / 630
页数:16
相关论文
共 6 条
  • [1] DupHunter: Flexible High-Performance Deduplication for Docker Registries
    Zhao, Nannan
    Albahar, Hadeel
    Abraham, Subil
    Chen, Keren
    Tarasov, Vasily
    Skourtis, Dimitrios
    Rupprecht, Lukas
    Anwar, Ali
    Butt, Ali R.
    PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 769 - 783
  • [2] An End-to-end High-performance Deduplication Scheme for Docker Registries and Docker Container Storage Systems
    Zhao, Nannan
    Lin, Muhui
    Albahar, Hadeel
    Paul, Arnab K.
    Huang, Zhijie
    Abraham, Subil
    Chen, Keren
    Tarasov, Vasily
    Skourtis, Dimitrios
    Anwar, Ali
    Butt, Ali R.
    ACM TRANSACTIONS ON STORAGE, 2024, 20 (03)
  • [3] High-Performance Encryption Algorithms for Dynamic Images Transmission
    Yang, Ying
    Xiong, Xingchuang
    Liu, Zilong
    Jin, Shangzhong
    Wang, Juan
    ELECTRONICS, 2024, 13 (01)
  • [4] High-performance reversible data hiding in encrypted images with adaptive Huffman code
    Gao, Guangyong
    Zhang, Liping
    Lin, Yuan
    Tong, Shikun
    Yuan, Chengsheng
    DIGITAL SIGNAL PROCESSING, 2023, 133
  • [5] High-Performance Confidentiality-Preserving Blockchain via GPU-Accelerated Fully Homomorphic Encryption
    Guan, Rongxin
    Shen, Tianxiang
    Wang, Sen
    Zhang, Gong
    Cui, Heming
    Qi, Ji
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, CAISE 2024, 2024, 521 : 25 - 36
  • [6] A novel similarity search approach for high-performance thin-layer chromatography (HPTLC) fingerprinting of medicinal plants
    Ebrahimi-Najafabadi, Heshmatollah
    Kazemeini, Seyed Sina
    Pasdaran, Ardalan
    Hamedi, Azadeh
    PHYTOCHEMICAL ANALYSIS, 2019, 30 (04) : 405 - 414