Understanding Data Characteristics and Access Patterns in a Cloud Storage System

Cited by: 25
Authors
Liu, Songbin [1]
Huang, Xiaomeng [1]
Fu, Haohuan [1]
Yang, Guangwen [1]
Affiliations
[1] Tsinghua University, Ministry of Education Key Laboratory for Earth System Modeling, Beijing 100084, People's Republic of China
Keywords
Cloud Storage; File System; Data Characteristic; Access Pattern
DOI
10.1109/CCGrid.2013.11
Chinese Library Classification number
TP301 [Theory and Methods]
Discipline classification code
081202
Abstract
Understanding inherent system characteristics is crucial to the design and optimization of cloud storage systems, yet few studies have systematically investigated their data characteristics and access patterns. This paper presents an analysis of a file system snapshot and a five-month access trace of a campus cloud storage system that has been deployed on the Tsinghua campus for three years. The system provides online storage and data sharing services for more than 19,000 students and 500 student groups. We report several data characteristics, including file size and file type, as well as several access patterns, including read/write ratio, read-write dependency, and daily traffic. We find many differences between the cloud storage system and traditional file systems: our cloud storage system has larger file sizes, a lower read/write ratio, and a smaller set of active files than a typical traditional file system. With a trace-driven simulation, we find that cache efficiency can be improved by a factor of five using the guidance from our observations.
Pages: 327-334
Number of pages: 8