Understanding Data Characteristics and Access Patterns in a Cloud Storage System

Cited by: 25
Authors
Liu, Songbin [1 ]
Huang, Xiaomeng [1 ]
Fu, Haohuan [1 ]
Yang, Guangwen [1 ]
Affiliations
[1] Tsinghua Univ, Minist Educ, Key Lab Earth Syst Modeling, Beijing 100084, Peoples R China
Keywords
Cloud Storage; File System; Data Characteristic; Access Pattern;
DOI
10.1109/CCGrid.2013.11
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Understanding the inherent system characteristics is crucial to the design and optimization of cloud storage systems, yet few studies have systematically investigated their data characteristics and access patterns. This paper presents an analysis of a file system snapshot and a five-month access trace of a campus cloud storage system that has been deployed on the Tsinghua campus for three years. The system provides online storage and data sharing services for more than 19,000 students and 500 student groups. We report several data characteristics, including file size and file type, as well as some access patterns, including read/write ratio, read-write dependency, and daily traffic. We find many differences between the cloud storage system and traditional file systems: our cloud storage system has larger file sizes, a lower read/write ratio, and a smaller set of active files than a typical traditional file system. With a trace-driven simulation, we find that cache efficiency can be improved by 5 times using the guidance from our observations.
Pages: 327-334
Page count: 8
Related papers
50 items in total
  • [41] Data Rearrange based on Mining Block Access Sequence in Cloud Storage
    Du, Hongtao
    Li, Zhanhuai
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 2507 - 2511
  • [42] Secure Data Collection, Storage, and Access in Cloud-Assisted IoT
    Wang, Wei
    Xu, Peng
    Yang, Laurence Tianruo
    IEEE CLOUD COMPUTING, 2018, 5 (04): : 77 - 88
  • [43] A heuristic storage for minimizing access time of arbitrary data patterns
    AlMouhamed, MA
    Seiden, SS
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1997, 8 (04) : 441 - 447
  • [44] Distributed Storage Algorithm for Geospatial Image Data Based on Data Access Patterns
    Pan, Shaoming
    Li, Yongkai
    Xu, Zhengquan
    Chong, Yanwen
    PLOS ONE, 2015, 10 (07):
  • [45] Research on data encryption system and technology for cloud storage
    Han P.
    Liu C.
    Wang J.
    Duan S.
    Pan H.
    Fang B.
    Tongxin Xuebao/Journal on Communications, 2020, 41 (08): : 55 - 65
  • [46] K-ear: Extracting data access periodic characteristics for energy-aware data clustering and storing in cloud storage systems
    You, Xindong
    Sun, Tian
    Sun, Dawei
    Liu, Xunyun
    Lv, Xueqiang
    Buyya, Rajkumar
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (09):
  • [47] Understanding Complex Cloud Patterns
    Linthicum, David S.
    IEEE CLOUD COMPUTING, 2016, 3 (01): : 8 - 11
  • [48] Developing Secure Cloud Storage System Using Access Control Models
    Ubale, S. A.
    Apte, S. S.
    Bokefode, J. D.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 2, 2017, 469 : 141 - 147
  • [49] Cloud Storage Monitoring System analyzing through File Access Pattern
    Devarajan, Augustus A.
    SudalaiMuthu, T.
    2019 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS 2019), 2019,
  • [50] Comments on "DAC-MACS: Effective Data Access Control for Multiauthority Cloud Storage Systems"/Security Analysis of Attribute Revocation in Multiauthority Data Access Control for Cloud Storage Systems
    Hong, Jianan
    Xue, Kaiping
    Li, Wei
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2015, 10 (06) : 1315 - 1317