Dissecting the Workload of Cloud Storage System

被引:2
|
作者
Ren, Yaodanjun [1 ]
Sun, Xiaoyi [1 ]
Li, Kaishi [1 ]
Lin, Jiale [1 ]
Feng, Shuzhi [2 ]
Ren, Zhenyu [2 ]
Yin, Jian [2 ]
Qi, Zhengwei [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Alibaba Grp, Shanghai, Peoples R China
关键词
cloud storage workloads; workload characterization; network measurement; cloud services;
D O I
10.1109/ICDCS54860.2022.00068
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The innovation and evolution of file and storage systems have been influenced by workload analysis. Though cloud storage systems have been widely deployed and used, real-world and large-scale cloud storage workload studies are rare. Previous large-scale distributed storage systems can meet versatility, stability, and reliability requirements. Furthermore, modern cloud storage systems need to meet additional challenges, such as coping with surges in peak loads and rapid expansion of requests. These changes may lead to different characteristics. In this work, we propose DiTing data tracing system and collect workloads with over 242,000 billion requests from the Alibaba cloud. By comparing the normal days and the Single's Day (the world's largest online shopping festival), we analyze characteristics such as I/O scale, latency, locality, and load distribution. Our analysis reveals four key observations as follows. First, the virtual layer is the performance bottleneck of modern cloud storage systems during extreme peak periods. Second, the write operations dominate the data access because the application and operating system buffers absorb reads better than writes. Third, the workload is heavily skewed toward a small percentage of virtual cloud disks, with 20% of cloud disks accounting for 80% of I/0 requests. Finally, data access shows poor temporal and spatial locality, and the I/O requests are mostly small-scaled. Based on these observations, we propose several suggestions for cloud storage systems, including separating I/O processing from the virtual layer to the proxy layer, deploying heavy and light workload applications on the same node, and adopting a write-friendly cloud disk design for write-skewed requests, etc. In summary, these workload characteristics and suggestions are useful for designing and implementing next-generation cloud storage systems.
引用
收藏
页码:647 / 657
页数:11
相关论文
共 50 条
  • [41] File Deduplication with Cloud Storage File System
    Ku, Chan-I
    Luo, Guo-Heng
    Chang, Che-Pin
    Yuan, Shyan-Ming
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 280 - 287
  • [42] Audinet: A Decentralized Auditing System for Cloud Storage
    Yan, Meng
    Xu, Jiajia
    Marbach, Trent G.
    Li, Haitao
    Wang, Gang
    Liu, Xiaoguang
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS 2020), 2020, : 215 - 224
  • [43] Torus Networking for Exascale Cloud Storage System
    Park, Jeongsook
    Kim, Hongyeon
    Kim, Youngkyun
    [J]. 2015 9TH INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING (FGCN), 2015, : 37 - 40
  • [44] Design and Implementation of cloud storage security system
    Zhou, Meirong
    Jiang, Zhengtao
    [J]. ADVANCES IN MANUFACTURING TECHNOLOGY, PTS 1-4, 2012, 220-223 : 2325 - 2329
  • [45] Design of Cloud Data Storage and Processing System
    Zhou, Baoke
    Chou, Wusheng
    [J]. 2018 INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2018), 2018, : 6 - 9
  • [46] On construction of a distributed data storage system in cloud
    Chao-Tung Yang
    Wen-Chung Shih
    Chih-Lin Huang
    Fuu-Cheng Jiang
    William Cheng-Chung Chu
    [J]. Computing, 2016, 98 : 93 - 118
  • [47] Data Replica Placement in Cloud Storage System
    Zhang Tao
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL WORKSHOP ON CLOUD COMPUTING AND INFORMATION SECURITY (CCIS 2013), 2013, 52 : 551 - 554
  • [48] Storage System for Point Cloud Geographic Data
    Svalec, Marian
    Takac, Lubos
    Zabovsky, Michal
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION ENGINEERING (CSIE 2015), 2015, : 304 - 310
  • [49] Dynamic Secure Storage System in Cloud Services
    Rathanam, G. Jeeva
    Sumalatha, M. R.
    [J]. 2014 INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2014,
  • [50] A Secure Hierarchical Deduplication System in Cloud Storage
    Yao, Xin
    Lin, Yaping
    Liu, Qin
    Zhang, Yanchao
    [J]. 2016 IEEE/ACM 24TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2016,