Dissecting the Workload of Cloud Storage System

被引:2
|
作者
Ren, Yaodanjun [1 ]
Sun, Xiaoyi [1 ]
Li, Kaishi [1 ]
Lin, Jiale [1 ]
Feng, Shuzhi [2 ]
Ren, Zhenyu [2 ]
Yin, Jian [2 ]
Qi, Zhengwei [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Alibaba Grp, Shanghai, Peoples R China
关键词
cloud storage workloads; workload characterization; network measurement; cloud services;
D O I
10.1109/ICDCS54860.2022.00068
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The innovation and evolution of file and storage systems have been influenced by workload analysis. Though cloud storage systems have been widely deployed and used, real-world and large-scale cloud storage workload studies are rare. Previous large-scale distributed storage systems can meet versatility, stability, and reliability requirements. Furthermore, modern cloud storage systems need to meet additional challenges, such as coping with surges in peak loads and rapid expansion of requests. These changes may lead to different characteristics. In this work, we propose DiTing data tracing system and collect workloads with over 242,000 billion requests from the Alibaba cloud. By comparing the normal days and the Single's Day (the world's largest online shopping festival), we analyze characteristics such as I/O scale, latency, locality, and load distribution. Our analysis reveals four key observations as follows. First, the virtual layer is the performance bottleneck of modern cloud storage systems during extreme peak periods. Second, the write operations dominate the data access because the application and operating system buffers absorb reads better than writes. Third, the workload is heavily skewed toward a small percentage of virtual cloud disks, with 20% of cloud disks accounting for 80% of I/0 requests. Finally, data access shows poor temporal and spatial locality, and the I/O requests are mostly small-scaled. Based on these observations, we propose several suggestions for cloud storage systems, including separating I/O processing from the virtual layer to the proxy layer, deploying heavy and light workload applications on the same node, and adopting a write-friendly cloud disk design for write-skewed requests, etc. In summary, these workload characteristics and suggestions are useful for designing and implementing next-generation cloud storage systems.
引用
收藏
页码:647 / 657
页数:11
相关论文
共 50 条
  • [1] Workload balancing and adaptive resource management for the swift storage system on cloud
    Wang, Zhenhua
    Chen, Haopeng
    Fu, Ying
    Liu, Delin
    Ban, Yunmeng
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 51 : 120 - 131
  • [2] Characterization of a Big Data Storage Workload in the Cloud
    Talluri, Sacheendra
    Luszczak, Alicja
    Abad, Cristina L.
    Iosup, Alexandru
    [J]. PROCEEDINGS OF THE 2019 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '19), 2019, : 33 - 44
  • [3] Workload-aware storage policies for cloud object storage
    Chen, Yu
    Tong, Wei
    Feng, Dan
    Wang, Zike
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 163 : 232 - 247
  • [4] Workload models and performance evaluation of cloud storage services
    Goncalves, Glauber D.
    Drago, Idilio
    Vieira, Alex B.
    Couto da Silva, Ana Paula
    Almeida, Jussara M.
    Mellia, Marco
    [J]. COMPUTER NETWORKS, 2016, 109 : 183 - 199
  • [5] Optimal Workload and Energy Storage Management for Cloud Data Centers
    Guo, Yuanxiong
    Fang, Yuguang
    Khargonekar, Pramod P.
    [J]. 2013 IEEE MILITARY COMMUNICATIONS CONFERENCE (MILCOM 2013), 2013, : 1850 - 1855
  • [6] Client Layer Becomes Bottleneck: Workload Analysis of an Ultra-large-scale Cloud Storage System
    Sun, Xiaoyi
    Li, Kaishi
    Ren, Yaodanjun
    Lin, Jiale
    Ren, Zhenyu
    Feng, Shuzhi
    Jian, Yin
    Qi, Zhengwei
    [J]. COMPANION PROCEEDINGS OF THE 14TH IEEE/ACM INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC'21 COMPANION), 2021,
  • [7] Latency-Sensitive Data Allocation and Workload Consolidation for Cloud Storage
    Yang, Song
    Wieder, Philipp
    Aziz, Muzzamil
    Yahyapour, Ramin
    Fu, Xiaoming
    Chen, Xu
    [J]. IEEE ACCESS, 2018, 6 : 76098 - 76110
  • [8] Cloud Storage Workload Characterization: An Approach with Time-Series Analysis
    Adegboyega, Abiola
    [J]. 2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1090 - 1091
  • [9] Characteristics analysis of aggregate workload on storage system
    Wang, Min
    Xu, Lu
    [J]. SNAPI 2007: FOURTH INTERNATIONAL WORKSHOP ON STORAGE NETWORK ARCHITECTURE AND PARALLEL I/OS, PROCEEDINGS, 2007, : 84 - 89
  • [10] Queries Based Workload Management System for the Cloud Environment
    Maghawry, Eman A.
    Ismail, Rasha M.
    Badr, Nagwa L.
    Tolba, Mohamed Fahmy
    [J]. ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, AMLTA 2014, 2014, 488 : 77 - 86