Dissecting the Workload of Cloud Storage System

被引:2
|
作者
Ren, Yaodanjun [1 ]
Sun, Xiaoyi [1 ]
Li, Kaishi [1 ]
Lin, Jiale [1 ]
Feng, Shuzhi [2 ]
Ren, Zhenyu [2 ]
Yin, Jian [2 ]
Qi, Zhengwei [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Alibaba Grp, Shanghai, Peoples R China
关键词
cloud storage workloads; workload characterization; network measurement; cloud services;
D O I
10.1109/ICDCS54860.2022.00068
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The innovation and evolution of file and storage systems have been influenced by workload analysis. Though cloud storage systems have been widely deployed and used, real-world and large-scale cloud storage workload studies are rare. Previous large-scale distributed storage systems can meet versatility, stability, and reliability requirements. Furthermore, modern cloud storage systems need to meet additional challenges, such as coping with surges in peak loads and rapid expansion of requests. These changes may lead to different characteristics. In this work, we propose DiTing data tracing system and collect workloads with over 242,000 billion requests from the Alibaba cloud. By comparing the normal days and the Single's Day (the world's largest online shopping festival), we analyze characteristics such as I/O scale, latency, locality, and load distribution. Our analysis reveals four key observations as follows. First, the virtual layer is the performance bottleneck of modern cloud storage systems during extreme peak periods. Second, the write operations dominate the data access because the application and operating system buffers absorb reads better than writes. Third, the workload is heavily skewed toward a small percentage of virtual cloud disks, with 20% of cloud disks accounting for 80% of I/0 requests. Finally, data access shows poor temporal and spatial locality, and the I/O requests are mostly small-scaled. Based on these observations, we propose several suggestions for cloud storage systems, including separating I/O processing from the virtual layer to the proxy layer, deploying heavy and light workload applications on the same node, and adopting a write-friendly cloud disk design for write-skewed requests, etc. In summary, these workload characteristics and suggestions are useful for designing and implementing next-generation cloud storage systems.
引用
收藏
页码:647 / 657
页数:11
相关论文
共 50 条
  • [21] Agent Based Cloud Storage System
    Hegazy, Abdel-Fattah
    Badr, Amr
    Kassab, Mohammed
    [J]. NEW ASPECTS OF APPLIED INFORMATICS, BIOMEDICAL ELECTRONICS AND INFORMATICS AND COMMUNICATION, 2010, : 240 - +
  • [22] Revamping Optimal Cloud Storage System
    Ghogare, Shraddha
    Pawar, Ambika
    Dani, Ajay
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 457 - 463
  • [23] The PocketLocker Personal Cloud Storage System
    Nandugudi, Anandatirtha
    Nuessle, Carl
    Challen, Geoffrey
    Miluzzo, Emiliano
    Chen, Yih-Farn
    [J]. 2014 6TH INTERNATIONAL CONFERENCE ON MOBILE COMPUTING, APPLICATIONS AND SERVICES (MOBICASE), 2014, : 235 - 244
  • [24] Heterogeneous Cloud Storage System for Privacy
    Lee, Woonghee
    Kim, Hwangnam
    [J]. 2014 SIXTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2014), 2014, : 193 - 198
  • [25] Building a Cloud Storage Service System
    Peng, Chengzhang
    Jiang, Zejun
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND INFORMATION APPLICATION TECHNOLOGY ESIAT 2011, VOL 10, PT A, 2011, 10 : 691 - 696
  • [26] Cloudy: A Modular Cloud Storage System
    Kossmann, Donald
    Kraska, Tim
    Loesing, Simon
    Merkli, Stephan
    Mittal, Raman
    Pfaffhauser, Flavio
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (02): : 1533 - 1536
  • [27] Unified Logging System for Monitoring Multiple Cloud Storage Providers in Cloud Storage Broker
    Sukmana, Muhammad I. H.
    Torkura, Kennedy A.
    Cheng, Feng
    Meinel, Christoph
    Graupner, Hendrik
    [J]. 2018 32ND INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2018, : 44 - 49
  • [28] An Energy-Aware Workload Balancing Method for Cloud Video Data Storage Management
    Xiong, Yonghua
    Cheng, Zhihao
    Lu, Chengda
    Wu, Min
    Jiang, Keyuan
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 7 - 12
  • [29] DST: Leveraging Delay-insensitive Workload in Cloud Storage for Smart Home Network
    Guo, Suiming
    Chen, Liang
    Chiu, Dah Ming
    [J]. 24TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS ICCCN 2015, 2015,
  • [30] The Cloud Streaming Service Migration in Cloud Video Storage System
    Tsai, Yi-Hsing
    [J]. 2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 672 - 677