Evaluating Scalable Distributed Erlang for Scalability and Reliability

被引:7
|
作者
Chechina, Natalia [1 ]
MacKenzie, Kenneth [1 ]
Thompson, Simon [3 ]
Trinder, Phil [2 ]
Boudeville, Olivier [4 ]
Fordos, Viktoria [5 ]
Hoch, Csaba [5 ]
Ghaffari, Amir [1 ]
Hernandez, Mario Moro [1 ]
机构
[1] Univ Glasgow, Glasgow G12 8QQ, Lanark, Scotland
[2] Univ Glasgow, Comp Sci, Glasgow G12 8QQ, Lanark, Scotland
[3] Univ Kent, Sch Comp, Log & Computat, Canterbury CT2 7NZ, Kent, England
[4] EDF R&D, SINETICS Dept, F-92141 Clamart, France
[5] Erlang Solut AB, H-1093 Budapest, Hungary
基金
英国工程与自然科学研究理事会;
关键词
Scalability; reliability; actors; Erlang;
D O I
10.1109/TPDS.2017.2654246
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large scale servers with hundreds of hosts and tens of thousands of cores are becoming common. To exploit these platforms software must be both scalable and reliable, and distributed actor languages like Erlang are a proven technology in this area. While distributed Erlang conceptually supports the engineering of large scale reliable systems, in practice it has some scalability limits that force developers to depart from the standard language mechanisms at scale. In earlier work we have explored these scalability limitations, and addressed them by providing a Scalable Distributed (SD) Erlang library that partitions the network of Erlang Virtual Machines (VMs) into scalable groups (s_groups). This paper presents the first systematic evaluation of SD Erlang s_groups and associated tools, and how they can be used. We present a comprehensive evaluation of the scalability and reliability of SD Erlang using three typical benchmarks and a case study. We demonstrate that s_groups improve the scalability of reliable and unreliable Erlang applications on up to 256 hosts (6,144 cores). We show that SD Erlang preserves the class-leading distributed Erlang reliability model, but scales far better than the standard model. We present a novel, systematic, and tool-supported approach for refactoring distributed Erlang applications into SD Erlang. We outline the new and improved monitoring, debugging and deployment tools for large scale SD Erlang applications. We demonstrate the scaling characteristics of key tools on systems comprising up to 10 K Erlang VMs.
引用
收藏
页码:2244 / 2257
页数:14
相关论文
共 50 条
  • [31] Evaluating Future Power Distribution System Reliability Including Distributed Generation
    Al-Muhaini, Mohammad
    Heydt, Gerald T.
    [J]. IEEE TRANSACTIONS ON POWER DELIVERY, 2013, 28 (04) : 2264 - 2272
  • [32] Evaluating Future Power Distribution System Reliability Including Distributed Generation
    AlMuhaini, Mohammad
    Heydt, Gerald
    [J]. 2014 IEEE PES GENERAL MEETING - CONFERENCE & EXPOSITION, 2014,
  • [33] Evaluating Scalability Bottlenecks by Workload Extrapolation
    Shi, Rong
    Gan, Yifan
    Wang, Yang
    [J]. 2018 IEEE 26TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2018, : 333 - 347
  • [34] EVALUATING THE SCALABILITY OF BIG DATA FRAMEWORKS
    Sanchez, David
    Solarte, Oswaldo
    Bucheli, Victor
    Ordonez, Hugo
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2018, 19 (03): : 301 - 307
  • [35] Reliability, scalability and robustness issues in IRI
    AlShaer, E
    Youssef, A
    AbdelWahab, H
    Maly, K
    Overstreet, CM
    [J]. SIXTH IEEE WORKSHOPS ON ENABLING TECHNOLOGIES: INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES, PROCEEDINGS, 1997, : 320 - 325
  • [36] Evaluating Service Scalability of Network Architectures
    Li, Qi
    Xu, Mingwei
    Xu, Ke
    Wu, Jianping
    [J]. ICN 2008: SEVENTH INTERNATIONAL CONFERENCE ON NETWORKING, PROCEEDINGS, 2008, : 434 - +
  • [37] Metaserver locality and scalability in a distributed NFS
    Hermann, Everton
    Avila, Rafael
    Navaux, Philippe
    Denneulin, Yves
    [J]. HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2006, 2007, 4395 : 145 - +
  • [38] Scalar: A Distributed Scalability Analysis Framework
    Heyman, Thomas
    Preuveneers, Davy
    Joosen, Wouter
    [J]. QUANTITATIVE EVALUATION OF SYSTEMS, QEST 2014, 2014, 8657 : 241 - 244
  • [39] ANALYSIS OF DISTRIBUTED SYSTEMS DYNAMICS WITH ERLANG PERFORMANCE LAB
    Slaski, Michal
    Turek, Wojciech
    Gil, Arkadiusz
    Szafran, Bartosz
    Paciorek, Mateusz
    Byrski, Aleksander
    [J]. COMPUTER SCIENCE-AGH, 2018, 19 (02): : 139 - 155
  • [40] Distributed Denial of Service is a Scalability Problem
    Chung, Yoo
    [J]. ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2012, 42 (01) : 69 - 71