CloudSeer: Workflow Monitoring of Cloud Infrastructures via Interleaved Logs

被引:99
|
作者
Yu, Xiao [1 ]
Joshi, Pallavi [2 ]
Xu, Jianwu [2 ]
Jin, Guoliang [1 ]
Zhang, Hui [2 ]
Jiang, Guofei [2 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] NEC Labs Amer, Raleigh, NC USA
关键词
Cloud Infrastructures; Distributed Systems; Log Analysis; Workflow Monitoring;
D O I
10.1145/2954679.2872407
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Cloud infrastructures provide a rich set of management tasks that operate computing, storage, and networking resources in the cloud. Monitoring the executions of these tasks is crucial for cloud providers to promptly find and understand problems that compromise cloud availability. However, such monitoring is challenging because there are multiple distributed service components involved in the executions. CloudSeer enables effective workflow monitoring. It takes a lightweight non-intrusive approach that purely works on interleaved logs widely existing in cloud infrastructures. CloudSeer first builds an automaton for the workflow of each management task based on normal executions, and then it checks log messages against a set of automata for workflow divergences in a streaming manner. Divergences found during the checking process indicate potential execution problems, which may or may not be accompanied by error log messages. For each potential problem, CloudSeer outputs necessary context information including the affected task automaton and related log messages hinting where the problem occurs to help further diagnosis. Our experiments on OpenStack, a popular open-source cloud infrastructure, show that CloudSeer's efficiency and problem-detection capability are suitable for online monitoring.
引用
收藏
页码:489 / 502
页数:14
相关论文
共 50 条
  • [21] A Cloud Computing Based Network Monitoring and Threat Detection System for Critical Infrastructures
    Chen, Zhijiang
    Xu, Guobin
    Mahalingam, Vivek
    Ge, Linqiang
    James Nguyen
    Yu, Wei
    Lu, Chao
    BIG DATA RESEARCH, 2016, 3 : 10 - 23
  • [22] FEDARGOS-V1: A Monitoring Architecture for Federated Cloud Computing Infrastructures
    Nzanzu, Vingi Patrick
    Adetiba, Emmanuel
    Badejo, Joke A.
    Molo, Mbasa Joaquim
    Akanle, Matthew Boladele
    Mughole, Kalimumbalo Daniella
    Akande, Victor
    Oshin, Oluwadamilola
    Oguntosin, Victoria
    Takenga, Claude
    Mbaye, Maissa
    Diongue, Dame
    Adebiyi, Ezekiel F.
    IEEE ACCESS, 2022, 10 : 133557 - 133573
  • [23] Elastic Cloud Computing Infrastructures in the Open Cirrus Testbed Implemented via Eucalyptus
    Baun, Christian
    Kunze, Marcel
    MANAGED GRIDS AND CLOUD SYSTEMS IN THE ASIA-PACIFIC RESEARCH COMMUNITY, 2010, : 219 - 230
  • [24] Towards Runtime Verification via Event Stream Processing in Cloud Computing Infrastructures
    Cotroneo, Domenico
    De Simone, Luigi
    Liguori, Pietro
    Natella, Roberto
    Scibelli, Angela
    SERVICE-ORIENTED COMPUTING, ICSOC 2020, 2021, 12632 : 162 - 175
  • [25] Facilitating the monitoring and management of structural health in civil infrastructures with an Edge/Fog/Cloud architecture
    Martin, Cristian
    Garrido, Daniel
    Llopis, Luis
    Rubio, Bartolome
    Diaz, Manuel
    COMPUTER STANDARDS & INTERFACES, 2022, 81
  • [26] Towards a Multi-Model Cloud Workflow Resource Monitoring, Adaptation, and Prediction
    Serhani, Mohamed Adel
    El Kassabi, Hadeel T.
    Al-Qirim, Nabeel
    Navaz, Alramzana N.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1755 - 1762
  • [27] Using Virtual Machine Monitors to Overcome the Challenges of Monitoring and Managing Virtualized Cloud Infrastructures
    Bamiah, Mervat Adib
    Brohi, Sarfraz Nawaz
    Chuprat, Suriayati
    FOURTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2011): MACHINE VISION, IMAGE PROCESSING, AND PATTERN ANALYSIS, 2012, 8349
  • [28] CareFi: Sedentary Behavior Monitoring System via Commodity WiFi Infrastructures
    Yang, Jianfei
    Zou, Han
    Jiang, Hao
    Xie, Lihua
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (08) : 7620 - 7629
  • [29] Data velocity scaling via dynamic monitoring frequency on ultrascale infrastructures
    Mastelic, Toni
    Brandic, Ivona
    2015 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2015, : 422 - 425
  • [30] A web framework for workflow submission and monitoring via UNICORE 6 based on distributable scientific workflow templates
    Bergmann, Sandra
    Demuth, Bastian
    Sander, Volker
    UNICORE Summit 2011, Proceedings, 2011, 9 : 45 - 50