CloudSeer: Workflow Monitoring of Cloud Infrastructures via Interleaved Logs

Cited by: 99
Authors
Yu, Xiao [1]
Joshi, Pallavi [2]
Xu, Jianwu [2]
Jin, Guoliang [1]
Zhang, Hui [2]
Jiang, Guofei [2]
Affiliations
[1] North Carolina State Univ, Raleigh, NC 27695 USA
[2] NEC Labs Amer, Raleigh, NC USA
Keywords
Cloud Infrastructures; Distributed Systems; Log Analysis; Workflow Monitoring
DOI
10.1145/2954679.2872407
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Cloud infrastructures provide a rich set of management tasks that operate computing, storage, and networking resources in the cloud. Monitoring the executions of these tasks is crucial for cloud providers to promptly find and understand problems that compromise cloud availability. However, such monitoring is challenging because there are multiple distributed service components involved in the executions. CloudSeer enables effective workflow monitoring. It takes a lightweight non-intrusive approach that purely works on interleaved logs widely existing in cloud infrastructures. CloudSeer first builds an automaton for the workflow of each management task based on normal executions, and then it checks log messages against a set of automata for workflow divergences in a streaming manner. Divergences found during the checking process indicate potential execution problems, which may or may not be accompanied by error log messages. For each potential problem, CloudSeer outputs necessary context information including the affected task automaton and related log messages hinting where the problem occurs to help further diagnosis. Our experiments on OpenStack, a popular open-source cloud infrastructure, show that CloudSeer's efficiency and problem-detection capability are suitable for online monitoring.
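The checking step described in the abstract can be illustrated with a minimal sketch. It is not CloudSeer's actual algorithm: it assumes each task workflow is a strictly linear sequence of message templates and that every log message carries a request identifier, which are simplifications introduced here. The names `WORKFLOWS`, `Instance`, and `check_stream`, and the message templates, are hypothetical.

```python
from dataclasses import dataclass, field

# Hypothetical workflows learned from normal runs:
# task name -> ordered list of message templates (assumed linear).
WORKFLOWS = {
    "boot_vm": ["api: accept boot", "scheduler: pick host", "compute: spawn vm"],
    "delete_vm": ["api: accept delete", "compute: destroy vm"],
}


@dataclass
class Instance:
    task: str
    position: int = 0                         # index of the next expected template
    seen: list = field(default_factory=list)  # messages consumed so far (context)


def check_stream(messages, workflows=WORKFLOWS):
    """Consume (request_id, template) pairs one at a time; return divergence reports."""
    active, reports = {}, []
    for req, template in messages:
        inst = active.get(req)
        if inst is None:
            # Start an instance for the task whose first template matches.
            task = next((t for t, seq in workflows.items() if seq[0] == template), None)
            if task is None:
                reports.append((req, None, "unexpected message", [template]))
                continue
            inst = active[req] = Instance(task)
        expected = workflows[inst.task][inst.position]
        if template != expected:
            # Divergence: report the affected task and the related messages.
            reports.append((req, inst.task, f"expected {expected!r}", inst.seen + [template]))
            del active[req]
            continue
        inst.seen.append(template)
        inst.position += 1
        if inst.position == len(workflows[inst.task]):
            del active[req]  # workflow completed normally
    # Instances still open when the stream ends never completed.
    for req, inst in active.items():
        reports.append((req, inst.task, "incomplete", inst.seen))
    return reports


# Two tasks interleaved in one stream; request r1 diverges at its last step.
stream = [
    ("r1", "api: accept boot"),
    ("r2", "api: accept delete"),
    ("r1", "scheduler: pick host"),
    ("r2", "compute: destroy vm"),
    ("r1", "compute: destroy vm"),
]
reports = check_stream(stream)
```

Each report carries the affected task and the messages seen so far, mirroring the context information the abstract mentions. Note that no error-level message is needed to flag `r1`: the divergence itself is the signal. An online monitor has no end of stream, so the final "incomplete" pass would need another criterion, such as a timeout, which this sketch leaves out.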
Pages: 489-502
Page count: 14