Unveiling Clusters of Events for Alert and Incident Management in Large-Scale Enterprise IT

被引:22
|
作者
Lin, Derek [1 ]
Raghu, Rashmi [1 ]
Ramamurthy, Vivek [1 ]
Yu, Jin [2 ]
Radhakrishnan, Regunathan [1 ]
Fernandez, Joseph [3 ]
机构
[1] Pivotal Software Inc, 3495 Deer Creek Rd, Palo Alto, CA 94304 USA
[2] Pivotal Software Inc, Melbourne, Vic, Australia
[3] Visa Inc, Foster City, CA USA
关键词
Hierarchical clustering; Connected Components; Graph cut; Complete Linkage; kd-tree; Non-Negative Matrix Factorization; Tickets Analysis; Alerts and Incidents management; PARTS;
D O I
10.1145/2623330.2623360
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large enterprise IT (Information Technology) infrastructure components generate large volumes of alerts and incident tickets. These are manually screened, but it is otherwise difficult to extract information automatically from them to gain insights in order to improve operational efficiency. We propose a framework to cluster alerts and incident tickets based on the text in them, using unsupervised machine learning. This would be a step towards eliminating manual classification of the alerts and incidents, which is very labor intense and costly. Our framework can handle the semi-structured text in alerts generated by IT infrastructure components such as storage devices, network devices, servers etc., as well as the unstructured text in incident tickets created manually by operations support personnel. After text pre-processing and application of appropriate distance metrics, we apply different graph-theoretic approaches to cluster the alerts and incident tickets, based on their semi-structured and unstructured text respectively. For automated interpretation and read-ability on semi-structured text clusters, we propose a method to visualize clusters that preserves the structure and human-readability of the text data as compared to traditional word clouds where the text structure is not preserved; for unstructured text clusters, we find a simple way to define prototypes of clusters for easy interpretation. This framework for clustering and visualization will enable enterprises to prioritize the issues in their IT infrastructure and improve the reliability and availability of their services.
引用
收藏
页码:1630 / 1639
页数:10
相关论文
共 50 条
  • [41] Network Control for Large-Scale Container Clusters
    Zhang, Weiqi
    Wang, Baosheng
    Deng, Wenping
    Zeng, Hao
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS (WASA 2018), 2018, 10874 : 827 - 833
  • [44] Large-Scale Structure studies with clusters of galaxies
    Nichol, RC
    TRACING COSMIC EVOLUTION WITH GALAXY CLUSTERS, PROCEEDINGS, 2002, 268 : 57 - 68
  • [45] The alignment of clusters using large-scale simulations
    Onuora, LI
    Thomas, PA
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2000, 319 (02) : 614 - 618
  • [46] LARGE-SCALE DISPERSION OF CLUSTERS OF PARTICLES IN ATMOSPHERE
    KAO, SK
    ALGAIN, AA
    JOURNAL OF THE ATMOSPHERIC SCIENCES, 1968, 25 (02) : 214 - +
  • [47] LARGE-SCALE STRUCTURE - EVOLUTION OF GALAXY CLUSTERS
    HENRY, JP
    NATURE, 1995, 377 (6544) : 13 - 13
  • [48] Deep Learning on Large-scale Muticore Clusters
    Sakiyama, Kazumasa
    Kato, Shinpei
    Ishikawa, Yutaka
    Hori, Atsushi
    Monrroy, Abraham
    2018 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2018), 2018, : 314 - 321
  • [49] A large-scale bulk flow of galaxy clusters
    Hudson, MJ
    Smith, RJ
    Lucey, JR
    Schlegel, DJ
    Davies, RL
    ASTROPHYSICAL JOURNAL, 1999, 512 (02): : L79 - L82
  • [50] Recreating a Large-Scale BGP Incident in a Realistic Environment
    Karaarslan, Enis
    Perez, Andres Garcia
    Siaterlis, Christos
    INFORMATION SCIENCES AND SYSTEMS 2013, 2013, 264 : 349 - 357