Inferring software behavioral models with MapReduce

被引:8
|
作者
Luo, Chen [1 ,2 ,4 ]
He, Fei [1 ]
Ghezzi, Carlo [3 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Key Lab Informat Syst Secur, Minist Educ,Sch Software, Beijing 100084, Peoples R China
[2] Univ Calif Irvine, Irvine, CA USA
[3] Politecn Milan, Milan, Italy
[4] Tsinghua Univ, Beijing, Peoples R China
关键词
Model inference; Parametric trace; Log analysis; MapReduce;
D O I
10.1016/j.scico.2017.04.004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the real world practice, software systems are often built without developing any explicit upfront model. This can cause serious problems that may hinder: the almost inevitable future evolution, since at best the only documentation about the software is in the form of source code comments. To address this problem, research has been focusing on automatic inference of models by applying machine learning algorithms to execution logs. However, the logs generated by a real software system may be very large and the inference algorithm can exceed the processing capacity of,a single computer. This paper proposes a scalable, general approach to the inference of behavior models that can handle large execution logs via parallel and distributed algorithms implemented using the MapReduce programming model and executed on a cluster-of interconnected execution nodes. The approach consists of two distributed phases that perform trace slicing and model synthesis. For each phase, a distributed algorithm using MapReduce is developed. With the parallel data processing capacity of MapReduce, the problem of inferring behavior models from large logs can be efficiently solved. The technique is implemented on top of Hadoop. Experiments on Amazon clusters show efficiency and scalability of our approach. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:13 / 36
页数:24
相关论文
共 50 条
  • [31] Inferring models of gene expression dynamics
    Perkins, TJ
    Hallett, M
    Glass, L
    JOURNAL OF THEORETICAL BIOLOGY, 2004, 230 (03) : 289 - 299
  • [32] Diagnostic Inferring on the Bases of Nonlinear Models
    Dabrowski, Z.
    ACTA PHYSICA POLONICA A, 2010, 118 (01) : 45 - 48
  • [33] Inferring transferable intermolecular potential models
    Ucyigitler, Sinan
    Camurdan, Mehmet C.
    Turkay, Metin
    Elliott, J. Richard
    MOLECULAR SIMULATION, 2008, 34 (02) : 147 - 154
  • [34] Inferring Physical Units in B Models
    Krings, Sebastian
    Leuschel, Michael
    SOFTWARE ENGINEERING AND FORMAL METHODS, SEFM 2013, 2013, 8137 : 137 - 151
  • [35] Inferring physical units in formal models
    Krings, Sebastian
    Leuschel, Michael
    SOFTWARE AND SYSTEMS MODELING, 2017, 16 (01): : 25 - 47
  • [36] On the desirability of models for inferring genome phylogenies
    McInerney, JO
    TRENDS IN MICROBIOLOGY, 2006, 14 (01) : 1 - 2
  • [37] Functional Models of Hadoop MapReduce with Application to Scan
    Kiminori Matsuzaki
    International Journal of Parallel Programming, 2017, 45 : 362 - 381
  • [38] Connecting MapReduce Computations to Realistic Machine Models
    Sanders, Peter
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 84 - 93
  • [39] Functional Models of Hadoop MapReduce with Application to Scan
    Matsuzaki, Kiminori
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2017, 45 (02) : 362 - 381
  • [40] Inferring progression models for CGH data
    Liu, Jun
    Bandyopadhyay, Nirmalya
    Ranka, Sanjay
    Baudis, M.
    Kahveci, Tamer
    BIOINFORMATICS, 2009, 25 (17) : 2208 - 2215