Inferring software behavioral models with MapReduce

被引:8
|
作者
Luo, Chen [1 ,2 ,4 ]
He, Fei [1 ]
Ghezzi, Carlo [3 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Key Lab Informat Syst Secur, Minist Educ,Sch Software, Beijing 100084, Peoples R China
[2] Univ Calif Irvine, Irvine, CA USA
[3] Politecn Milan, Milan, Italy
[4] Tsinghua Univ, Beijing, Peoples R China
关键词
Model inference; Parametric trace; Log analysis; MapReduce;
D O I
10.1016/j.scico.2017.04.004
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the real world practice, software systems are often built without developing any explicit upfront model. This can cause serious problems that may hinder: the almost inevitable future evolution, since at best the only documentation about the software is in the form of source code comments. To address this problem, research has been focusing on automatic inference of models by applying machine learning algorithms to execution logs. However, the logs generated by a real software system may be very large and the inference algorithm can exceed the processing capacity of,a single computer. This paper proposes a scalable, general approach to the inference of behavior models that can handle large execution logs via parallel and distributed algorithms implemented using the MapReduce programming model and executed on a cluster-of interconnected execution nodes. The approach consists of two distributed phases that perform trace slicing and model synthesis. For each phase, a distributed algorithm using MapReduce is developed. With the parallel data processing capacity of MapReduce, the problem of inferring behavior models from large logs can be efficiently solved. The technique is implemented on top of Hadoop. Experiments on Amazon clusters show efficiency and scalability of our approach. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:13 / 36
页数:24
相关论文
共 50 条
  • [21] Analytical Performance Models for MapReduce Workloads
    Emanuel Vianna
    Giovanni Comarela
    Tatiana Pontes
    Jussara Almeida
    Virgílio Almeida
    Kevin Wilkinson
    Harumi Kuno
    Umeshwar Dayal
    International Journal of Parallel Programming, 2013, 41 : 495 - 525
  • [22] Statically Inferring Performance Properties of Software Configurations
    Li, Chi
    Wang, Shu
    Hoffmann, Henry
    Lu, Shan
    PROCEEDINGS OF THE FIFTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS (EUROSYS'20), 2020,
  • [23] Learning by sampling: learning behavioral family models from software product lines
    Nascimento Damasceno, Carlos Diego
    Mousavi, Mohammad Reza
    Simao, Adenilso da Silva
    EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (01)
  • [24] Learning by sampling: learning behavioral family models from software product lines
    Carlos Diego Nascimento Damasceno
    Mohammad Reza Mousavi
    Adenilso da Silva Simao
    Empirical Software Engineering, 2021, 26
  • [25] Mining Component-Based Software Behavioral Models Using Dynamic Analysis
    Lu, Ting
    Liu, Cong
    Duan, Hua
    Zeng, Qingtian
    IEEE ACCESS, 2020, 8 : 68883 - 68894
  • [26] Bridging the Gap Between Structural and Behavioral Models in a Software-Centric Environment
    Hagemann, Noel
    Bauer, Bernhard
    BUSINESS MODELING AND SOFTWARE DESIGN (BMSD 2021), 2021, 422 : 353 - 362
  • [27] Behavioral characterization: finding and using the influential factors in software process simulation models
    Houston, DX
    Ferreira, S
    Collofello, JS
    Montgomery, DC
    Mackulak, GT
    Shunk, DL
    JOURNAL OF SYSTEMS AND SOFTWARE, 2001, 59 (03) : 259 - 270
  • [28] Behavioral Software Contracts
    Findler, Robert Bruce
    ACM SIGPLAN NOTICES, 2014, 49 (09) : 137 - 138
  • [29] Behavioral Software Contracts
    Findler, Robert Bruce
    ICFP'14: PROCEEDINGS OF THE 2014 ACM SIGPLAN INTERNATIONAL CONFERENCE ON FUNCTIONAL PROGRAMMING, 2014, : 137 - 138
  • [30] Inferring physical units in formal models
    Sebastian Krings
    Michael Leuschel
    Software & Systems Modeling, 2017, 16 : 25 - 47