ModelSet: a dataset for machine learning in model-driven engineering

Authors
Hernandez Lopez, Jose Antonio [1]
Canovas Izquierdo, Javier Luis [2]
Sanchez Cuadrado, Jesus [1]
Affiliations
[1] Univ Murcia, Fac Informat, Murcia, Spain
[2] UOC IN3, Castelldefels, Spain
Source
SOFTWARE AND SYSTEMS MODELING | 2022 / Volume 21 / Issue 03
Keywords
Dataset; Machine learning; Model-driven engineering
DOI
10.1007/s10270-021-00929-3
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
The application of machine learning (ML) algorithms to address problems related to model-driven engineering (MDE) is currently hindered by the lack of curated datasets of software models. There are several reasons for this, including the lack of large collections of good-quality models, the difficulty of labelling models due to the required domain expertise, and the relative immaturity of the application of ML to MDE. In this work, we present ModelSet, a labelled dataset of software models intended to enable the application of ML to address software modelling problems. To create it, we have devised a method designed to facilitate the exploration and labelling of model datasets by interactively grouping similar models using off-the-shelf technologies like a search engine. We have built an Eclipse plug-in to support the labelling process, which we have used to label 5,466 Ecore meta-models and 5,120 UML models, each with its category as the main label plus additional secondary labels of interest. We have evaluated the ability of our labelling method to create meaningful groups of models in order to speed up the process, improving the effectiveness of classical clustering methods. We showcase the usefulness of the dataset by applying it in a real scenario: enhancing the MAR search engine. We use ModelSet to train models able to infer useful metadata to navigate search results. The dataset and the tooling are available at and a live version at http://modelset.github.io.
Pages: 967-986
Page count: 20