ModelSet: a dataset for machine learning in model-driven engineering

Cited by: 21
Authors
Hernandez Lopez, Jose Antonio [1]
Canovas Izquierdo, Javier Luis [2]
Sanchez Cuadrado, Jesus [1]
Affiliations
[1] Univ Murcia, Fac Informat, Murcia, Spain
[2] UOC IN3, Castelldefels, Spain
Source
SOFTWARE AND SYSTEMS MODELING | 2022 / Vol. 21 / Issue 03
Keywords
Dataset; Machine learning; Model-driven engineering
DOI
10.1007/s10270-021-00929-3
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
The application of machine learning (ML) algorithms to address problems related to model-driven engineering (MDE) is currently hindered by the lack of curated datasets of software models. There are several reasons for this, including the lack of large collections of good-quality models, the difficulty of labelling models due to the required domain expertise, and the relative immaturity of the application of ML to MDE. In this work, we present ModelSet, a labelled dataset of software models intended to enable the application of ML to address software modelling problems. To create it, we have devised a method designed to facilitate the exploration and labelling of model datasets by interactively grouping similar models using off-the-shelf technologies like a search engine. We have built an Eclipse plug-in to support the labelling process, which we have used to label 5,466 Ecore meta-models and 5,120 UML models, each with its category as the main label plus additional secondary labels of interest. We have evaluated the ability of our labelling method to create meaningful groups of models in order to speed up the process, improving the effectiveness of classical clustering methods. We showcase the usefulness of the dataset by applying it in a real scenario: enhancing the MAR search engine. We use ModelSet to train models able to infer useful metadata to navigate search results. The dataset and the tooling are available at and a live version at http://modelset.github.io.
Pages: 967 - 986
Page count: 20