Tool Support for Improving Software Quality in Machine Learning Programs

Cited by: 0
Authors
Cheng, Kwok Sun [1]
Huang, Pei-Chi [1]
Ahn, Tae-Hyuk [2]
Song, Myoungkyu [1]
Affiliations
[1] Univ Nebraska Omaha, Dept Comp Sci, Omaha, NE 68182 USA
[2] St Louis Univ, Dept Comp Sci, St Louis, MO 63103 USA
Keywords
software quality; anomaly detection; quality validation; machine learning applications; ARTIFICIAL-INTELLIGENCE AI; CANCER
DOI
10.3390/info14010053
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Machine learning (ML) techniques discover knowledge from large amounts of data, and ML modeling is becoming essential to software systems in practice. ML research communities have focused on the accuracy and efficiency of ML models, while validating the quality of those models has received less attention. Validating ML applications is challenging and time-consuming for developers because prediction accuracy relies heavily on the generated models. ML applications are written in a comparatively data-driven style on top of ML frameworks that behave as black boxes, so each dataset and the ML application itself must be investigated individually. As a result, ML validation tasks take considerable time and effort. To address this limitation, we present MLVal, a novel quality validation technique that increases the reliability of ML models and applications. Our approach helps developers inspect the training data and the generated features for the ML model. Data validation is important and beneficial to software quality because the quality of the input data affects the speed and accuracy of training and inference. Inspired by software debugging and validation practices for reproducing reported bugs, MLVal takes as input an ML application and its training datasets to build the ML models, helping ML application developers easily reproduce and understand anomalies in the ML application. We have implemented MLVal as an Eclipse plugin that allows developers to validate the prediction behavior of their ML applications, the ML model, and the training data within the Eclipse IDE. In our evaluation, we used 23,500 documents in the bioengineering research domain. We assessed the ability of the MLVal validation technique to effectively help ML application developers (1) investigate the connection between the produced features and the labels in the training model, and (2) detect errors early to secure the quality of models from better data. Our approach reduces the engineering effort needed to validate problems, improving the data-centric workflows of ML application development.
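
As an illustration of the kind of training-data inspection the abstract describes, the following is a minimal sketch and not MLVal's actual implementation, which the abstract does not detail. It assumes the training data is a list of (text, label) pairs; the function names, the three anomaly kinds, and the toy documents are all hypothetical.

```python
from collections import defaultdict


def normalize(text):
    """Lowercase and collapse whitespace so near-identical documents compare equal."""
    return " ".join((text or "").lower().split())


def find_anomalies(docs):
    """Return indices of suspicious (text, label) training examples.

    Hypothetical checks, chosen only for illustration:
      empty_text        - the document has no usable text
      missing_label     - the label is None or an empty string
      conflicting_label - the same normalized text carries different labels
    """
    anomalies = {"empty_text": [], "missing_label": [], "conflicting_label": []}
    labels_by_text = defaultdict(set)
    usable = []  # (index, normalized text) for examples with text and a label

    for index, (text, label) in enumerate(docs):
        key = normalize(text)
        if not key:
            anomalies["empty_text"].append(index)
        elif label is None or label == "":
            anomalies["missing_label"].append(index)
        else:
            labels_by_text[key].add(label)
            usable.append((index, key))

    # Second pass: flag every example whose text was seen with more than one label.
    for index, key in usable:
        if len(labels_by_text[key]) > 1:
            anomalies["conflicting_label"].append(index)

    return anomalies


# Toy data, invented for this example only.
sample = [
    ("Tumor growth in mice", "cancer"),
    ("tumor  growth in MICE", "other"),
    ("", "cancer"),
    ("Protein folding study", None),
    ("Gene expression atlas", "other"),
]
report = find_anomalies(sample)
print(report)
```

Checks like these run before training, so a developer can review flagged examples instead of inferring data problems from degraded prediction accuracy afterwards.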
Pages: 20