QuoTe: Quality-oriented Testing for Deep Learning Systems

Cited by: 2
Authors
Chen, Jialuo [1]
Wang, Jingyi [1]
Ma, Xingjun [2]
Sun, Youcheng [3]
Sun, Jun [4]
Zhang, Peixin [1]
Cheng, Peng [1]
Affiliations
[1] Zhejiang Univ, Hangzhou 310027, Peoples R China
[2] Fudan Univ, Shanghai 200433, Peoples R China
[3] Univ Manchester, Manchester M13 9PL, Lancs, England
[4] Singapore Management Univ, Singapore 188065, Singapore
Funding
National Key Research and Development Program;
Keywords
Deep learning; testing; robustness; fairness;
DOI
10.1145/3582573
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
Recently, there has been significant growth of interest in applying software engineering techniques to the quality assurance of deep learning (DL) systems. One popular direction is DL testing: given a property under test, defects of DL systems are found either by fuzzing or by guided search with the help of certain testing metrics. However, recent studies have revealed that the neuron coverage metrics commonly used by most existing DL testing approaches are not necessarily correlated with model quality (e.g., robustness, the most studied model property), and are also not an effective measure of confidence in the model quality after testing. In this work, we address this gap by proposing a novel testing framework called QuoTe (i.e., Quality-oriented Testing). A key part of QuoTe is a quantitative measurement of (1) the value of each test case in enhancing the model property of interest (often via retraining) and (2) the convergence quality of the model property improvement. QuoTe utilizes the proposed metric to automatically select or generate valuable test cases for improving model quality. The proposed metric is also a lightweight yet strong indicator of how well the improvement has converged. Extensive experiments on both image and tabular datasets with a variety of model architectures confirm the effectiveness and efficiency of QuoTe in improving DL model quality, specifically robustness and fairness. As a generic quality-oriented testing framework, QuoTe can be adapted in future work to other domains (e.g., text) as well as other model properties.
Pages: 33