Are We Training with The Right Data? Evaluating Collective Confidence in Training Data using Dempster Shafer Theory

被引:0
|
作者
Dey, Sangeeta [1 ]
Lee, Seok-Won [1 ]
机构
[1] Ajou Univ, Suwon, Gyeonggi Do, South Korea
基金
新加坡国家研究基金会;
关键词
data uncertainty; safety; machine learning; Dempster Shafer theory;
D O I
10.1145/3510455.3512779
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The latest trend of incorporating various data-centric machine learning (ML) models in software-intensive systems has posed new challenges in the quality assurance practice of software engineering, especially in a high-risk environment. ML experts are now focusing on explaining ML models to assure the safe behavior of ML-based systems. However, not enough attention has been paid to explain the inherent uncertainty of the training data. The current practice of ML-based system engineering lacks transparency in the systematic fitness assessment process of the training data before engaging in the rigorous ML model training. We propose a method of assessing the collective confidence in the quality of a training dataset by using Dempster Shafer theory and its modified combination rule (Yager's rule). With the example of training datasets for pedestrian detection of autonomous vehicles, we demonstrate how the proposed approach can be used by the stakeholders with diverse expertise to combine their beliefs in the quality arguments and evidences about the data. Our results open up a scope of future research on data requirements engineering that can facilitate evidence-based data assurance for ML-based safety-critical systems.
引用
收藏
页码:11 / 15
页数:5
相关论文
共 50 条
  • [1] Using Dempster Shafer theory to aggregate usability study data
    Iourinski, D
    Ramalingam, S
    [J]. Third International Conference on Information Technology and Applications, Vol 1, Proceedings, 2005, : 429 - 434
  • [2] Integrated Data Fusion Using Dempster-Shafer Theory
    Zhang, Yang
    Zeng, Qing-An
    Liu, Yun
    Shen, Bo
    [J]. 2015 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE THEORY, SYSTEMS AND APPLICATIONS (CCITSA 2015), 2015, : 98 - 103
  • [3] EDDY CURRENT AND ULTRASOUND DATA FUSION USING DEMPSTER - SHAFER THEORY
    Bruma, Alina
    Iftimie, Nicoleta
    Steigmann, Rozina
    Savin, Adriana
    Grimberg, Raimond
    [J]. EUROPEAN NDT DAYS IN PRAGUE 2007: NDT IN PROGRESS, PROCEEDINGS, 2007, : 33 - +
  • [4] Decision making in data fusion using Dempster-Shafer's theory
    Rombaut, M
    Cherfaoui, V
    [J]. INTELLIGENT COMPONENTS AND INSTRUMENTS FOR CONTROL APPLICATIONS 1997 (SICICA'97), 1997, : 339 - 343
  • [5] Data classification using the Dempster-Shafer method
    Chen, Qi
    Whitbrook, Amanda
    Aickelin, Uwe
    Roadknight, Chris
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2014, 26 (04) : 493 - 517
  • [6] Reliable Classifications with Guaranteed Confidence Using the Dempster-Shafer Theory of Evidence
    Kempkes, Marie C.
    Dunjko, Vedran
    van Nieuwenburg, Evert
    Spiegelberg, Jakob
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 89 - 105
  • [7] Fusing evidences from intracranial pressure data using Dempster-Shafer theory
    Conte, R.
    Longo, M.
    Allai-Ano, S.
    Matta, V.
    Velardi, E.
    [J]. PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 159 - +
  • [8] Multi-scale data fusion using Dempster-Shafer evidence theory
    Le Hégarat-Mascle, S
    Richard, D
    Ottlé, C
    [J]. IGARSS 2002: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM AND 24TH CANADIAN SYMPOSIUM ON REMOTE SENSING, VOLS I-VI, PROCEEDINGS: REMOTE SENSING: INTEGRATING OUR VIEW OF THE PLANET, 2002, : 911 - 913
  • [9] Multi-scale data fusion using Dempster-Shafer evidence theory
    Le Hégarat-Mascle, S
    Richard, D
    Ottlé, C
    [J]. INTEGRATED COMPUTER-AIDED ENGINEERING, 2003, 10 (01) : 9 - 22
  • [10] Data fusion using improved Dempster-Shafer evidence theory for vehicle detection
    Zhao, Wentao
    Fang, Tao
    Jiang, Yan
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 487 - 491