How Good is Good Enough? Quantifying the Effects of Training Set Quality

Cited by: 3
Authors
Swan, Benjamin [1]
Laverdiere, Melanie [1]
Yang, H. Lexie [1]
Affiliations
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
Keywords
convolutional neural networks; remote sensing; training data; building detection
DOI
10.1145/3281548.3281557
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
There is a general consensus in the neural network community that noise in training data has a negative impact on model output; however, efforts to quantify the impact of varying levels have been limited, particularly for semantic segmentation tasks. This is a question of particular importance for remote sensing applications where the cost of producing a large training set can lead to reliance on publicly available data with varying degrees of noise. This work explores the effects of different degrees and types of training label noise on a pre-trained building extraction deep learner. Quantitative and qualitative evaluations of these effects can help inform decisions about trade-offs between the cost of producing training data and the quality of model outputs. We found that, relative to the base model, models trained with small amounts of noise showed little change in precision but achieved considerable increases in recall. Conversely, as noise levels increased, both precision and recall decreased. Precision and recall both lagged behind a model trained with pristine data. These exploratory results indicate the importance of quality control for training and, more broadly, that the relationship between degrees and types of training data noise and model performance is more complex than trade-offs between precision and recall.
Pages: 47-51
Number of pages: 5