Effect of Label Noise on the Machine-Learned Classification of Earthquake Damage

被引:27
|
作者
Frank, Jared [1 ,2 ]
Rebbapragada, Umaa [2 ]
Bialas, James [3 ]
Oommen, Thomas [3 ]
Havens, Timothy C. [3 ]
机构
[1] Cornell Univ, Dept Comp Sci, 402 Gates Hall, Ithaca, NY 14850 USA
[2] CALTECH, Jet Prop Lab, 4800 Oak Grove Dr, Pasadena, CA 91109 USA
[3] Michigan Technol Univ, 1400 Townsend Dr, Houghton, MI 49931 USA
基金
美国国家科学基金会;
关键词
machine learning; classification; crowdsourcing; earthquake damage; damage detection; GEOBIA; mislabeled training data; LAND-COVER CHANGE; HAITI EARTHQUAKE; RANDOM FORESTS; IMAGE; SATELLITE; ACCURACY;
D O I
10.3390/rs9080803
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Automated classification of earthquake damage in remotely-sensed imagery using machine learning techniques depends on training data, or data examples that are labeled correctly by a human expert as containing damage or not. Mislabeled training data are a major source of classifier error due to the use of imprecise digital labeling tools and crowdsourced volunteers who are not adequately trained on or invested in the task. The spatial nature of remote sensing classification leads to the consistent mislabeling of classes that occur in close proximity to rubble, which is a major byproduct of earthquake damage in urban areas. In this study, we look at how mislabeled training data, or label noise, impact the quality of rubble classifiers operating on high-resolution remotely-sensed images. We first study how label noise dependent on geospatial proximity, or geospatial label noise, compares to standard random noise. Our study shows that classifiers that are robust to random noise are more susceptible to geospatial label noise. We then compare the effects of label noise on both pixel- and object-based remote sensing classification paradigms. While object-based classifiers are known to outperform their pixel-based counterparts, this study demonstrates that they are more susceptible to geospatial label noise. We also introduce a new labeling tool to enhance precision and image coverage. This work has important implications for the Sendai framework as autonomous damage classification will ensure rapid disaster assessment and contribute to the minimization of disaster risk.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Machine-learned approximations to Density Functional Theory Hamiltonians
    Ganesh Hegde
    R. Chris Bowen
    Scientific Reports, 7
  • [42] Nonlocal machine-learned exchange functional for molecules and solids
    Bystrom, Kyle
    Kozinsky, Boris
    PHYSICAL REVIEW B, 2024, 110 (07)
  • [43] Machine-learned pattern identification in olfactory subtest results
    Jörn Lötsch
    Thomas Hummel
    Alfred Ultsch
    Scientific Reports, 6
  • [44] Machine-learned approximations to Density Functional Theory Hamiltonians
    Hegde, Ganesh
    Bowen, R. Chris
    SCIENTIFIC REPORTS, 2017, 7
  • [45] A theoretical case study of the generalization of machine-learned potentials
    Wang, Yangshuai
    Patel, Shashwat
    Ortner, Christoph
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2024, 422
  • [46] A FRAMEWORK FOR A GENERALIZATION ANALYSIS OF MACHINE-LEARNED INTERATOMIC POTENTIALS
    Ortner, Christoph
    Wang, Yangshuai
    MULTISCALE MODELING & SIMULATION, 2023, 21 (03): : 1053 - 1080
  • [47] Applications of machine-learned electron densities of nucleic acids
    Lee, Alex J.
    Rackers, Joshua A.
    Bricker, William P.
    BIOPHYSICAL JOURNAL, 2023, 122 (03) : 211A - 211A
  • [48] Problems in the deployment of machine-learned models in health care
    Cohen, Joseph Paul
    Cao, Tianshi
    Viviano, Joseph D.
    Huang, Chin-Wei
    Fralick, Michael
    Ghassemi, Marzyeh
    Mamdani, Muhammad
    Greiner, Russell
    Bengio, Yoshua
    CANADIAN MEDICAL ASSOCIATION JOURNAL, 2021, 193 (35) : E1391 - E1394
  • [49] Machine-learned pattern identification in olfactory subtest results
    Lotsch, Jorn
    Hummel, Thomas
    Ultsch, Alfred
    SCIENTIFIC REPORTS, 2016, 6
  • [50] Machine-learned and codified synthesis parameters of oxide materials
    Edward Kim
    Kevin Huang
    Alex Tomala
    Sara Matthews
    Emma Strubell
    Adam Saunders
    Andrew McCallum
    Elsa Olivetti
    Scientific Data, 4