Labeling Quality Problem for Large-Scale Image Recognition

被引:2
|
作者
Pilch, Agnieszka [1 ]
Maciejewski, Henryk [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Wroclaw, Poland
关键词
CNN; Realibility of deep models; Annotations of ImageNet;
D O I
10.1007/978-3-031-06746-4_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most CNN models trained on the popular ImageNet dataset are created under the assumption that a single label is used per training image. These models realize remarkable performance on the ImageNet benchmark (with top-1 scores over 90%). Despite this, recognition of several categories is not reliable, as models for these categories can be easily attacked by natural adversarial examples. We show that this effect is related to ambiguous, single labels assigned to training and testing data for these categories. The CNN models tend to learn representations based on parts of an image not related to the label/category. We analyze the labeling scheme used to annotate the popular ImageNet benchmark dataset and compare it with two recent annotation schemes - CloudVision and Real labeling schemes, which are both crowd-sourced annotation efforts. We show that these two schemes lead to a very different granularity of annotations; we also argue that new annotations schemes should not rely on the accuracy on current ImageNet benchmarks as the hint for their correctness (at the Real scheme does).
引用
收藏
页码:206 / 216
页数:11
相关论文
共 50 条
  • [21] High efficient framework for large-scale zero-shot image recognition
    Zhang Z.
    Liu Q.
    Guo D.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (06): : 103 - 110
  • [22] I-Nema: a large-scale microscopic image dataset for nematode recognition
    Shenglin Lu
    Sheldon Fung
    Yihao Wang
    Xuequan Lu
    Wanli Ouyang
    Xue Qing
    Hongmei Li
    Neural Computing and Applications, 2025, 37 (4) : 2763 - 2773
  • [23] Large-Scale Visual Font Recognition
    Chen, Guang
    Yang, Jianchao
    Jin, Hailin
    Brandt, Jonathan
    Shechtman, Eli
    Agarwala, Aseem
    Han, Tony X.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3598 - 3605
  • [24] Large-Scale Visual Speech Recognition
    Shillingford, Brendan
    Assael, Yannis
    Hoffman, Matthew W.
    Paine, Thomas
    Hughes, Cian
    Prabhu, Utsav
    Liao, Hank
    Sak, Hasim
    Rao, Kanishka
    Bennett, Lorrayne
    Mulville, Marie
    Denil, Misha
    Coppin, Ben
    Laurie, Ben
    Senior, Andrew
    de Freitas, Nando
    INTERSPEECH 2019, 2019, : 4135 - 4139
  • [25] Labeling algorithm for the shortest path problem with turn prohibitions with application to large-scale road networks
    Eliécer Gutiérrez
    Andrés L. Medaglia
    Annals of Operations Research, 2008, 157 : 169 - 182
  • [26] LARGE-SCALE SEMANTIC CLASSIFICATION: OUTCOME OF THE FIRST YEAR OF INRIA AERIAL IMAGE LABELING BENCHMARK
    Huang, Bohao
    Lu, Kangkang
    Audebert, Nicolas
    Khalel, Andrew
    Tarabalka, Yuliya
    Malof, Jordan
    Boulch, Alexandre
    Le Saux, Bertrand
    Collins, Leslie
    Bradbury, Kyle
    Lefevre, Sebastien
    El-Saban, Motaz
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6947 - 6950
  • [27] Labeling algorithm for the shortest path problem with turn prohibitions with application to large-scale road networks
    Gutierrez, Eliecer
    Medaglia, Andres L.
    ANNALS OF OPERATIONS RESEARCH, 2008, 157 (01) : 169 - 182
  • [28] Large-scale multi-task image labeling with adaptive relevance discovery and feature hashing
    Deng, Cheng
    Liu, Xianglong
    Mu, Yadong
    Li, Jie
    SIGNAL PROCESSING, 2015, 112 : 137 - 145
  • [29] LARGE-SCALE IMAGE-PROCESSING
    CHEN, CC
    BULLETIN OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1987, 13 (06): : 15 - 16
  • [30] THE NEW PROBLEM OF LARGE-SCALE UNEMPLOYABILITY
    ROSE, AM
    AMERICAN JOURNAL OF ECONOMICS AND SOCIOLOGY, 1964, 23 (04) : 337 - 350