Labeling Quality Problem for Large-Scale Image Recognition

被引:2
|
作者
Pilch, Agnieszka [1 ]
Maciejewski, Henryk [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Wroclaw, Poland
关键词
CNN; Realibility of deep models; Annotations of ImageNet;
D O I
10.1007/978-3-031-06746-4_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most CNN models trained on the popular ImageNet dataset are created under the assumption that a single label is used per training image. These models realize remarkable performance on the ImageNet benchmark (with top-1 scores over 90%). Despite this, recognition of several categories is not reliable, as models for these categories can be easily attacked by natural adversarial examples. We show that this effect is related to ambiguous, single labels assigned to training and testing data for these categories. The CNN models tend to learn representations based on parts of an image not related to the label/category. We analyze the labeling scheme used to annotate the popular ImageNet benchmark dataset and compare it with two recent annotation schemes - CloudVision and Real labeling schemes, which are both crowd-sourced annotation efforts. We show that these two schemes lead to a very different granularity of annotations; we also argue that new annotations schemes should not rely on the accuracy on current ImageNet benchmarks as the hint for their correctness (at the Real scheme does).
引用
收藏
页码:206 / 216
页数:11
相关论文
共 50 条
  • [1] Large-Scale Semantic Co-Labeling of Image Sets
    Alvarez, Jose M.
    Salzmann, Mathieu
    Barnes, Nick
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 501 - 508
  • [2] Large-scale simulation studies in image pattern recognition
    Ho, TK
    Baird, HS
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (10) : 1067 - 1079
  • [3] Image segmentation for large-scale subcategory flower recognition
    Angelova, Anelia
    Zhu, Shenghuo
    Lin, Yuanqing
    2013 IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION (WACV), 2013, : 39 - 45
  • [4] Large-Scale Location Recognition and the Geometric Burstiness Problem
    Sattler, Torsten
    Havlena, Michal
    Schindler, Konrad
    Pollefeys, Marc
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1582 - 1590
  • [5] Gradually Updated Neural Networks for Large-Scale Image Recognition
    Qiao, Siyuan
    Zhang, Zhishuai
    Shen, Wei
    Wang, Bo
    Yuille, Alan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [6] An high quality image scaling engine for large-scale LCD
    Xiang, Zuquan
    Zou, Xuecheng
    Liu, Zhenglin
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 621 - +
  • [7] A Feature Encoding based on Fuzzy Codebook for Large-Scale Image Recognition
    Shinomiya, Yuki
    Hoshino, Yukinobu
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 2908 - 2913
  • [8] Techniques for solving the large-scale classification problem in Chinese handwriting recognition
    Chang, Fu
    ARABIC AND CHINESE HANDWRITING RECOGNITION, 2008, 4768 : 161 - 169
  • [9] A New Approach to Large-Scale Image Recognition for Visual Search Engines
    Sezganov, Dmitry
    Porat, Moshe
    2013 5TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT), 2013, : 151 - 157
  • [10] High-Performance Large-Scale Image Recognition Without Normalization
    Brock, Andrew
    De, Soham
    Smith, Samuel L.
    Simonyan, Karen
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139