Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets

被引:11
|
作者
Liao, Yuan-Hong [1 ,2 ]
Kar, Amlan [1 ,2 ,3 ]
Fidler, Sanja [1 ,2 ,3 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Vector Inst, Toronto, ON, Canada
[3] NVIDIA, Santa Clara, CA USA
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/CVPR46437.2021.00433
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data is the engine of modern computer vision, which necessitates collecting large-scale datasets. This is expensive, and guaranteeing the quality of the labels is a major challenge. In this paper, we investigate efficient annotation strategies for collecting multi-class classification labels for a large collection of images. While methods that exploit learnt models for labeling exist, a surprisingly prevalent approach is to query humans for a fixed number of labels per datum and aggregate them, which is expensive. Building on prior work on online joint probabilistic modeling of human annotations and machine-generated beliefs, we propose modifications and best practices aimed at minimizing human labeling effort. Specifically, we make use of advances in self-supervised learning, view annotation as a semi-supervised learning problem, identify and mitigate pitfalls and ablate several key design choices to propose effective guidelines for labeling. Our analysis is done in a more realistic simulation that involves querying human labelers, which uncovers issues with evaluation using existing worker simulation methods. Simulated experiments on a 125k image subset of the ImageNet100 show that it can be annotated to 80% top-1 accuracy with 0.35 annotations per image on average, a 2.7x and 6.7x improvement over prior work and manual annotation, respectively.(1)
引用
收藏
页码:4348 / 4357
页数:10
相关论文
共 50 条
  • [1] Towards Good Practice in Large-Scale Learning for Image Classification
    Perronnin, Florent
    Akata, Zeynep
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3482 - 3489
  • [2] Towards Good Practices for Multi-modal Fusion in Large-Scale Video Classification
    Liu, Jinlai
    Yuan, Zehuan
    Wang, Changhu
    [J]. COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 287 - 296
  • [3] Good Practice in Large-Scale Learning for Image Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (03) : 507 - 520
  • [4] Towards algorithmic analytics for large-scale datasets
    Bzdok, Danilo
    Nichols, Thomas E.
    Smith, Stephen M.
    [J]. NATURE MACHINE INTELLIGENCE, 2019, 1 (07) : 296 - 306
  • [5] Towards algorithmic analytics for large-scale datasets
    Danilo Bzdok
    Thomas E. Nichols
    Stephen M. Smith
    [J]. Nature Machine Intelligence, 2019, 1 : 296 - 306
  • [6] Parallel incremental power mean SVM for the classification of large-scale image datasets
    Thanh-Nghi Doan
    Thanh-Nghi Do
    Poulet, Francois
    [J]. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2014, 3 (02) : 89 - 96
  • [7] Iterative Classification for Sanitizing Large-Scale Datasets
    Li, Bo
    Vorobeychik, Yevgeniy
    Li, Muqun
    Malin, Bradley
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 841 - 846
  • [8] Scalable Iterative Classification for Sanitizing Large-Scale Datasets
    Li, Bo
    Vorobeychik, Yevgeniy
    Li, Muqun
    Malin, Bradley
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (03) : 698 - 711
  • [9] Problems in Large-Scale Image Classification
    Guo, Yuchen
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5038 - 5039
  • [10] Exploring Large-scale Public Medical Image Datasets
    Oakden-Rayner, Luke
    [J]. ACADEMIC RADIOLOGY, 2020, 27 (01) : 106 - 112