GOGGLES: Automatic Image Labeling with Affinity Coding

Cited by: 17
Authors
Das, Nilaksh [1]
Chaba, Sanya [1]
Wu, Renzhi [1]
Gandhi, Sakshi [1]
Chau, Duen Horng [1]
Chu, Xu [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Keywords
affinity coding; probabilistic labels; data programming; weak supervision; computer vision; image labeling;
DOI
10.1145/3318464.3380592
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Generating large labeled training data is becoming the biggest bottleneck in building and deploying supervised machine learning models. Recently, the data programming paradigm has been proposed to reduce the human cost in labeling training data. However, data programming relies on designing labeling functions, which still requires significant domain expertise. Also, it is prohibitively difficult to write labeling functions for image datasets, as it is hard to express domain knowledge using raw features for images (pixels). We propose affinity coding, a new domain-agnostic paradigm for automated training data labeling. The core premise of affinity coding is that the affinity scores of instance pairs belonging to the same class should, on average, be higher than those of pairs belonging to different classes, according to some affinity functions. We build the GOGGLES system that implements affinity coding for labeling image datasets by designing a novel set of reusable affinity functions for images, and propose a novel hierarchical generative model for class inference using a small development set. We compare GOGGLES with existing data programming systems on 5 image labeling tasks from diverse domains. GOGGLES achieves labeling accuracies ranging from a minimum of 71% to a maximum of 98% without requiring any extensive human annotation. In terms of end-to-end performance, GOGGLES outperforms the state-of-the-art data programming system Snuba by 21% and a state-of-the-art few-shot learning technique by 5%, and is only 7% away from the fully supervised upper bound.
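The core premise stated in the abstract can be illustrated with a toy sketch. This is not the GOGGLES implementation: the abstract does not specify the affinity functions or the generative model, so the code below assumes cosine similarity as a stand-in affinity function and uses synthetic two-class feature vectors. All names (`cosine_affinity`, `make_toy_data`, `mean_affinities`) are hypothetical and exist only for this illustration.

```python
import math
import random
from itertools import combinations


def cosine_affinity(u, v):
    """Assumed stand-in affinity function: cosine similarity of two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def make_toy_data(n_per_class=20, dim=8, noise=0.3, seed=0):
    """Synthetic data: two classes scattered around orthogonal centers."""
    rng = random.Random(seed)
    half = dim // 2
    centers = [
        [1.0] * half + [0.0] * (dim - half),  # class 0 center
        [0.0] * half + [1.0] * (dim - half),  # class 1 center
    ]
    data = []
    for label, center in enumerate(centers):
        for _ in range(n_per_class):
            vector = [x + rng.gauss(0.0, noise) for x in center]
            data.append((vector, label))
    return data


def mean_affinities(data):
    """Average affinity over same-class pairs and over different-class pairs."""
    same, diff = [], []
    for (u, label_u), (v, label_v) in combinations(data, 2):
        bucket = same if label_u == label_v else diff
        bucket.append(cosine_affinity(u, v))
    return sum(same) / len(same), sum(diff) / len(diff)


data = make_toy_data()
same_mean, diff_mean = mean_affinities(data)
print(f"mean same-class affinity:      {same_mean:.3f}")
print(f"mean different-class affinity: {diff_mean:.3f}")
```

On data like this, the same-class mean comes out higher than the different-class mean, which is the property affinity coding relies on. In the labels themselves, the sketch plays no role in inference; it only checks the premise against known labels.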
Pages: 1717 - 1732
Page count: 16