Good Practice in Large-Scale Learning for Image Classification

Cited by: 104
Authors
Akata, Zeynep [1]
Perronnin, Florent [1,2]
Harchaoui, Zaid [2]
Schmid, Cordelia [2]
Affiliations
[1] Xerox Res Ctr Europe, F-38240 Meylan, Isere, France
[2] INRIA Grenoble Rhone Alpes, F-38330 Montbonnot St Martin, Isere, France
Keywords
Large scale; fine-grained visual categorization; image classification; ranking; SVM; stochastic learning; classifiers
DOI
10.1109/TPAMI.2013.146
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We benchmark several SVM objective functions for large-scale image classification. We consider one-versus-rest, multiclass, ranking, and weighted approximate ranking SVMs. A comparison of online and batch methods for optimizing the objectives shows that online methods perform as well as batch methods in terms of classification accuracy, but with a significant gain in training speed. Using stochastic gradient descent, we can scale the training to millions of images and thousands of classes. Our experimental evaluation shows that ranking-based algorithms do not outperform the one-versus-rest strategy when a large number of training examples are used. Furthermore, the gap in accuracy between the different algorithms shrinks as the dimension of the features increases. We also show that learning through cross-validation the optimal rebalancing of positive and negative examples can result in a significant improvement for the one-versus-rest strategy. Finally, early stopping can be used as an effective regularization strategy when training with online algorithms. Following these "good practices," we were able to improve the state of the art on a large subset of 10K classes and 9M images of ImageNet from 16.7 percent Top-1 accuracy to 19.1 percent.
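As a rough illustration of three practices named in the abstract (one-versus-rest SVMs trained with stochastic gradient descent, rebalancing of positive and negative examples, and early stopping), the following is a minimal NumPy sketch. It is not the authors' implementation. The function names `train_ovr_sgd` and `predict`, the `pos_weight` loss weight used here to stand in for rebalancing, the `patience` rule, and all default values are assumptions made for this example; in practice `pos_weight` would be chosen by cross-validation.

```python
import numpy as np


def predict(W, X):
    """Return the class whose one-versus-rest score is highest."""
    return np.argmax(X @ W.T, axis=1)


def train_ovr_sgd(X, y, n_classes, pos_weight=1.0, lr=0.1, lam=1e-4,
                  epochs=20, X_val=None, y_val=None, patience=3, seed=0):
    """Train one linear SVM per class with SGD on a weighted hinge loss.

    pos_weight (hypothetical name) scales the loss of positive examples
    relative to negatives. If a validation set is given, training stops
    once validation accuracy has not improved for `patience` epochs.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = np.zeros((n_classes, d))
    best_W, best_acc, bad_epochs = W.copy(), -1.0, 0

    for _ in range(epochs):
        for i in rng.permutation(n):
            x = X[i]
            # Binary targets: +1 for the true class, -1 for all others.
            t = -np.ones(n_classes)
            t[y[i]] = 1.0
            margins = t * (W @ x)
            # Per-class example weight: positives weighted by pos_weight.
            w = np.where(t > 0, pos_weight, 1.0)
            # Classifiers whose hinge loss is non-zero on this example.
            active = margins < 1.0
            # Step on lam/2 * ||W||^2 plus the weighted hinge loss.
            W *= 1.0 - lr * lam
            W += lr * ((active * w * t)[:, None] * x[None, :])

        if X_val is not None:
            acc = np.mean(predict(W, X_val) == y_val)
            if acc > best_acc:
                best_W, best_acc, bad_epochs = W.copy(), acc, 0
            else:
                bad_epochs += 1
                if bad_epochs >= patience:
                    break  # early stopping

    return best_W if X_val is not None else W
```

With a held-out split, a call such as `train_ovr_sgd(X_train, y_train, n_classes, pos_weight=2.0, X_val=X_val, y_val=y_val)` returns the weights from the epoch with the best validation accuracy, so the number of epochs acts as the regularizer rather than a carefully tuned `lam`.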
Pages: 507-520
Number of pages: 14
Related papers
50 in total
  • [1] Towards Good Practice in Large-Scale Learning for Image Classification
    Perronnin, Florent
    Akata, Zeynep
    Harchaoui, Zaid
    Schmid, Cordelia
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3482 - 3489
  • [2] Large-Scale Image Classification Using Active Learning
    Alajlan, Naif
    Pasolli, Edoardo
    Melgani, Farid
    Franzoso, Andrea
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (01) : 259 - 263
  • [3] Incremental Learning of Random Forests for Large-Scale Image Classification
    Ristin, Marko
    Guillaumin, Matthieu
    Gall, Juergen
    Van Gool, Luc
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (03) : 490 - 503
  • [4] A Deep Multiview Active Learning for Large-Scale Image Classification
    Yao, Tuozhong
    Wang, Wenfeng
    Gu, Yuhong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [5] Incremental Learning of NCM Forests for Large-Scale Image Classification
    Ristin, Marko
    Guillaumin, Matthieu
    Gall, Juergen
    Van Gool, Luc
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3654 - 3661
  • [6] Learning Compact Visual Attributes for Large-Scale Image Classification
    Su, Yu
    Jurie, Frederic
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 51 - 60
  • [7] Hierarchical learning of large-margin metrics for large-scale image classification
    Lei, Hao
    Mei, Kuizhi
    Xin, Jingmin
    Dong, Peixiang
    Fan, Jianping
    NEUROCOMPUTING, 2016, 208 : 46 - 58
  • [8] Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets
    Liao, Yuan-Hong
    Kar, Amlan
    Fidler, Sanja
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4348 - 4357
  • [9] Problems in Large-Scale Image Classification
    Guo, Yuchen
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5038 - 5039
  • [10] Deep Multi-Task Learning for Large-Scale Image Classification
    Kuang, Zhenzhong
    Li, Zongmin
    Zhao, Tianyi
    Fan, Jianping
    2017 IEEE THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2017), 2017, : 310 - 317