Large-Scale Image Categorization with Explicit Data Embedding

Cited by: 64
Authors
Perronnin, Florent
Sanchez, Jorge
Liu, Yan
DOI
10.1109/CVPR.2010.5539914
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Kernel machines rely on an implicit mapping of the data such that non-linear classification in the original space corresponds to linear classification in the new space. As kernel machines are difficult to scale to large training sets, it has been proposed to perform an explicit mapping of the data and to learn linear classifiers directly in the new space. In this paper, we consider the problem of learning image categorizers on large image sets (e.g. > 100k images) using bag-of-visual-words (BOV) image representations and Support Vector Machine classifiers. We experiment with three approaches to BOV embedding: 1) kernel PCA (kPCA) [15], 2) a modified kPCA we propose for additive kernels and 3) random projections for shift-invariant kernels [14]. We report experiments on three datasets: Caltech101, VOC07 and ImageNet. An important conclusion is that simply square-rooting BOV vectors, which corresponds to an exact mapping for the Bhattacharyya kernel, already leads to large improvements, often quite close to the best results obtained with additive kernels. Another conclusion is that, although it is possible to go beyond additive kernels, the embedding comes at a much higher cost.
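The claim that square-rooting a BOV vector is an exact mapping for the Bhattacharyya kernel can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes L1-normalized histograms and the kernel form k(x, y) = sum_i sqrt(x_i * y_i), and the helper names (`l1_normalize`, `bhattacharyya_kernel`, `sqrt_embedding`) and the toy counts are hypothetical, chosen only for illustration.

```python
import math


def l1_normalize(hist):
    """Scale a non-negative histogram so its entries sum to 1 (assumed preprocessing)."""
    total = sum(hist)
    if total == 0:
        return list(hist)
    return [v / total for v in hist]


def bhattacharyya_kernel(x, y):
    """Evaluate k(x, y) = sum_i sqrt(x_i * y_i) directly on the histograms."""
    return sum(math.sqrt(a * b) for a, b in zip(x, y))


def sqrt_embedding(x):
    """Explicit map phi(x): element-wise square root of the histogram."""
    return [math.sqrt(a) for a in x]


def dot(u, v):
    """Plain linear kernel (dot product) in the embedded space."""
    return sum(a * b for a, b in zip(u, v))


# Toy visual-word counts (made-up values, not taken from the paper).
x = l1_normalize([3, 0, 1, 4])
y = l1_normalize([1, 2, 0, 5])

kernel_value = bhattacharyya_kernel(x, y)
linear_value = dot(sqrt_embedding(x), sqrt_embedding(y))

# The two quantities agree up to floating-point rounding.
print(math.isclose(kernel_value, linear_value))
```

Because the embedded vectors have the same dimensionality as the input histograms, a linear classifier can be trained on `sqrt_embedding(x)` at essentially no extra cost. Under the L1-normalization assumed here, each embedded vector also has unit Euclidean norm.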
Pages: 2297-2304
Page count: 8
Related papers
50 in total
  • [1] Network of Experts for Large-Scale Image Categorization
    Ahmed, Karim
    Baig, Mohammad Haris
    Torresani, Lorenzo
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 516 - 532
  • [2] Evolutionary compact embedding for large-scale image classification
    Liu, Li
    Shao, Ling
    Li, Xuelong
    [J]. INFORMATION SCIENCES, 2015, 316 : 567 - 581
  • [3] Coupled Binary Embedding for Large-Scale Image Retrieval
    Zheng, Liang
    Wang, Shengjin
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (08) : 3368 - 3380
  • [4] DeepMCAT: Large-Scale Deep Clustering for Medical Image Categorization
    Kart, Turkay
    Bai, Wenjia
    Glocker, Ben
    Rueckert, Daniel
    [J]. DEEP GENERATIVE MODELS, AND DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, 2021, 13003 : 259 - 267
  • [5] Iterative Manifold Embedding Layer Learned by Incomplete Data for Large-Scale Image Retrieval
    Xu, Jian
    Wang, Chunheng
    Qi, Chengzuo
    Shi, Cunzhao
    Xiao, Baihua
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (06) : 1551 - 1562
  • [6] Large-Scale Embedding Learning in Heterogeneous Event Data
    Gui, Huan
    Liu, Jialu
    Tao, Fangbo
    Jiang, Meng
    Norick, Brandon
    Han, Jiawei
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 907 - 912
  • [7] InvVis: Large-Scale Data Embedding for Invertible Visualization
    Ye, Huayuan
    Li, Chenhui
    Li, Yang
    Wang, Changbo
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 1139 - 1149
  • [8] Product Embedding for Large-Scale Disaggregated Sales Data
    Li, Yinxing
    Terui, Nobuhiko
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1:, 2021, : 69 - 75
  • [9] Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook
    Zhang, Luming
    Wang, Meng
    Hong, Richang
    Yin, Bao-Cai
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (02) : 535 - 545
  • [10] Random Projection Tree and Multiview Embedding for Large-Scale Image Retrieval
    Xie, Bo
    Mu, Yang
    Song, Mingli
    Tao, Dacheng
    [J]. NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 641 - +