Image-to-Class Distance Metric Learning for Image Classification

Cited by: 0
Authors
Wang, Zhengxiang [1]
Hu, Yiqun [1]
Chia, Liang-Tien [1]
Affiliations
[1] Nanyang Technol Univ, Ctr Multimedia & Network Technol, Sch Comp Engn, Singapore 639798, Singapore
Source
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Image-To-Class (I2C) distance was first used in the Naive-Bayes Nearest-Neighbor (NBNN) classifier for image classification, where it successfully handled datasets with large intra-class variance. However, its performance relies heavily on a large number of local features in the training set and the test image, which makes nearest-neighbor (NN) search computationally expensive in the testing phase. If only a small number of local features is used to accelerate the NN search, performance is poor. In this paper, we propose a large-margin framework that improves the discrimination of I2C distance, especially for a small number of local features, by learning Per-Class Mahalanobis metrics. Our I2C distance adapts to each class by incorporating the metric learned for that class. These Per-Class metrics are learned simultaneously by formulating a convex optimization problem, with the constraint that the I2C distance from each training image to the class it belongs to should be smaller than its distance to every other class by a large margin. A gradient descent method is applied to solve this optimization problem efficiently. To further improve efficiency and performance, we also adopt the ideas of spatial pyramid restriction and of learning an I2C distance function. Experiments show that the proposed method significantly outperforms the original NBNN on several widely used image datasets, and that our best results achieve state-of-the-art performance on most of them.
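The I2C distance with a per-class metric, and the large-margin constraint described in the abstract, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The function names, the toy 2-D features, the margin value of 1.0, and the choice to search nearest neighbors under the class metric itself are all assumptions made for illustration; the abstract does not specify these details.

```python
from typing import Dict, Sequence

Vector = Sequence[float]
Matrix = Sequence[Sequence[float]]


def mahalanobis_sq(x: Vector, y: Vector, M: Matrix) -> float:
    """Squared Mahalanobis distance (x - y)^T M (x - y)."""
    d = [a - b for a, b in zip(x, y)]
    n = len(d)
    return sum(d[i] * M[i][j] * d[j] for i in range(n) for j in range(n))


def i2c_distance(image_feats: Sequence[Vector],
                 class_feats: Sequence[Vector],
                 M: Matrix) -> float:
    """Sum, over the image's local features, of the distance to the
    nearest local feature of the class (assumption: the nearest
    neighbor is searched under the class metric M)."""
    return sum(min(mahalanobis_sq(f, c, M) for c in class_feats)
               for f in image_feats)


def classify(image_feats: Sequence[Vector],
             feats_by_class: Dict[str, Sequence[Vector]],
             metric_by_class: Dict[str, Matrix]) -> str:
    """Predict the class with the smallest I2C distance."""
    return min(feats_by_class,
               key=lambda c: i2c_distance(image_feats, feats_by_class[c],
                                          metric_by_class[c]))


def hinge_loss(image_feats: Sequence[Vector],
               true_class: str,
               feats_by_class: Dict[str, Sequence[Vector]],
               metric_by_class: Dict[str, Matrix],
               margin: float = 1.0) -> float:
    """Total violation of the constraint that the distance to the true
    class is smaller than the distance to each other class by `margin`."""
    d_pos = i2c_distance(image_feats, feats_by_class[true_class],
                         metric_by_class[true_class])
    loss = 0.0
    for c, feats in feats_by_class.items():
        if c == true_class:
            continue
        d_neg = i2c_distance(image_feats, feats, metric_by_class[c])
        loss += max(0.0, margin + d_pos - d_neg)
    return loss


# Toy data: two classes with 2-D local features, identity metrics.
identity = [[1.0, 0.0], [0.0, 1.0]]
feats_by_class = {"a": [[0.0, 0.0], [1.0, 0.0]],
                  "b": [[5.0, 5.0], [6.0, 5.0]]}
metric_by_class = {"a": identity, "b": identity}
image = [[0.5, 0.1], [0.9, -0.1]]

print(classify(image, feats_by_class, metric_by_class))
print(hinge_loss(image, "a", feats_by_class, metric_by_class))
```

With identity matrices this reduces to the Euclidean I2C distance used by NBNN. In a learning setting, the per-class matrices would be updated by gradient descent to reduce the summed hinge loss over the training images while being kept positive semidefinite; that update loop is omitted here.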
Pages: 706-719
Number of pages: 14
Related papers
50 in total
  • [1] Learning Image-to-Class Distance Metric for Image Classification
    Wang, Zhengxiang
    Hu, Yiqun
    Chia, Liang-Tien
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2013, 4 (02)
  • [2] Image-to-class distance ratio: A feature filtering metric for image classification
    Tan, Shoubiao
    Liu, Li
    Peng, Chunyu
    Shao, Ling
    [J]. NEUROCOMPUTING, 2015, 165 : 211 - 221
  • [3] A Local Learning Based Image-To-Class Distance for Image Classification
    Cai, Xinyuan
    Xiao, Baihua
    Wang, Chunheng
    Zhang, Rongguo
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011 : 667 - 671
  • [4] Scene Text Character Recognition Based on Image-to-Class Distance Metric Learning
    Wang, Xiao
    Wang, Chunheng
    Xiao, Baihua
    Shi, Cunzhao
    Gao, Song
    [J]. 2014 7TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP 2014), 2014 : 721 - 725
  • [5] Saliency-aware image-to-class distances for image classification
    Peng, Peng
    Shao, Ling
    Han, Jungong
    Han, Junwei
    [J]. NEUROCOMPUTING, 2015, 166 : 337 - 345
  • [6] Class-Specific Mahalanobis Distance Metric Learning for Biological Image Classification
    Mohan, B. S. Shajee
    Sekhar, C. Chandra
    [J]. IMAGE ANALYSIS AND RECOGNITION, PT II, 2012, 7325 : 240 - 248
  • [7] Class-specific representation based distance metric learning for image set classification
    Gao, Xizhan
    Feng, Zeming
    Wei, Dong
    Niu, Sijie
    Zhao, Hui
    Dong, Jiwen
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 254
  • [8] Remote sensing image retrieval with ant colony optimization and a weighted image-to-class distance
    Ye, Famao
    Meng, Xianglong
    Dong, Meng
    Nie, Yunju
    Ge, Yun
    Chen, Xiaoyong
    [J]. Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2021, 50 (05) : 612 - 620
  • [9] A Two-Stream Network with Image-to-Class Deep Metric for Few-Shot Classification
    Gu, Qinghua
    Luo, Zhengding
    Zhu, Yuesheng
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2704 - 2711
  • [10] Face and Human Gait Recognition Using Image-to-Class Distance
    Huang, Yi
    Xu, Dong
    Cham, Tat-Jen
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (03) : 431 - 438