Distance Metric Learning for Large Margin Nearest Neighbor Classification

被引:0
|
作者
Weinberger, Kilian Q. [1 ]
Saul, Lawrence K. [2 ]
机构
[1] Yahoo Res, Santa Clara, CA USA
[2] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
convex optimization; semi-definite programming; Mahalanobis distance; metric learning; multi-class classification; support vector machines;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The accuracy of k-nearest neighbor (kNN) classification depends significantly on the metric used to compute distances between different examples. In this paper, we show how to learn a Mahalanobis distance metric for kNN classification from labeled examples. The Mahalanobis metric can equivalently be viewed as a global linear transformation of the input space that precedes kNN classification using Euclidean distances. In our approach, the metric is trained with the goal that the k-nearest neighbors always belong to the same class while examples from different classes are separated by a large margin. As in support vector machines (SVMs), the margin criterion leads to a convex optimization based on the hinge loss. Unlike learning in SVMs, however, our approach requires no modification or extension for problems in multiway (as opposed to binary) classification. In our framework, the Mahalanobis distance metric is obtained as the solution to a semidefinite program. On several data sets of varying size and difficulty, we find that metrics trained in this way lead to significant improvements in kNN classification. Sometimes these results can be further improved by clustering the training examples and learning an individual metric within each cluster. We show how to learn and combine these local metrics in a globally integrated manner.
引用
收藏
页码:207 / 244
页数:38
相关论文
共 50 条
  • [1] HIERARCHICAL DISTANCE METRIC LEARNING FOR LARGE MARGIN NEAREST NEIGHBOR CLASSIFICATION
    Sun, Shiliang
    Chen, Qiaona
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2011, 25 (07) : 1073 - 1087
  • [2] Parameter Free Large Margin Nearest Neighbor for Distance Metric Learning
    Song, Kun
    Nie, Feiping
    Han, Junwei
    Li, Xuelong
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2555 - 2561
  • [3] Margin distribution explanation on metric learning for nearest neighbor classification
    Zou, Peng-Cheng
    Wang, Jiandong
    Chen, Songcan
    Chen, Haiyan
    [J]. NEUROCOMPUTING, 2016, 177 : 168 - 178
  • [4] Modified Large Margin Nearest Neighbor Metric Learning for Regression
    Assi, Kondo C.
    Labelle, Hubert
    Cheriet, Farida
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 292 - 296
  • [5] Convergence of Multi-pass Large Margin Nearest Neighbor Metric Learning
    Goepfert, Christina
    Paassen, Benjamin
    Hammer, Barbara
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT I, 2016, 9886 : 510 - 517
  • [6] An adaptive large margin nearest neighbor classification algorithm
    Yang, Liu
    Yu, Jian
    Jing, Liping
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (11): : 2269 - 2277
  • [7] Dynamic time warping constraint learning for large margin nearest neighbor classification
    Yu, Daren
    Yu, Xiao
    Hu, Qinghua
    Liu, Jinfu
    Wu, Anqi
    [J]. INFORMATION SCIENCES, 2011, 181 (13) : 2787 - 2796
  • [8] Deep Metric Learning for K Nearest Neighbor Classification
    Liao, Tingting
    Lei, Zhen
    Zhu, Tianqing
    Zeng, Shan
    Li, Yaqin
    Yuan, Cao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 264 - 275
  • [9] Generative Local Metric Learning for Nearest Neighbor Classification
    Noh, Yung-Kyun
    Zhang, Byoung-Tak
    Lee, Daniel D.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 106 - 118
  • [10] BoostML: An Adaptive Metric Learning for Nearest Neighbor Classification
    Zaidi, Nayyar Abbas
    Squire, David McG
    Suter, David
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PROCEEDINGS, 2010, 6118 : 142 - +