Metric learning based object recognition and retrieval

被引:14
|
作者
Yang, Jianyu [1 ]
Xu, Haoran [1 ]
机构
[1] Soochow Univ, Sch Urban Rail Transportat, 8 Jixue Rd, Suzhou, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Metric learning; Object recognition; Object retrieval; Robot learning; Intelligent analysis; NONRIGID SHAPES; REPRESENTATION; ROBUST; DESCRIPTORS; SYSTEMS;
D O I
10.1016/j.neucom.2016.01.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and retrieval is an important topic in intelligent robotics and pattern recognition, where an effective recognition engine plays an important role. To achieve a good performance, we propose a metric learning based object recognition algorithm. To represent the invariant object features, including local shape details and global body parts, a novel multi-scale invariant descriptor is proposed. Different types of invariant features are represented in multiple scales, which makes the following metric learning algorithm effective. To reduce the effect of noise and improve the computing efficiency, an adaptive discrete contour evolution method is also proposed to extract the salient feature points of object. The recognition algorithm is explored based on metric learning method and the object features are summarized as histograms inspired from the Bag of Words (BoW). The metric learning methods are employed to learn object features according to their scales. The proposed method is invariant to rotation, scale variation, intra-class variation, articulated deformation and partial occlusion. The recognition process is fast and robust for noise. This method is evaluated on multiple benchmark datasets and the comparable experimental results indicate the effectiveness of our method. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:70 / 81
页数:12
相关论文
共 50 条
  • [21] Oracle Bone Inscriptions Image Retrieval Based on Metric Learning
    Ding, Jun
    Wang, Jiaoyan
    Aysa, Alimjan
    Xu, Xuebin
    Ubul, Kurban
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT III, 2024, 14806 : 159 - 173
  • [22] Liver Histopathological Image Retrieval Based on Deep Metric Learning
    Yang, Pengshuai
    Zhai, Yupeng
    Li, Lin
    Lv, Hairong
    Wang, Jigang
    Zhu, Chengzhan
    Jiang, Rui
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 914 - 919
  • [23] Learning Object Recognition based on Compressive Sampling
    Li, Congjian
    Cheng, Yu
    Bi, Sheng
    Cai, Yingfeng
    Xi, Ning
    2017 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (IEEE ROBIO 2017), 2017, : 2663 - 2668
  • [24] Object recognition with gradient-based learning
    LeCun, Y
    Haffner, P
    Bottou, L
    Bengio, Y
    SHAPE, CONTOUR AND GROUPING IN COMPUTER VISION, 1999, 1681 : 319 - 345
  • [25] Feedback-based metric learning for activity recognition
    Wang, Xiaofeng
    Xu, Yang
    Hu, Haixiao
    Liu, Ming
    Li, Gang
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 162
  • [26] A Metric Learning-based Image Recognition Method
    Sun, Mingsi
    Zhao, Hongwei
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (04)
  • [27] Dysarthric Speech Recognition Based on Deep Metric Learning
    Takashima, Yuki
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    INTERSPEECH 2020, 2020, : 4796 - 4800
  • [28] Learning through sharing and distributing knowledge with application to object recognition and information retrieval
    Mignon, Alexis
    Bronisz, Alban
    Le Hy, Ronan
    Mekhnacha, Kamel
    Santos, Luis
    2017 26TH IEEE INTERNATIONAL SYMPOSIUM ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2017, : 1273 - 1278
  • [29] Object recognition and matching for image retrieval
    Zhang, YJ
    Gao, YY
    Merzlykov, NS
    SECOND INTERNATION CONFERENCE ON IMAGE AND GRAPHICS, PTS 1 AND 2, 2002, 4875 : 1083 - 1089
  • [30] Local Metric Learning for Exemplar-Based Object Detection
    You, Xinge
    Li, Qiang
    Tao, Dacheng
    Ou, Weihua
    Gong, Mingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1265 - 1276