Memory-Based Neighbourhood Embedding for Visual Recognition

被引:27
|
作者
Li, Suichan [1 ,2 ,4 ]
Chen, Dapeng [3 ]
Liu, Bin [1 ,2 ]
Yu, Nenghai [1 ,2 ]
Zhao, Rui [3 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Beijing, Peoples R China
[3] SenseTime Res, Beijing, Peoples R China
[4] SenseTime, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.00620
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning discriminative image feature embeddings is of great importance to visual recognition. To achieve better feature embeddings, most current methods focus on designing different network structures or loss functions, and the estimated feature embeddings are usually only related to the input images. In this paper, we propose Memory-based Neighbourhood Embedding (MNE) to enhance a general CNN feature by considering its neighbourhood. The method aims to solve two critical problems, i.e., how to acquire more relevant neighbours in the network training and how to aggregate the neighbourhood information for a more discriminative embedding. We first augment an episodic memory module into the network, which can provide more relevant neighbours for both training and testing. Then the neighbours are organized in a tree graph with the target instance as the root node. The neighbourhood information is gradually aggregated to the root node in a bottom-up manner, and aggregation weights are supervised by the class relationships between the nodes. We apply MNE on image search and few shot learning tasks. Extensive ablation studies demonstrate the effectiveness of each component, and our method significantly outperforms the state-of-the-art approaches.
引用
收藏
页码:6101 / 6110
页数:10
相关论文
共 50 条
  • [1] Matching Video Net: Memory-based embedding for video action recognition
    Kim, Daesik
    Lee, Myunggi
    Kwak, Nojun
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 432 - 438
  • [2] Memory-based learning for visual odometry
    Roberts, Richard
    Nguyen, Hai
    Krishnamurthi, Niyant
    Balch, Tucker
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 47 - 52
  • [3] Memory-Based Jitter: Improving Visual Recognition on Long-Tailed Data with Diversity in Memory
    Liu, Jialun
    Li, Wenhui
    Sun, Yifan
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1720 - 1728
  • [4] Face Recognition Based on Neighbourhood Discriminant Preserving Embedding
    Teoh, Andrew Beng Jin
    Han, Pang Ying
    [J]. 2008 10TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION: ICARV 2008, VOLS 1-4, 2008, : 428 - +
  • [5] Towards Implicit Visual Memory-Based Authentication
    Castelluccia, Claude
    Duermuth, Markus
    Golla, Maximilian
    Deniz, Fatma
    [J]. 24TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2017), 2017,
  • [6] MEMORY-BASED AUTOMATICITY IN THE DISCRIMINATION OF VISUAL NUMEROSITY
    LASSALINE, ME
    LOGAN, GD
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1993, 19 (03) : 561 - 581
  • [7] The effect of visual salience on memory-based choices
    Pooresmaeili, Arezoo
    Bach, Dominik R.
    Dolan, Raymond J.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2014, 111 (03) : 481 - 487
  • [8] Neighbourhood Discriminant Embedding in face recognition
    Pang, Ying Han
    Teoh, Andrew B. J.
    Kiong, Wong Eng
    [J]. IEICE ELECTRONICS EXPRESS, 2008, 5 (24): : 1036 - 1041
  • [9] Memory-based preattentional processing in visual mismatch negativity
    Liu Tongran
    Shi Jiannong
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 793 - 793
  • [10] Involvement of memory-based change detection in visual distraction
    Kimura, Motohiro
    Katayama, Jun'ichi
    Murohashi, Harumitsu
    [J]. PSYCHOPHYSIOLOGY, 2007, 44 : S98 - S98