Memory-Based Neighbourhood Embedding for Visual Recognition

被引:27
|
作者
Li, Suichan [1 ,2 ,4 ]
Chen, Dapeng [3 ]
Liu, Bin [1 ,2 ]
Yu, Nenghai [1 ,2 ]
Zhao, Rui [3 ]
机构
[1] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Key Lab Electromagnet Space Informat, Beijing, Peoples R China
[3] SenseTime Res, Beijing, Peoples R China
[4] SenseTime, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV.2019.00620
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning discriminative image feature embeddings is of great importance to visual recognition. To achieve better feature embeddings, most current methods focus on designing different network structures or loss functions, and the estimated feature embeddings are usually only related to the input images. In this paper, we propose Memory-based Neighbourhood Embedding (MNE) to enhance a general CNN feature by considering its neighbourhood. The method aims to solve two critical problems, i.e., how to acquire more relevant neighbours in the network training and how to aggregate the neighbourhood information for a more discriminative embedding. We first augment an episodic memory module into the network, which can provide more relevant neighbours for both training and testing. Then the neighbours are organized in a tree graph with the target instance as the root node. The neighbourhood information is gradually aggregated to the root node in a bottom-up manner, and aggregation weights are supervised by the class relationships between the nodes. We apply MNE on image search and few shot learning tasks. Extensive ablation studies demonstrate the effectiveness of each component, and our method significantly outperforms the state-of-the-art approaches.
引用
收藏
页码:6101 / 6110
页数:10
相关论文
共 50 条
  • [31] MEMORY-BASED PARSING
    LEBOWITZ, M
    [J]. ARTIFICIAL INTELLIGENCE, 1983, 21 (04) : 363 - 404
  • [32] Memory-Based Attention Capture When Multiple Items Are Maintained in Visual Working Memory
    Hollingworth, Andrew
    Beck, Valerie M.
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2016, 42 (07) : 911 - 917
  • [33] Memory-Based Neural Network for Radar HRRP Noncooperative Target Recognition
    Jia, Ying
    Chen, Bo
    Tian, Long
    Chen, Wenchao
    Liu, Hongwei
    [J]. 2020 IEEE 11TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2020,
  • [34] FUZZY VISUAL CONTROL FOR MEMORY-BASED NAVIGATION USING THE TRIFOCAL TENSOR
    Becerra, Hector M.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2014, 20 (02): : 245 - 262
  • [35] Neighbourhood Discriminant Locally Linear Embedding in Face Recognition
    Han, Pang Ying
    Jin, Andrew Teoh Beng
    Kiong, Wong Eng
    [J]. COMPUTER GRAPHICS, IMAGING AND VISUALISATION - MODERN TECHNIQUES AND APPLICATIONS, PROCEEDINGS, 2008, : 223 - +
  • [36] Memory-based Exploration-value Evaluation Model for Visual Navigation
    Feng, Yongquan
    Xu, Liyang
    Li, Minglong
    Jin, Ruochun
    Huang, Da
    Yang, Shaowu
    Yang, Wenjing
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2011 - 2017
  • [37] Improved complete neighbourhood preserving embedding for face recognition
    Lu, Gui-Fu
    Wang, Yong
    Zou, Jian
    [J]. IET COMPUTER VISION, 2013, 7 (01) : 71 - 79
  • [38] Cross-Modality Person Re-identification with Memory-Based Contrastive Embedding
    Cheng, De
    Wang, Xiaolong
    Wang, Nannan
    Wang, Zhen
    Wang, Xiaoyu
    Gao, Xinbo
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 425 - 432
  • [39] MEMORY-BASED REASONING APPROACH FOR PATTERN-RECOGNITION OF BINARY IMAGES
    WANG, W
    IYENGAR, SS
    PATNAIK, LM
    [J]. PATTERN RECOGNITION, 1989, 22 (05) : 505 - 518
  • [40] Cognitive integration of recognition information and additional cues in memory-based decisions
    Gloeckner, Andreas
    Broeder, Arndt
    [J]. JUDGMENT AND DECISION MAKING, 2014, 9 (01): : 35 - 50