Few-shot Food Recognition via Multi-view Representation Learning

被引:26
|
作者
Jiang, Shuqiang [1 ]
Min, Weiqing [1 ]
Lyu, Yongqiang [2 ,3 ]
Liu, Linhu [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, 6 Kexueyuan South Rd, Beijing 100190, Peoples R China
[2] Qingdao KingAgroot Precis Agr Technol Co Ltd, Qingdao, Peoples R China
[3] Shandong Reebow Automat Equipment Co LTD, Qingdao Branch, Room 1901,Bldg 5, Qingdao, Shandong, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Food recognition; few-shot learning; visual recognition; deep learning;
D O I
10.1145/3391624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article considers the problem of few-shot learning for food recognition. Automatic food recognition can support various applications, e.g., dietary assessment and food journaling. Most existing works focus on food recognition with large numbers of labelled samples, and fail to recognize food categories with few samples. To address this problem, we propose a Multi-View Few-Shot Learning (MVFSL) framework to explore additional ingredient information for few-shot food recognition. Besides category-oriented deep visual features, we introduce ingredient-supervised deep network to extract ingredient-oriented features. As general and intermediate attributes of food, ingredient-oriented features are informative and complementary to category-oriented features, and thus they play an important role in improving food recognition. Particularly in few-shot food recognition, ingredient information can bridge the gap between disjoint training categories and test categories. To take advantage of ingredient information, we fuse these two kinds of features by first combining their feature maps from their respective deep networks and then convolving combined feature maps. Such convolution is further incorporated into a multi-view relation network, which is capable of comparing pairwise images to enable fine-grained feature learning. MVFSL is trained in an end-to-end fashion for joint optimization on two types of feature learning subnetworks and relation subnetworks. Extensive experiments on different food datasets have consistently demonstrated the advantage of MVFSL in multi-view feature fusion. Furthermore, we extend another two types of networks, namely, Siamese Network and Matching Network, by introducing ingredient information for few-shot food recognition. Experimental results have also shown that introducing ingredient information into these two networks can improve the performance of few-shot food recognition.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] CONTAINER: Few-Shot Named Entity Recognition via Contrastive Learning
    Das, Sarkar Snigdha Sarathi
    Katiyar, Arzoo
    Passonneau, Rebecca J.
    Zhang, Rui
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 6338 - 6353
  • [22] CONTRASTIVE REPRESENTATION FOR FEW-SHOT VEHICLE FOOTPRINT RECOGNITION
    Wang, Yongxiong
    Hu, Chuanfei
    Wang, Guangpeng
    Lin, Xu
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2021,
  • [23] Multi-level Metric Learning for Few-Shot Image Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 243 - 254
  • [24] Few-Shot Named Entity Recognition via Meta-Learning
    Li, Jing
    Chiu, Billy
    Feng, Shanshan
    Wang, Hao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4245 - 4256
  • [25] Multi-label Few-shot Learning for Sound Event Recognition
    Cheng, Kai-Hsiang
    Chou, Szu-Yu
    Yang, Yi-Hsuan
    [J]. 2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [26] Learning Compositional Representations for Few-Shot Recognition
    Tokmakov, Pavel
    Wang, Yu-Xiong
    Hebert, Martial
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6381 - 6390
  • [27] Multimodal Few-Shot Learning for Gait Recognition
    Moon, Jucheol
    Nhat Anh Le
    Minaya, Nelson Hebert
    Choi, Sang-Il
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 15
  • [28] Iris recognition based on few-shot learning
    Lei, Songze
    Dong, Baihua
    Li, Yonggang
    Xiao, Feng
    Tian, Feng
    [J]. COMPUTER ANIMATION AND VIRTUAL WORLDS, 2021, 32 (3-4)
  • [29] Meta-free few-shot learning via representation learning with weight averaging
    Chen, Kuilin
    Lee, Chi-Guhn
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [30] Improving Few-Shot Learning Through Multi-task Representation Learning Theory
    Bouniot, Quentin
    Redko, Ievgen
    Audigier, Romaric
    Loesch, Angelique
    Habrard, Amaury
    [J]. COMPUTER VISION, ECCV 2022, PT XX, 2022, 13680 : 435 - 452