Zero-Shot Food Image Detection Based on Transformer

被引:0
|
作者
Song, Jingru [1 ]
Min, Weiqing [2 ,3 ]
Zhou, Pengfei [2 ,3 ]
Rao, Quanrui [1 ]
Sheng, Guorui [1 ]
Yang, Yancun [1 ]
Wang, Lili [1 ]
Jiang, Shuqiang [2 ,3 ]
机构
[1] School of Information and Electrical Engineering, Ludong University, Yantai,264025, China
[2] Institute of Computing Technology, Chinese Academy of Sciences, Beijing,100190, China
[3] Key Lab of Intelligent Information Processing, Chinese Academy of Sciences, Beijing,100190, China
关键词
Food chemistry - Food ingredients;
D O I
10.13386/j.issn1002-0306.2024030027
中图分类号
学科分类号
摘要
As a fundamental task in food computing, food detection played a crucial role in locating and identifying food items from input images, particularly in applications such as intelligent canteen settlement and dietary health management. However, food categories were constantly updating in practical scenarios, making it difficult for food detectors trained on fixed categories to accurately detect previously unseen food categories. To address this issue, this paper proposed a zero-shot food image detection method. Firstly, a Transformer-based food primitive generator was constructed, where each primitive contained fine-grained attributes relevant to food categories. These primitives could be selectively assembled based on the food characteristics to synthesize new food features. Secondly, an enhancement component of visual feature disentanglement was proposed in order to impose more constraints on the visual features of unseen food categories. The visual features of food images were decomposed into semantically related features and semantically unrelated features, thereby better transferring semantic knowledge of food categories to their visual features. The proposed method was extensively evaluated on the ZSFooD and UEC-FOOD256 datasets through numerous experiments and ablation studies. Under the zero-shot detection (ZSD) setting, optimal average precision on unseen classes reached 4.9% and 24.1%, respectively, demonstrating the effectiveness of the proposed approach. Under the generalized zero-shot detection (GZSD) setting, the harmonic mean of visible and unseen classes reaches 5.8% and 22.0%, respectively, further validating the effectiveness of the proposed method. © The Author(s) 2024.
引用
收藏
页码:18 / 26
相关论文
共 50 条
  • [1] Zero-Shot Sketch Based Image Retrieval Using Graph Transformer
    Gupta, Sumrit
    Chaudhuri, Ushasi
    Banerjee, Biplab
    Kumar, Saurabh
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1685 - 1691
  • [2] Transformer-Based Zero-Shot Detection via Contrastive Learning
    Liu, Wei
    Chen, Hui
    Ma, Yongqiang
    Wang, Jianji
    Zheng, Nanning
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART I, 2022, 646 : 316 - 327
  • [3] Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection
    Liu, Wei
    Chen, Hui
    Ma, Yongqiang
    Wang, Jianji
    Zheng, Nanning
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2023, 33 (07)
  • [4] Zero-Shot Image Classification Based on Attribute
    Zhang, Wei
    Chen, Wenbai
    Chen, Xiangfeng
    Han, Hu
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 25 - 30
  • [5] Zero-Shot Image Dehazing
    Li, Boyun
    Gou, Yuanbiao
    Liu, Jerry Zitao
    Zhu, Hongyuan
    Zhou, Joey Tianyi
    Peng, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8457 - 8466
  • [6] Contour detection network for zero-shot sketch-based image retrieval
    Zhang, Qing
    Zhang, Jing
    Su, Xiangdong
    Bao, Feilong
    Gao, Guanglai
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (06) : 6781 - 6795
  • [7] Contour detection network for zero-shot sketch-based image retrieval
    Qing Zhang
    Jing Zhang
    Xiangdong Su
    Feilong Bao
    Guanglai Gao
    Complex & Intelligent Systems, 2023, 9 : 6781 - 6795
  • [8] Zero-Shot Object Detection
    Bansal, Ankan
    Sikka, Karan
    Sharma, Gaurav
    Chellappa, Rama
    Divakaran, Ajay
    COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 397 - 414
  • [9] Zero-shot image classification based on factor space
    Guan, Shijie
    Guan, Qixue
    Yin, Anqi
    International Journal of Web Engineering and Technology, 2021, 16 (01) : 1 - 29
  • [10] A Zero-Shot Framework for Sketch Based Image Retrieval
    Yelamarthi, Sasi Kiran
    Reddy, Shiva Krishna
    Mishra, Ashish
    Mittal, Anurag
    COMPUTER VISION - ECCV 2018, PT IV, 2018, 11208 : 316 - 333