Zero-Shot Food Image Detection Based on Transformer

被引:0
|
作者
Song, Jingru [1 ]
Min, Weiqing [2 ,3 ]
Zhou, Pengfei [2 ,3 ]
Rao, Quanrui [1 ]
Sheng, Guorui [1 ]
Yang, Yancun [1 ]
Wang, Lili [1 ]
Jiang, Shuqiang [2 ,3 ]
机构
[1] School of Information and Electrical Engineering, Ludong University, Yantai,264025, China
[2] Institute of Computing Technology, Chinese Academy of Sciences, Beijing,100190, China
[3] Key Lab of Intelligent Information Processing, Chinese Academy of Sciences, Beijing,100190, China
关键词
Food chemistry - Food ingredients;
D O I
10.13386/j.issn1002-0306.2024030027
中图分类号
学科分类号
摘要
As a fundamental task in food computing, food detection played a crucial role in locating and identifying food items from input images, particularly in applications such as intelligent canteen settlement and dietary health management. However, food categories were constantly updating in practical scenarios, making it difficult for food detectors trained on fixed categories to accurately detect previously unseen food categories. To address this issue, this paper proposed a zero-shot food image detection method. Firstly, a Transformer-based food primitive generator was constructed, where each primitive contained fine-grained attributes relevant to food categories. These primitives could be selectively assembled based on the food characteristics to synthesize new food features. Secondly, an enhancement component of visual feature disentanglement was proposed in order to impose more constraints on the visual features of unseen food categories. The visual features of food images were decomposed into semantically related features and semantically unrelated features, thereby better transferring semantic knowledge of food categories to their visual features. The proposed method was extensively evaluated on the ZSFooD and UEC-FOOD256 datasets through numerous experiments and ablation studies. Under the zero-shot detection (ZSD) setting, optimal average precision on unseen classes reached 4.9% and 24.1%, respectively, demonstrating the effectiveness of the proposed approach. Under the generalized zero-shot detection (GZSD) setting, the harmonic mean of visible and unseen classes reaches 5.8% and 22.0%, respectively, further validating the effectiveness of the proposed method. © The Author(s) 2024.
引用
收藏
页码:18 / 26
相关论文
共 50 条
  • [41] A zero-shot intrusion detection method based on regression model
    Zhang, Xiao
    Gao, Ling
    Jiang, Yang
    Yang, Xudong
    Zheng, Jie
    Wang, Hai
    2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 186 - 191
  • [42] A transformer-based dual contrastive learning approach for zero-shot learning
    Lei, Yu
    Jing, Ran
    Li, Fangfang
    Gao, Quanxue
    Deng, Cheng
    NEUROCOMPUTING, 2025, 626
  • [43] Visual Language Based Succinct Zero-Shot Object Detection
    Zheng, Ye
    Huang, Xi
    Cui, Li
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5410 - 5418
  • [44] Method for improving zero-shot image classification
    Chen, Xiangfeng
    Chen, Wenbai
    Zhang, Chong
    Lv, Mengyao
    Han, Hu
    JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1688 - 1691
  • [45] Zero-shot Object Detection Based on Dynamic Semantic Vectors
    Li, Haoyu
    Mei, Jilin
    Zhou, Jiancong
    Hu, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9267 - 9273
  • [46] Zero-shot object rumor detection based on contrastive learning
    Chen, Ke
    Zhang, Wenhao
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (09): : 1790 - 1800
  • [47] Vision transformer-based generalized zero-shot learning with data criticizingVision transformer-based generalized zero-shot learning with data criticizingQ. Zhou et al.
    Quan Zhou
    Yucuan Liang
    Zhenqi Zhang
    Wenming Cao
    Applied Intelligence, 2025, 55 (6)
  • [48] EntroCap: Zero-shot image captioning with entropy-based retrieval
    Yan, Jie
    Xie, Yuxiang
    Zou, Shiwei
    Wei, Yingmei
    Luan, Xidao
    NEUROCOMPUTING, 2025, 611
  • [49] Underwater image enhancement based on zero-shot learning and level adjustment
    Xie, Qiang
    Gao, Xiujing
    Liu, Zhen
    Huang, Hongwu
    HELIYON, 2023, 9 (04)
  • [50] Generative Model for Zero-Shot Sketch-Based Image Retrieval
    Verma, Vinay Kumar
    Mishra, Aakansha
    Mishra, Ashish
    Rai, Piyush
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 704 - 713